Coder Social home page Coder Social logo

upgoerfive / phase-1-project-movie-data Goto Github PK

View Code? Open in Web Editor NEW

This project forked from westonshuken/phase-1-project-movie-data

0.0 0.0 0.0 52 MB

Flatiron School Phase 1 Project exploring & analyzing movie data to make recommendations to a new film production studio.

Jupyter Notebook 100.00%

phase-1-project-movie-data's Introduction

Phase-1-Project-Movie-Data

banner

Microsoft Movie Studio Business Analysis

Authors: Madoria Thomas, Gregory Han, Nathaniel Martin, Weston Shuken

Overview

Microsoft wants to launch a new movie studio.

By analyzing historical data, we uncovered data-driven tips for ensuring a the company's production is high-performing with an efficient return on investment. We suggest ideas for genres, runtime, and studio acquisitions.

Business Problem

Microsoft does not know anything about creating movies, yet they want to create a new movie studio.

There are many unknown factors associated with how to make a profit.

  • How long should the films be?
  • What genre should Microsoft produce?
  • Should Microsoft start from scratch or acquire a studio?

Using the data provided and additional resources, we devised a plan to answer these questions about how best Microsoft can break into the movie studio market and ensure profitability.

Data Understanding

We are using movie datasets from 5 different sources:

The data includes information on movie genres, titles, runtime, production expense, foreign & domestic box office gross, production studio name, and release dates.

With these datasets, we will search for insights in box office performance and user ratings compared to various factors like genre, personel, runtime, production budget and release time.

Methods

We analyzed hundreds of thousands of data records across multiple sources to conduct descriptive statistics and visualizations.

However, much of the data was irrelevant to our needs:

  • Dating back to 1915
  • $0 Box Office Gross (likely due to streaming only releases)
  • Extremely low budget films
  • Missing entries

We cleaned, merged, and built data frames that would best suit our analysis for a modern movie studio, interested in movie theater releases, and with a budget sufficient budget for major film productions.

Analysis and Visualisations

We used descriptive analysis to discover trends in the data. We also looked at trends, frequencies and distributions to visualize our findings.

Runtime Frequency

The graph below shows the frequency of movies by runtime given they have a ROI over 50%.

graph1

Return on Investment (ROI) by Genres

We can see that horror films have an extremely high ROI while music and science fiction have the lowest returns. The graph shows the mean return on investment (in percent) for each film genre over the years 2000-2020 from the The Numbers and TMDb datasets.

graph2

graph3

Top Studios by Profit Margin

The graphs below show the top 10 studios by profit margin.

  • Left Graph: Domestic Profit Margin
  • Right Graph: Worldwide Profit Margin

graph4

Movie Profit by Month of Release

The graphs show average profit by month (left) a film was released and average ROI by month (right) film was released. The first set of graphs is specific to only the Horror genre, while the second set of graphs is industry wide data. The averages are computed as the median average of the films released from year 2000 to 2020.

graph5

graph6

Profit Counts Worldwide

The graphs below show counts of a film being profitable or not profitable by genre from 2000-2020.

graph7

Conclusions

This analysis leads to three recommendations for Microsoft when creating a movie studio:

  1. Don't make cliffhangers or movies too long. Runtime for movies should be made between 90-120 minutes; movies with ROI above 50% were made in this range. Avoid producing movie runtimes out of this recommened range improve ROI.
  2. Acquire Neon or Orchard Film Studios. Both Neon and Orchard are ranked highly in profits for both domestic and worldwide markets. Acquiring either one of these two would allow Microsoft to make movies that are competitive in the box office.
  3. Create films with a historically high ROI: Horror (release in June, July, and October), Mystery & Thriller with considerations to low risk films: Adventure & Animation. Adding these genres as part of the production portfolio would garner Microsoft a higher return on investment.

Next Steps

Further analyses could provide additional insights to having a more successful movie studio for Microsoft:

  • Net Profit to Studio Data. As of now we are seeing the net profit of the film at box office but not what profits are returning directly to the studios involved. Having the net profit by film to studio would increase the strength of our analysis of when to release movies.
  • Other expense data. Further inspection to types of expense could also help for your capital budgeting. E.g. marketing expense, distribution expense, etc.
  • Competitors' financial performance. Deeper understanding of competitive landscape — to have organized data on competitors' financial performance would help strengthen our M&A recommendations. E.g. historical market cap data
  • Straight to streaming and licensing. Although the box office is important for new films, much of the data had $0 box office gross which led us to believe some films sell straight to streaming licensing. Exploring data related to streaming and licensing would also provide a realistic prediction of how a film could be exploited.

For More Information

Please review our full analysis in our Jupyter Notebook or our presentation.

For any additional questions, please contact Madoria Thomas, Gregory Han, Nathaniel Martin and Weston Shuken — Collectively known as 'The Four Forks'

Repository Structure

├── data                          <- Both sourced externally and generated from code 
├── images                        <- Both sourced externally and generated from code
├── .gitignore                    <- gitignore 
├── Movie_Recommendations.ipynb   <- Narrative documentation of analysis in Jupyter notebook
├── README.md                     <- The top-level README for reviewers of this project
└── project-slides.pdf            <- PDF version of project presentation

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.