Original TV Shows

Nan(Lainey) Liu
Posted on Mar 8, 2018

Since the streaming service is gradually replacing the traditional TV, streaming companies started to produce their own TV series over the past decent. Netflix seems to be the first and leading original content producer out there, however, Hulu and Amazon took different edges: Hulu focused more on acquiring well-known shows or up-coming star shows to expand their portfolio, and Amazon appears to emphasize on movie production.
I am curious about how does it impact those company. Ultimately, the streaming company wants to know how much money they are able to make through the TV shows. Regardless all the other revenue the companies bring in because of the original(eg: winning awards, merchandise sale), this research solely focus on the popularity of the show to measure if it is successful, therefore bringing more loyal subscribers.  So, I decided to scrape Wikipedia and IMDB to construct my own data. I focused my research on Netflix, Hulu, and Amazon. The following timeline I draw demonstrate some major events that happened in past decade.


After the scaping, it's obvious that the number of original series has been increasing exponentially, however, the IMDB ratings are going down. It doesn't necessary means that the quality of the contents is going down, when we have large enough pool of sample, it's more likely to have TV shows rated in the lower level. Another theory of mine is that Netflix saw success on original content, therefore they don't mind experimenting good ideas for shows. If the show doesn't get as many views as Netflix expected, they simply cancel the show to prevent further cost. After all, it's the hit show matters.

To dig in further into the data, I extrapolate the top ten genres that are rated the highest among the samples and the presence of original series among the world.


Most importantly, what value did those show create? From the graph below, we can tell that: it seems that, after every hit show released, the slope went a little steeper for subscription count.


About Author

Nan(Lainey) Liu

Nan(Lainey) Liu

Nan(Lainey) is a master student at New York University studying Financial Engineering. She is passionate in the applications of machine learning technique in financial industry eg. High-Frequency Trading, Option Pricing. Nan developed a shiny app to research on...
View all posts by Nan(Lainey) Liu >

Related Articles

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 airbnb Alex Baransky alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus API Application artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep Bundles California Cancer Research capstone Career Career Day citibike clustering Coding Course Demo Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day Discount dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Hiring Partners Industry Experts Instructor Blog Instructor Interview Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research Meet the team meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time Portfolio Development prediction Prework Programming PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau team TensorFlow Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping Weekend Course What to expect word cloud word2vec XGBoost yelp