Media Industry: A Textual Analysis of Al-Jazeera English
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.
The media industry is a multi-billion dollar enterprise, consisting of a number of small-time producers of content through to large media conglomerates. In this project, I began the process of a mock market survey for Al-Jazeera English's Opinion's column. I scraped such information as date of publication, title, header and caption text, author and the body text of the articles to inform my analysis.
Some broad conclusions that I gathered: based upon word frequency analysis, topics pertaining to specific countries, such as the United States, Israel and Iran, featured heavily. In addition, discussion of presidential politics (especially in the US) featured heavily within the US category. The relative frequency of publication did vary by day of the week, though subsequent analysis and cross comparison with social media outreach studies will be necessary to ascertain if Al Jazeera is reaching its target audience optimally.
In future work, I will want to improve the webscraping of the body text.