Who actually owns S&P 500?

Posted on Feb 19, 2018

What are some of the biggest firms in United States of America that come to your mind?

Apple? Google? Amazon? and likes right? But have we thought how big these firms really are?

For instance, let's take Apple. According to Investopedia Apple's market capital in 2017, was greater than some of the countries GDPs. (ref)

This actually made me curious to find out who are the investors in Apple and other big companies. And I found out that the distribution of stock ownership showed that most of the share (almost more than avg 70%) was owned by institutional owners.

Institutional owners are  mutual or pension funds, insurance companies, investment firms, private foundations or endowments that own a companies available stocks. Since their ownership is so high, following questions came to my mind -

  • What is the size of the Institutional Holding in any company and in S&P 500 as a whole?
  • Who are the big Institutional Owners?
  • Whether high Institutional Ownership is a sign of good valuation of a company?
  • How does Institutional Holding affect a companies' performance?

To analyze this, I decided to study S&P 500 companies in detail. To collect the needed data points, I used web crawling technologies such as  Scrapy and Selenium in Python to scrape some of the financial websites. Overview of the execution of my project is illustrated in the diagram below. After collection of the data, I used Pandas for data cleaning and Plotly/Seaborn packages to investigate details. My github repo for the project can be found here. The dashboard in plot.ly can be found here.

First graph that I plotted is top 10 Institutional Owners in the individual S&P 500 companies. The dropdown can be used to select the desired company's ticker to see that company's ownership in billion dollars.

Using this graph, when we analyze multiple companies, we can clearly see that the same institutions hold highest ownership in almost all of the companies. Their share size is so high, that I wondered their impact on S&P 500 itself. So I plotted my second graph of top 10 Institutional Owners in S&P 500.



Above graph clearly shows, how some handful institutions almost own most of the S&P500 companies. Their investment share just in S&P 500 is so high, that it I wanted to see what is their total asset under management (their total worth).

In the graph above, I tried to compare AUM of these investment firms and some of the top world GDPs. If we just put together the AUMs of top 5 investment companies, it surpasses  USA's GDP of 19 trillion dollars. The graph illustrates how top 2 firms are bigger, than some countries like India which has sixth largest GDP in the world. Not being from a finance background this fact kind of amazed me.

I further wanted to see which sectors do these large institutes favor for investments. After plotting below graph, I noticed Finance sector is the most popular sector for institutional ownership followed by Health.

I further wanted to analyze how the some of the institutional holdings affect the stock price; if it is Undervalued or Overvalued with increasing percentage in ownership.

The graph is for Institutional Ownership against Price/Book Ratio. If we notice the colors of the dots in the above graph, the colors seem to be darkening as we move to the right on X-axis, which clearly means that as the percentage of institutional ownership increases the stock prices may be over valued or even significantly overvalued.

The next graph was to check how the performance of the company affects the institutional ownership. I used some book values to calculate the the Return on Asset (ROA) of the companies and plotted it against the Institutional Ownership. The distribution in the below graph shows that performance of the company doesn't really affect the institutional ownership. Some of the company even have ROA less than 1 with high ownership.

Conclusion and Future Work : 

Thus, above analysis shows that institutional ownership though not talked about much is completely separate investigation topic. Some of these companies are so big and own such a large ownership in top companies that a small change in the value can affect the stock price and performance of the company.

My next steps would be to actually do some statistical analysis with above data to see the effect of ownership on performance and stock values.

About Author

Related Articles

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 airbnb Alex Baransky alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus API Application artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep Bundles California Cancer Research capstone Career Career Day citibike clustering Coding Course Demo Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day Discount dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Hiring Partners Industry Experts Instructor Blog Instructor Interview Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research Meet the team meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time Portfolio Development prediction Prework Programming PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau team TensorFlow Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping Weekend Course What to expect word cloud word2vec XGBoost yelp