Data Study NBA Stats (1947-2015)

Posted on May 14, 2016
The skills the author demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.


This has been an exciting season in the NBA. Throughout the season, the headlines on the NBA site read “Warriors having the best start to a season”, “Lebron James becoming the youngest player to hit the 26,000 points milestone”, “Russell Westbrook having the most triple-doubles in the past 50 years”, “Steph Curry shooting over 400 3-pointers”, and of course the “Golden State Warriors obtaining the best winning ratio of a season”. And there were many more. This lead me to create a Shiny app see what to other data records are there that can be broken in the future.

Data Set

I located a dataset from Putdat that had NBA stats from 1947 – 2015. The dataset contained all the teams that ever existed from 1947 even though the team was discontinued. However the stats of the teams that changed names or changed location were preserved under the new name.


Stats Leader

The first tab of the app is the Stats Leaders. This section is where we can find the overall stats of the NBA. The graph below shows the teams that won at least one championship. The Boston Celtics won an astounding 17 championships (eleven of them was during Bill Russell’s career). So far 17 out of the 30 teams in the current NBA won at least one championship. That leaves 13 teams to win their first NBA championship and break the streak.


Data Study NBA Stats (1947-2015)


The Wins graph (below) shows the top ten records of teams from 1947 through 2015 in both the seasons and the playoffs. The first graph shows the top ratios of wins to total games and using the slider, we can adjust the range from one to ten (five is shown in the screenshot). For instance (before the Golden State Warriors stunning record of 73-9), the best record was the Chicago Bulls record of 72 wins and 10 losses for a winning percentage of 88% in 1996. The Bulls also managed to maintain the best winning ratio in the season for the following year.


Data Study NBA Stats (1947-2015)


Average PPG

On the next tab, Team, team stats are also available for both the season and the playoffs. Sticking with our Bulls’ example, we get these two graphs:


Data Study NBA Stats (1947-2015)




The first graph (red bar graph) shows the points per game every year since the start of their franchise. It is interesting to see that the Bulls’ best record in ’96 was not their highest point performance. This can apply that they had a great balance between offense and defense that helped them win that many games.


The second graph shows the winning ratio split into home and way games. This is the Bulls’ season winning ratio. One last point I want to make about Bulls is that after the Michael Jordan era, the Bulls’ faced a sharp decline in points and wins in general. It was not until the signing of Derrick Rose that some light started shining in the windy city. It is actually interesting to see that majority of the teams win more home games than away. This just proves that fans are part of the team. There are definitely a few exceptions. The Timberwolves had an away winning percentage of 32% and their home was 27%.


With more time, I will be able to create more graphs that demonstrate more current records. Also it was interesting to see trends through the graphs. When I saw them, I wanted to know why this was the case or which player made an impact that year. For instance I saw that Bulls had a phenomenal record in the year 1996. Not only would I want to be able to click on the bar for 1996 and see Jordan, Pippen, and Rodman along with their respective stats. I would also like to find a dataset with individual stats and include them into this Shiny app.

About Author

Taraqur Rahman

During his career as a Sales Associate, Taraqur analyzed data to help support both the sales and marketing teams. Seeing through his own eyes how much data can influence decisions, Taraqur joined NYCDSA as a data scientist in...
View all posts by Taraqur Rahman >

Related Articles

Leave a Comment

No comments found.

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 2020 Revenue 3-points agriculture air quality airbnb airline alcohol Alex Baransky algorithm alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus ames dataset ames housing dataset apartment rent API Application artist aws bank loans beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep boston safety Bundles cake recipe California Cancer Research capstone car price Career Career Day citibike classic cars classpass clustering Coding Course Demo Course Report covid 19 credit credit card crime frequency crops D3.js data data analysis Data Analyst data analytics data for tripadvisor reviews data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization database Deep Learning Demo Day Discount disney dplyr drug data e-commerce economy employee employee burnout employer networking environment feature engineering Finance Financial Data Science fitness studio Flask flight delay gbm Get Hired ggplot2 googleVis H20 Hadoop hallmark holiday movie happiness healthcare frauds higgs boson Hiring hiring partner events Hiring Partners hotels housing housing data housing predictions housing price hy-vee Income Industry Experts Injuries Instructor Blog Instructor Interview insurance italki Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter las vegas airport lasso regression Lead Data Scienctist Lead Data Scientist leaflet league linear regression Logistic Regression machine learning Maps market matplotlib Medical Research Meet the team meetup methal health miami beach movie music Napoli NBA netflix Networking neural network Neural networks New Courses NHL nlp NYC NYC Data Science nyc data science academy NYC Open Data nyc property NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time performance phoenix pollutants Portfolio Development precision measurement prediction Prework Programming public safety PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R Data Analysis R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn seafood type Selenium sentiment analysis sentiment classification Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau teachers team team performance TensorFlow Testimonial tf-idf Top Data Science Bootcamp Top manufacturing companies Transfers tweets twitter videos visualization wallstreet wallstreetbets web scraping Weekend Course What to expect whiskey whiskeyadvocate wildfire word cloud word2vec XGBoost yelp youtube trending ZORI