Hut, Hut, Hut, Scrape! Scraping 35 years of College Football Player Statistics

Marc Fridson
Posted on Feb 21, 2017

This serves as the next phase in building my NFL Draft Outcome Prediction Tool. Previously I collected 30 years of NFL Draft History and resulting player outcomes. Scraping college football statistics for those players provides more potential predictor variables for NFL Draft Outcomes. The difficult part is data is not available for all positions and some positions do not have as many years of historical statistics available (Source of statistics: So the scope of the scraping effort was QB, RB, FB, WR, TE (1980-2017); K, P (1990 to 2017). Here are some of the key findings from my tool:

CFB QB Findings:

·       High but not record high College QB Ratings led to the most successful NFL QBs, Every QB

·       Drafted from 1985 to 2007 with a QB Rating of 150 or higher started 5 or more years in the NFL.

·       No strong correlation between College Passing Yards and NFL Success

·       Most Successful QBs averaged between 7 and 9.5 Yard per Attempt

·       Most Successful College QBs threw 70-90 TDs in college

·       No strong correlation between College Interceptions and Success

·       No correlation between College Rushing Yards and Success, but poor Avg. Yards Per Rush does correlate with poor NFL success

Screenshot (27)


CFB RB/FB Findings:

·       College RB/FB ended up at 8 different positions when they got to the NFL

·       >750 Rushing Attempts in College correlates with poor NFL careers or over 4,000 rushing yards

·       Most Successful RBs/FBs average approximately 5 yards per carry

·       Rushing TDs do not correlate with NFL success

·       No correlation between receptions and success

·       Negative correlation between NFL Success and Receiving TDs

 Screenshot (28)

CFB WR/TE Findings:

·       College WR/TE ended up at 9 different positions when they got to the NFL

·       Most successful WRs/TEs played 20-40 games in college

·       Most successful WRs/TEs in the NFL had less than 100 total receptions

·       Most successful WR/TEs had <1250 total receiving yards in college

·       Most successful WR/TEs avg. 10-20 yards per catch

·       Most successful WR/TEs had <10 receiving TDs in college

·       No correlation between scrimmage yards/plays and NFL success

Screenshot (29)

Screenshot (30)

Screenshot (31)

About Author

Marc Fridson

Marc Fridson

In addition to my current participation in the Data Science Academy, I am a Course Designer/Facilitator for Columbia University's Applied Analytics Program and the CEO/Founder of Instant Analytics an analytical technology start-up. Prior to this I was the...
View all posts by Marc Fridson >

Related Articles

Leave a Comment

Google December 11, 2019
Google That may be the end of this post. Right here you will uncover some web-sites that we consider you will value, just click the hyperlinks.
Edmundo August 3, 2017
I enjoy the article
Genevieve July 29, 2017
I like the article
Timothy July 29, 2017
Thanks to the terrific guide
Avatar July 29, 2017
Thanks, it's very informative
Http://Www.Gipsonwvl.Com July 28, 2017
This is really helpful, thanks.
Avatar July 25, 2017
This is truly useful, thanks.
Nate July 22, 2017
Thanks for the wonderful manual
Avatar July 21, 2017
It works really well for me
www.Denverlinux.Com July 20, 2017
Thanks to the great guide
Avatar July 20, 2017
I spent a great deal of time to find something like this
Reuben July 20, 2017
It works very well for me
Avatar July 14, 2017
I spent a lot of time to find something such as this
Www.Gipsonwvl.Com July 14, 2017
Thanks for the terrific article
Avatar July 14, 2017
This is really helpful, thanks.
Cortez July 14, 2017
I spent a great deal of time to locate something such as this
Jorge June 28, 2017
Thanks for the terrific guide

View Posts by Categories

Our Recent Popular Posts

View Posts by Tags

#python #trainwithnycdsa 2019 airbnb Alex Baransky alumni Alumni Interview Alumni Reviews Alumni Spotlight alumni story Alumnus API Application artist aws beautiful soup Best Bootcamp Best Data Science 2019 Best Data Science Bootcamp Best Data Science Bootcamp 2020 Best Ranked Big Data Book Launch Book-Signing bootcamp Bootcamp Alumni Bootcamp Prep Bundles California Cancer Research capstone Career Career Day citibike clustering Coding Course Demo Course Report D3.js data Data Analyst data science Data Science Academy Data Science Bootcamp Data science jobs Data Science Reviews Data Scientist Data Scientist Jobs data visualization Deep Learning Demo Day Discount dplyr employer networking feature engineering Finance Financial Data Science Flask gbm Get Hired ggplot2 googleVis Hadoop higgs boson Hiring hiring partner events Hiring Partners Industry Experts Instructor Blog Instructor Interview Job Job Placement Jobs Jon Krohn JP Morgan Chase Kaggle Kickstarter lasso regression Lead Data Scienctist Lead Data Scientist leaflet linear regression Logistic Regression machine learning Maps matplotlib Medical Research Meet the team meetup Networking neural network Neural networks New Courses nlp NYC NYC Data Science nyc data science academy NYC Open Data NYCDSA NYCDSA Alumni Online Online Bootcamp Online Training Open Data painter pandas Part-time Portfolio Development prediction Prework Programming PwC python Python Data Analysis python machine learning python scrapy python web scraping python webscraping Python Workshop R R language R Programming R Shiny r studio R Visualization R Workshop R-bloggers random forest Ranking recommendation recommendation system regression Remote remote data science bootcamp Scrapy scrapy visualization seaborn Selenium sentiment analysis Shiny Shiny Dashboard Spark Special Special Summer Sports statistics streaming Student Interview Student Showcase SVM Switchup Tableau team TensorFlow Testimonial tf-idf Top Data Science Bootcamp twitter visualization web scraping Weekend Course What to expect word cloud word2vec XGBoost yelp