Data Analysis on the Driving Techniques (PGA Tour)

Posted on Dec 17, 2020
The skills I demoed here can be learned through taking Data Science with Machine Learning bootcamp with NYC Data Science Academy.

Introduction

To be the best on the PGA Tour in the modern era, the data show you have to be among the best drivers, and increasingly so. Let's explore why that is the case. I got the idea from a remark a golf commentator made during this year's US Open, one of the four major golf tournaments and notorious for being played on some of the most challenging golf courses in the country.

The remark was, "If I had a kid right now into golf, I would tell them to swing as hard as they can." That was counter-intuitive to what I understood about golf, a game that, to me, was all about balancing power and precision: swing as hard as you can while still hitting the sweet spot of the ball. Isn't that the beauty of precision? A boxer landing a pinpoint punch can be as effective as one throwing a haymaker, and a more graceful racecar driver is faster over many laps than the aggressive, edgier driver who will make more mistakes.

It was clear that the announcer's claim was an earnest reflection on the state of the modern PGA Tour, home to 200 of the best golfers in the world. The tour does feature a broad range of athletic frames, from Abraham Ancer, a sharpshooting Mexican standing 5'7" and 155 lbs, to Bryson DeChambeau, an imposing 6'2" and 245 lbs, whom one could mistake for a middle linebacker.

 

Winged Foot Golf Club

It was at this tournament, the US Open at Winged Foot, that Bryson left a heavy impression on the game, not only winning but winning by a wide margin. Winged Foot had incredibly unforgiving rough, meaning that in order to score even remotely well you had to keep the ball in the fairway, or so people thought. Bryson tore through the rough as he tore through the competition, ignoring the fundamental rules of the historic course and the sport itself.

Bryson is just another exclamation point on a trend in golf that was inspired by Tiger Woods in the early 2000s. Make no mistake, Tiger was not only one of the longest hitters of the ball; he also had an accurate mid-range game and was one of the best putters on top of it. No better golfer has played the game, period. The 2010s turned the trend up a notch with Bubba Watson, Dustin Johnson, Rory McIlroy, and Brooks Koepka, to name a few.

It is important to note before we move forward that there are also several examples of golfers who have succeeded over their heavier-hitting rivals. Jordan Spieth, Rickie Fowler, and Justin Thomas have all proven that control, touch, and discipline are not lost arts in the game of golf. However, it becomes clear during our analysis that not only is hitting the ball further an advantage, but that advantage has grown over time.

Hitting the ball further may seem like an obvious advantage, but to truly quantify it we will introduce the idea of Strokes Gained. Before that, let's establish that hitting the ball further in golf IS NOT necessarily advantageous.

[Figure: three simulated drives comparing a long hitter and an accurate hitter]

In the example above, note that although the long hitter hit the drive further than the accurate hitter in each of the 3 simulations, in 2 of the 3 the long hitter ends up in a worse position, either taking a penalty for hitting into the water or landing in hazardous conditions to the right.

Moving on to Strokes Gained: according to the PGA Tour website, "The Strokes Gained concept is a by-product of the PGA TOUR's ShotLink Intelligence Program, which encourages academics to perform research against ShotLink statistical data. Professor Mark Broadie from Columbia Business School developed the early concept which was later refined by the TOUR." It is defined as "the per round average of the number of Strokes the player was better or worse than the field average on the same course & event."
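To make the quoted definition concrete, here is a minimal Python sketch of a per-round average against the field. The function name and all of the scores are made up for illustration; they are not ShotLink data, and the TOUR's actual calculation may differ in its details.

```python
from statistics import mean


def strokes_gained_per_round(player_scores, field_averages):
    """Average number of strokes the player beat the field by, per round.

    Positive means better than the field (fewer strokes), negative means worse.
    Hypothetical helper following the quoted definition, not an official formula.
    """
    if len(player_scores) != len(field_averages):
        raise ValueError("need one field average per player round")
    # In golf a lower score is better, so the gain is field minus player.
    gains = [field - player for player, field in zip(player_scores, field_averages)]
    return mean(gains)


# Made-up scores for four rounds on the same course and event.
player_scores = [68, 70, 71, 69]
field_averages = [71.0, 71.5, 70.5, 71.0]

sg = strokes_gained_per_round(player_scores, field_averages)
print(f"Strokes Gained per round: {sg:+.2f}")
```

Note the sign convention: because fewer strokes is better, a player who shoots under the field average ends up with a positive number.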

If that is unclear, let me illustrate first with an example of how hitting the ball further CAN be advantageous. 

[Figure: example in which the longer drive pays off]

In the example above, being aggressive paid off (high risk, high reward). Now we move to a situation in which high risk turned into punishment instead.

[Figure: example in which the longer drive is punished]

In this example, although the long hitter hit the ball further, the ball landed in hazardous conditions, and the player ended up taking 1 more shot to get to the green than the accurate hitter. In terms of Strokes Gained, the long hitter has -1 Strokes Gained; in other words, the long hitter has lost a stroke.
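The arithmetic behind that -1 can be sketched in a few lines of Python. The shot counts are assumptions chosen only so that the long hitter needs one more shot than the accurate hitter, as in the example; the accurate hitter stands in for the reference the long hitter is compared against.

```python
def strokes_gained_vs(reference_shots, player_shots):
    """Strokes gained relative to a reference: positive if the player needed fewer shots."""
    return reference_shots - player_shots


# Assumed shot counts to reach the green (illustrative only).
accurate_hitter_shots = 2
long_hitter_shots = 3  # one extra shot to recover from the hazardous lie

long_hitter_sg = strokes_gained_vs(accurate_hitter_shots, long_hitter_shots)
print(long_hitter_sg)
```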

Now that we understand the dynamics in play and Strokes Gained, we can answer the question: do longer hitters of the ball reach the green quicker than accurate hitters on average? The answer is simply yes. Moreover, the evidence suggests that this advantage has grown stronger over time (2004-2018).

This is a chart of the top 10 drivers each year, showing how the total Strokes Gained they were able to capture grew over time.

The chart shows a strong correlation (r = 0.84) between the year and the total Strokes Gained, evidence that hitting the ball longer matters now more than ever.
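For readers who want to reproduce this kind of figure, the following is a minimal sketch of a Pearson correlation between year and total Strokes Gained in plain Python. The yearly totals are placeholder values invented for the example, not the data behind the chart, so the result will not match the r reported above.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length series with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sums of co-deviations and squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Placeholder values only: NOT the real totals for the top 10 drivers.
years = [2004, 2005, 2006, 2007, 2008]
total_sg_top10 = [5.0, 5.4, 5.1, 5.9, 6.2]

r = pearson_r(years, total_sg_top10)
print(f"r = {r:.2f}")
```

A value near +1 means the totals rise steadily with the year, which is the pattern the chart describes.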
