Econometrics Research Paper

profileTejasvini
projectEx1.pdf

Golf Performance: Key Attributes for a New Age

Introduction:

According to the National Golf Foundation, golf is one of the most popular sports in the United

States. With a participation rate of 8.2%, it ranks just behind tennis as the 7th most popular sport

by TV viewership (Das). Unlike many sports, where athleticism was assumed to be a key

component of success, this realization has been a recent phenomenon in Golf. Seeing

photographs of Ben Hogan or Arnold Palmer smoking while considering a shot reinforces this

opinion or John Daly’s admission: "I went in the locker room and downed like five beers, and I

think I shot 4 under on the front nine.” (Torres)

That attitude has now changed. Golfers today are noticeably fitter, leaner and more athletic in

appearance to even the most casual observer. In 1997, John Daly became the first golfer on the

PGA Tour to average more than 300 yards per drive. That year, he drove the ball 30 yards longer

than the average Tour player. Now, 22 years later, the entire PGA Tour averages 295.3 yards off

the tee — the longest average ever. In 1980 (the first year that the PGA Tour’s driving distance

stats are available), Dan Pohl led the field while averaging 274.3 yards per drive. The Tour

average was a meek 256.89 yards (Wilco).

What this paper seeks to answer is: with the geometry of golf changing with longer driving

distance, what determines the success of a professional golfer? The common golf phrase “drive

for show and putt for dough” is thought of as gospel. This paper will take it further and answer

the questions: does making one-putts on the last day of tournament demonstrate a “killer factor”

of being able to perform when the money is on the line?” Or, does the golfer who averages

longer distance on one-putts indicate higher skill level on the green?

Golf Performance: Key Attributes for a New Age

3

Secondly, we want to answer questions such as “if a pro golfer could improve driving distance

by 10 yards, what effect would that have on his average score? How will it affect prize money?

The interest in golf models should be reignited as the PGA tour commissioner recently

announced that the PGA will allow legalized gambling on-site as soon as the 2020 season

(Leonard).

Literature review:

The change in golf driving distance has been controversial. This increase has led to calls for

changes in ball and club regulation. The PGA of America has resisted such requests.

“Based on the information we have seen, we are highly skeptical that rolling back the golf ball in whole or part will be in the best interest of the sport and our collective efforts to grow the game.”

PGA of America CEO, Pete Bevacqua responding to concerns on technology’s role in changing the distance increases in golf shots. (Gray)

The changes to golf due to increased driving distance were studied by Alexander and Kern who

first challenged the adage of “drive for show, putt for dough”. They argued that the game of the

21st century is all about power, citing the fact that in 2001, the players listed in the top ten in

driving distance on the PGA tour earned 30% more in total prize money than the 10 best putters

on the tour. Their study added limited support to this idea (2, 59).

Numerous papers have explored the components that determine golf score, commonly using

standard golf shot types in a model to determine season-ending average score or season ending

Golf Performance: Key Attributes for a New Age

4

earnings. Callan and Scott determined that a player’s season-ending score can be illustrated as:

The variable names are described on page five and six except for the following:

1. RANK is the average ranking for a golfer in the season. 2. SCORE is the average score throughout all tournament rounds. 3. EVENTS is the number of tour events entered in the given season, EARN is the SEASON

ending tournament earnings. 4. TOURNC is the number of tournaments completed (7).

This model is representative of much of the research on the subject with Finley and Halsey

showing that GIR, scrambling, distance, driving accuracy, sand saves, “bounce back” (how

effectively a golfer follows a bogie hole with a birdie hole) and putts-per-round provided an 𝑅"

value of over 94% (1104). This is intuitive. The more effectively that a golfer can move from the

tee-box to the green in the fewest strokes, the more likely that golfer would be able to score well.

The most radical, and arguably effective measure was introduced in the book “Every Shot

Counts” where the author had professional and amateur golfers mark their score cards with their

exact location on the course (now performed with lasers), and developed a “strokes gained”

statistic that measures the number of strokes that are gained from a statistical baseline. His “SG

approach” measurement is now included in PGA official statistics and will be used for this study.

Interestingly, the first chapter in his book is titled “putting is overrated” (Brodie, 17,18).

Golf Performance: Key Attributes for a New Age

5

Heiny argued that if a player hits the ball far enough, driving accuracy is not an important

statistic, citing the rise in p-value of driving accuracy from .1193 in 1992 to .3959 in 2003. He

quoted tour pro Hal Sutton’s 2003 comments: “If you were to ask everyone out there if you

wanted distance or accuracy, they would say distance.”(18).

Methodology: Data was obtained from the PGA tour statistics website for the 150 top money winners in the

2019 season. Variables were selected based on domain knowledge and to prevent perfect

collinearity (such as putts over 10’ and putts under 10’). Below are summary statistics of the

data:

Variable Description

1. Score was used as the dependent variable as a proxy for earnings (78% correlation)

because tournaments paid different levels of prize money and several golfers only

competed in one or two tournaments as they primarily play on the European tour.

2. Money is the official amount of tour winnings for the 2019 season.

3. Avg. Approach is the “strokes gained” statistic that measures the number of strokes that

are gained from a statistical baseline on approach shots.

4. Distance is driving distance in yards.

5. GIR is the measure of greens-in-regulation. The percentage of time that a golfer gets on

the green with two putts to make par.

Golf Performance: Key Attributes for a New Age

6

6. OneputtR4 is the “killer factor.” Percentage of time that a player sinks his first putt on

the green on the fourth day of the tournament.

7. Percmade is the percentage of holes that the player sinks his first putt.

8. Rounds is the number of rounds played in the 2019 season.

9. Scbperc is the percentage of time that a player misses the green in regulation, but still

makes par or better.

10. Daccuracy is the percentage of time a player hits his first shot (drive) into the fairway.

11. Puttdist is the distance to the average made putt in inches.

12. Sandsv is the percentage of time that a player lands in a sand trap and still saves par.

13. Puttavg is the average number of putts that a player has taken on each hole in the 2019

season.

We used score as a proxy for prize money. Money was not used as an independent variable in

our model of average score because score has the strongest effect on prize winnings. However,

the high level of correlation (-.77) suggests this was an acceptable substitute.

Golf Performance: Key Attributes for a New Age

7

After the data was gathered the scoring data followed a normal distribution and had of standard

deviation of .5681 strokes.

To further explore the data, Pearson correlation coefficients were developed and are listed in the

table below. High levels of multicollinearity were seen. However, this was expected as measures

such as driving accuracy and driving distance would show a strong negative correlation. The

interesting thing is the relationship to score is much higher for driving distance (-.252) than

driving accuracy (-.118) from a simple correlation viewpoint.

Golf Performance: Key Attributes for a New Age

8

Correlation Matrix

Golf Performance: Key Attributes for a New Age

9

An initial linear regression model was run on the full set of variables:

After running the data through the model, only the variables Scbperc, puttavg, Daccuracy,

Sandsv, distance and Avgapprch appeared significant at the 1% level. The model exhibited an 𝑅"

of .8071, an adjusted 𝑅" of .7909 and a p-value of 2.2e-16. The model was highly significant,

showing that a substantial portion of the variation in scores could be explained by the

independent variables.

A new model was run excluding the insignificant variables. This model exhibited an 𝑅" of .803,

an adjusted 𝑅" of .7943 and a p-value of 2.2e-16. This adjusted 𝑅" value was a slight

improvement

Golf Performance: Key Attributes for a New Age

10

An F-test was then run on the variables that appeared to be insignificant in the unrestricted model

to see if they were jointly significant. Using ANOVA, they were found to be jointly insignificant

with a p-value of 0.7285.

Insignificant variables were then tested one at a time for impact on score to test for individual

significance. GIR (greens in regulation) proved significant individually with a p-value of 1.11e-

06, as did permade (percentage of one-putts made). These two variables were added back into

the model and run with the results of:

Hence, this improved the model slightly.

Golf Performance: Key Attributes for a New Age

11

A test was then conducted for heteroskedasticity using the White Test. With a p-value of .5191

there was no reason to reject the null hypothesis of homoskedasticity or to report robust standard

errors.

Finally, an examination of the effect of regressing the natural log of score was performed and

showed virtually no change.

The variables Percmade and GIR added back into the model. This provided the optimal model,

which can be expressed as:

Score = 71.757 - .00166GIR - .07052Scbper - .03371Daccuracy + 9.066puttavg (SE) (5.640) (.01859) (.01122) (.00720) (2.1153)

- .01669Sandsv - .03561Distance - .657047Avgapprch + .02822percmade (.00457) (.00416) (.07429) (.03386)

Two surprises appeared in the final model. First, the positive relationship between percentage of

one-putts made and score. Intuitively, one would assume that if you sink more of your first putts

you would have a lower score. The explanation for this could be that a player who is making a

higher percentage of one-putts is taking more strokes to reach the hole and is thus, much closer

on his first putt since he was closer on most recent approach shot. The other surprise is the

relative strength of the variable Avgapprch. As discussed in the literature review, this measure is

a new statistic measuring approach shots from a statistical locational baseline versus the field.

Golf Performance: Key Attributes for a New Age

12

With the model developed, the prediction accuracy was then tested. The actual values fell within

the 5% confidence intervals which equates to approximately one stroke.

As a final check, the residuals were reviewed to ensure the normality requirement was

maintained.

Conclusions: The results of this analysis lead to several important conclusions. Most importantly, we have

answered the question proposed at the beginning of this document: What key skills determine

the success of a professional golfer? Those are the skills in the final model.

Golf Performance: Key Attributes for a New Age

13

1. Driving distance and driving accuracy have a similar effect on average score with

coefficients of -.034 and -.036 respectively.

2. Scrambling percentage (the percentage of time that a player misses the green in

regulation, but still makes par or better) has more than twice the impact on score than

either driving distance or driving accuracy. This metric has not shown this level of

strength in previous studies, and could be a reflection of how longer driving distance

leads to lower driving accuracy requiring higher skill in scrambling to make up for the

additional out-of-fairway shots

3. The expected “killer factor” that is often discussed on television of the golfer with laser-

focused putting skills on the last day of a tournament (one putt in round four) is not

significant on either average score or money earned.

4. The “strokes gained” measurement statistic from Brodie, is a strong tool for determining

golf success, and is a strong addition for future modelling. Like the scrambling measure,

the level of approach efficiency is very impactful on performance.

5. At the highest level of golf, marginal changes are difficult to achieve. The narrow standard

deviation levels reflect that. While it might be tempting to recommend to a golfer to

practice enough to take a half stroke off of his putting average, that can be akin to asking

an Olympic sprinter to “just” take a second from his 100-meter dash time.

6. The model is very accurate in capturing the amount of variation expressed in a golfer’s

average score by the independent variables (approximately 80%)

Golf Performance: Key Attributes for a New Age

14

7. The model is also very accurate in predicting scores with the information contained in the

independent variables-within approximately one stroke at the 5% confidence level.

With the information in this paper and earlier studies, the obvious question is “why don’t players

concentrate on approach shots and putting instead of trying to always hit the ball so hard?” It is

fair speculation to say that good putters don’t have fan appeal, groupies and large followings on

the course. People love to see power.

We discussed in the literature review section the Alexander and Kern (2005) paper which

proposed that the 21st century golfer was all about power, and that in 2001 the top ten drivers

earned 30% more than the top ten putters. The 2019 numbers show the opposite. In 2019, the

top ten putters earned 31% more ($2,298,662) than the top ten drivers ($1,600,195).

Based on this study, what do we tell the young golfer who thinks that a new driver will add ten

yards to his drive and be a game changer? Tell him to expect a drop in his average score of a

third of a stroke. What do we tell the pro golfer who finds a new putter that can take a half a

stroke from his putting average? Tell him to find a new tax attorney. He is about to become a

wealthy man.

Golf Performance: Key Attributes for a New Age

15

Further Study: This analysis is simply a starting point. Additional research could be done to determine the odds

of a player winning based on a logistical regression (bet/no bet) model. I would like to take this

model to a point of accuracy that would achieve a higher probability of predicting winners than

the casinos’ service charge, which requires a win rate of 52.4%. A logical next step would be to

take the PGA courses and determine the key course attributes (such as narrow fairways or long

holes) and match those to the strengths of a particular golfer. A classification tree could be used

to determine the sub groups who’s strengths match a particular course.

.

Golf Performance: Key Attributes for a New Age

16

Bibliography

1. Alexander, Donal and Kern, Willima. “Drive for Show and Putt for Dough?” Journal of

Sports Economics, Volume 6 (1) 15-Feb 1, 2005.

2. Brodie, Mark. N. “Every Shot Counts” Gotham Books 2014. Print

3. Callan, Scott J, and Janet M. Thomas. “Modelling the Determinants of a Professional

Golfer’s Tournament Earnings: A Multiequation approach” Journal of Sports Economics,

Volume 8 (4) August 1st, 2007 https://doi.org/10.1177/1527002506287697

4. Das, Sourav. “Top 10 Most Popular Sports in America 2019” sportsshow.net, November

25, 2019 https://sportsshow.net/most-popular-sports-in-america/

5. Finley, Peter S., and Jason J Halsey. “Determinants of PGA Tour Success: An Examination

of Relationships among Performance, Scoring and Earnings.” Perceptual and Motor Skills,

Volume 98, 1100-1106

6. Gray, Wil. “PGA of America “sceptical” on rolling back the ball”; TheGolfChannel.com,

March 5, 2018 https://www.golfchannel.com/news/pga-tour-pga-america-issue-statements-study

Golf Performance: Key Attributes for a New Age

17

7. Heiny, Erik. “PGA Tour Pro: Long, but Not so Straight.” Chance; Volume 21(1)-Feb 15,

2009

8. Leonard, Todd. “PGA Tour goes all in on Legalized Gambling.” San Diego Union

Tribune.com, November 30, 2019.

9. Torres, Aaron. “John Daly admits he once drank five beers in the middle of a PGA

round.” FoxSports.com November 15, 2016

10. Wilco, Daniel. “How driving distance has changed over the past 40 years on the PGA

Tour.” PGATOUR.COM, July 3, 2018