An Analysis of Carbon Emissions from College Football Recruiting Visits

Authors: Jeffrey J. Fountain1, Thomas Wuerzer2, & Peter S. Finley1

1Department of Management, Nova Southeastern University, Fort Lauderdale, FL, USA

2Department of Public Administration and Real Estate Development, Nova Southeastern University, Fort Lauderdale, FL, USA

 

Corresponding Author:

Jeffrey J. Fountain, Ph.D.

3301 College Avenue

Fort Lauderdale, FL 33314

[email protected]

954-262-8129

Jeffrey Fountain, Ph.D., and Peter Finley, Ph.D., are Professors of Sport Management at the H. Wayne Huizenga College of Business and Entrepreneurship at Nova Southeastern University. Their research interests focus on issues in college athletics.

Thomas Wuerzer, Ph.D., is Professor in the Department of Public Administration & Real Estate Development at Nova Southeastern University. His research focus is on Geographic Information Systems.

ABSTRACT 

Recruiting college football players to come play for a National Collegiate Athletic Association (NCAA) Power-5 school is highly competitive, with each school inviting recruits nationwide on official campus visits. By estimating the carbon emissions generated, this study examined the environmental impact of official recruiting visits (n = 7,045) to Power-5 schools from 2013 to 2020. Using Geographic Information Systems (GIS) to geocode recruits’ hometowns and calculate travel distances, a Recruit Visit Carbon Footprint (RVCF) was calculated to approximate the CO2eq emissions for each visit. The analysis focused on the 23 Power-5 schools with over 250 reported official visits. The findings revealed substantial variability in RVCF among schools, with 15 of the 23 schools increasing their carbon footprint in the latter years of the study. Still, the higher-spending athletic departments tended to have lower RVCFs. The findings provide valuable insights into the environmental impact of recruiting activities and highlight the importance of addressing this overlooked aspect of college sports’ carbon emissions.

KEYWORDS: Carbon Footprint, Power-5, Recruiting, Official Visit, College Football

INTRODUCTION 

As societal awareness of the environmental impact of both mega sporting events and routine contests (regular season games) has increased, many sports organizations, teams, and sponsors have come to understand the need to assess the carbon footprint they create (10). As noted by Dolf et al. (13), several researchers have stressed that sports events are worth investigating to leverage broader change because of the high-profile nature of such events, because they are capable of creating real and meaningful action (11, 19). Several athletic departments have promoted their initiatives throughout the last decade and publicized their efforts to reduce and offset their environmental impact by tracking and reducing carbon dioxide-equivalent emissions (CO2eq). The typical path toward claiming to be carbon neutral for college athletic departments is to assess the environmental impact of the day-to-day operations, home game operations, and off-campus travel for official tournaments and games. However, it is important to recognize that the carbon footprint begins long before sporting events are played; for college sports, this goes back to the initial recruitment phase of the athletes, which typically requires them to travel as part of the recruiting process.

In 2020, the Power-5 conferences included the Atlantic Coast Conference (ACC), Big 12 Conference, Big Ten Conference, Pacific 12 Conference, and the Southeastern Conference (SEC). Over the years, the number of Power-5 schools increasing their investment in recruiting athletes has grown, with 38 of the 52 public Power-5 schools reporting a significant growth in overall athletic department recruiting expenditures (37). One extreme example was the University of Georgia’s athletic department, which increased its overall recruiting budget from $308,000 in 2005 to $4.5 million by 2022 (23).

Recruiting

Each recruit is permitted one official visit per school, extendable only if there is a change in the coaching staff, with each visit lasting no more than 48 hours or one weekend (29). Visits are classified by the funding source; when the host school covers expenses such as transportation, lodging, meals, and entertainment for the recruit and their parents or guardians, it is deemed an official visit (29). Historically, recruits were limited to five official visits; however, this cap was removed in 2023, allowing unlimited visits while maintaining the “only one visit per school” rule (30).

College football recruiting visits often feature expensive, extravagant events designed to attract recruits (12, 24, 36). The financial commitment to a recruiting weekend at Clemson University in the fall of 2019, during which the Tigers brought eleven prospects to campus (they would eventually sign ten of them), ended with a total bill of $85,000 (32). While the NCAA prohibits media from attending recruiting events or interacting with prospects while on campus, the expenditures from that weekend provided insight into the itinerary, which included travel by professional car service to and from local airports, flights to Greenville-Spartanburg, and transportation to the campus, about 40 miles away. In addition, two charter buses were used to transport prospects and their families to the finest restaurants in the area, including a steakhouse about 45 minutes from campus (32). Another example was the University of Texas spending over $280,000 during a single weekend in June 2022 to host nine recruits, including highly touted quarterback Arch Manning (20).

Carbon Footprinting

The concept and measurement of an “ecological footprint” was introduced by Wackernagel and Rees (34) and originally quantified the land and sea area necessary to support human populations. Subsequent adaptations of this concept have focused on the “carbon footprint,” which estimates the land required to sequester CO2 emissions attributable to human activities. This notion has evolved into broader assessments such as the “life cycle impact,” which evaluates the environmental impact of products and services throughout their life cycles (31).

Research by Čuček et al. (9) and Pandey et al. (31) expanded the scope of assessment to include calculating sustainability metrics and measuring energy, water, and ecological impacts. These studies defined a carbon footprint as “the quantity of Greenhouse Gases (GHGs), expressed in terms of CO2 equivalents, emitted by an individual, organization, process, product, or event within a specified boundary” (31) and as “a quantitative measurement describing the appropriation of natural resources by humans,” (9). This study adopted these definitions to evaluate the carbon footprint of prospective college football players while making their official recruiting visits to college campuses.

Attempts to measure carbon footprint related to sports have historically focused on major events and the travel of sports teams. Examples include the findings that approximately 560 tons of CO2eq was created during the 2004 Football Association (FA) Cup Final in the United Kingdom (one soccer game) (4), 1,260 tons of CO2eq for the 2004 Wales Rally (an Autosport’s event over four days) (5), and 144,120 tons of CO2eq for the stages of the Tour de France held in the United Kingdom in 2007 (the Prologue and Stage One) (6). Most studies focused solely on the carbon footprint of spectators, though a limited number of studies examined participants, such as teams and staff members.

The environmental impact of all college activities, including collegiate sports has garnered significant attention (28). However, there appears to be no available research that has explicitly focused on the environmental impact (carbon footprint) produced throughout the college football recruiting season. Therefore, the researchers sought to explore and determine the approximate carbon emissions produced during official college football recruiting visits from Power-5 schools. This study utilized the reported official recruiting visits between 2013 and 2020. Using Geographic Information Systems (GIS) to conduct spatial analysis of multimodal travel, including car and plane trips, the researchers were able to calculate the approximate travel distances and corresponding carbon footprint of each recruit.

The Recruit Visit Carbon Footprint (RVCF) was created as a proxy measure utilizing prior carbon footprinting research of sport tourism. This approach enabled a systematic exploration of three primary research questions.

RQ1: Which Power-5 schools generated the largest RVCF between 2013 and 2020?

RQ2: Did RVCF totals increase or decrease over time?

RQ3: Was there a correlation between each school’s financial, recruiting, and performance variables and their RVCF?

METHODS 

Data Collection

Data on official recruiting visits, published by 247sports.com, was collected for the years 2013 to 2020. Previous research has utilized data from 247sports.com, recognizing it as a well-established source of college football recruiting information (21, 27, 35). The dataset included dates of official school visits and recruits’ hometowns. Prior research also utilized GIS to geocode locations such as athletes’ hometowns or high school locations for analysis (1, 26, 38). GIS geocoding takes a specific location, such as addresses or towns, and references it as a mapped location. Therefore, this study geocoded each football recruit’s hometown, the location of each college visited, and the closest major airport to calculate the approximate travel distances for spatial analysis.

The study utilized ESRI ArcPRO 3.5 (Esri, Redlands, CA, USA) software with the Business Analyst extension to geocode the dataset. To focus on the highest-producing RVCF programs and to make the data set more manageable, a minimum threshold of 250 visits was established. Of the 64 Power-5 schools, 23 (35.9%) met the 250-visitor threshold, totalling 7,045 reported official visits. The travel routes for each visit were then calculated using GIS to determine the most efficient mode of travel. Driving directly to the school was the most efficient mode for 1,636 visits. Typically, these distances were around 200 miles or less to the campus. For recruits living over 200 miles from the visiting campus, if their distance from their home to an airport necessitated a long drive followed by a flight, driving was deemed more efficient due to the extensive travel time involved in flying to the campus. For the remaining 5,409 visits, air travel was deemed the most efficient mode. For these visits, three travel distances were calculated: 1) the drive from the recruits’ hometown to the nearest major airport, 2) the flight miles from that airport to the nearest major airport to the campus they visited, and 3) the drive from that airport to the campus. These distances were doubled to account for the return trip and integrated into a travel matrix to approximate CO2eq emissions from transportation.

Additionally, financial data for athletic departments (i.e., Football Revenue, Football Recruiting was sourced from the Knight-Newhouse College Athletics database (25), team performance was sourced from ESPN.com (16). The descriptions and summary statistics for these variables are provided in Table 1. Utilizing these variables allowed for additional analysis to explore potential correlations between an athletic department’s RVCF and financial data, performance data, and recruiting data.

Table 1 Descriptive Analysis of Variables: Mean and Standard Deviation
VariableDescriptionMeanSD
FB_TotalRevTotal Revenue from Football$66,518,526$25,205,244
Mens_TotalRevTotal Revenue from all Men’s Sports (including Football)$84,428,967$25,300,581
FB_MensRev%Football’s Revenue as a Percentage of all Men’s Sports Revenues77.40%11.17%
Dept_TotalRevTotal Revenue from the entire Athletic Department$125,143,966$31,108,327
FB_DeptRev%Football’s Revenue as a Percentage of the entire Athletic Department Revenues52.50%13.10%
Mens_RecruitExpTotal Recruiting Expenses from all Men’s Sports (including Football)$1,391,362$704,861
Dept_RecruitExpTotal Recruiting Expenses from the entire Athletic Department$1,878,962$855,080
FB_OpsExpTotal Operation Expenses for Football$5,683,499$2,558,649
Mens_OpsExpTotal Operation Expenses for all Men’s Sports (including Football)$8,800,193$4,035,500
Dept_OpsExpTotal Operating Expenses for the entire Athletic Department$12,787,529$5,068,156
FB_TotalExpTotal Expenses for the entire Football Program$33,846,192$11,218,516
Mens_TotalExpTotal Expenses for all Men’s Sports Programs (including Football)$53,035,310$13,927,935
FB_MensExp%Football Expenses as a Percentage of all Men’s Sports Expenses63.18%7.58%
Dept_TotalExpTotal Expenses for the entire Athletic Department$116,141,712$27,071,219
FB_DeptExp%Football Expenses as a Percentage of the entire Athletic Department Expenses63.18%7.58%
Win_PercentageFootball teams Win Percentage62.43%19.97%
    

Recruit Visit Carbon Footprint

Calculating CO2eq emissions from travel can vary depending on the methods and formulas used. In this study, the researchers approximated the RVCF utilizing established methods from prior sport tourism carbon footprint research. The framework by Franchetti and Apul (18) required three boundaries. 1) Temporal Boundary, which refers to the period used for analysis, which, in this study, included Power-5 official recruiting visits from 2013 to 2020. 2) Organizational Boundary, which defines the measured entity, ensuring that only emissions produced from the designated entity are included. Here, it refers to the travel for a single recruit’s official visit to a Power-5 school. 3) Operational Boundary, which is based on the scope of emissions, including direct emissions, indirect emissions, and indirect products. The operational boundary was set at direct emissions only for this study.

In order to operationalize the boundaries, calculations were used to approximate each recruit’s carbon footprint as they travelled from their hometown to their selected school for an official recruiting visit. Cooper’s (2020) approximation of the University of Tennessee’s football gameday tourism carbon footprint was used as a framework for this study. The method for approximating the carbon footprint of sport tourism was applied to the dataset to calculate the approximated total amount of CO2eq emissions produced by each recruiting visit. The total carbon footprint of each visit was calculated by considering direct emissions from transportation (car and flight miles), food consumption per day, waste per day, and hotel stays (8, 14). The EPA formula for the average gasoline-powered passenger vehicle (3.91 × 10^-4 metric tons CO2eq per mile) was applied and converted into kilograms (15). For air travel emissions, the formula (air miles × 0.24 × 1.891) combined the Blue Sky Model formula and the Carbon Fund’s radiative forcing factor (1.891) to provide a total CO2eq per person per pound figure, which was then converted to kilograms (2, 3). Hotel accommodation emissions were calculated using Filimonau’s (17) factor of 11.65 kg CO2eq per night, multiplied by two to account for the typical two-night stay during a recruiting visit. For food and waste emissions, factors from Cooper’s (7) study were used: 7.4 kg CO2eq per person per day for food and 1.1 kg CO2eq per day for waste, multiplied by two for the typical 48-hour visit. Utilizing these formulas allowed the researchers to approximate the RVCF for each reported recruiting visit.

RESULTS AND DISCUSSION

Over the eight years, the top 23 highly-visited schools collectively emitted 2.3 million kg of CO2eq, averaging 328 kg CO2eq per recruiting visit. For context, the global average annual CO2eq emission per person is approximately 4.7 tons (4,263 kg), according to the IEA (22). Thus, the CO2eq for a single 48-hour recruiting visit represents about 7.7% of the average person’s global annual CO2eq footprint.

Table 2 provides a breakdown of RVCF variables along with the means and totals for all 23 schools to address RQ1, “Which Power-5 schools generated the largest RVCF between 2013 and 2020?” Washington State (n = 276) reported the highest total RVCF at 171,489.84 kg CO2eq, and the highest mean RVCF at 621.34 kg CO2eq. In contrast, the University of South Carolina (n = 263) had the smallest carbon footprint, with a total RVCF of 55,621.71 kg CO2eq and an average RVCF per visit of 211.49 kg CO2eq. All official visits to Washington State and South Carolina are depicted using GIS maps in Figure 1, which shows Washington State attracted several recruits from the Midwest, Florida, and Texas. At the same time, South Carolina only invited a few recruits who required a long-distance flight to visit Columbia, South Carolina.

Table 2   RVCF by school for all reported official visits from 2013 to 2020  
Schooln% Drove (No Flight)Car (No Flight)Car  (To/From Airport)FlightHotelFoodWasteMSDTotal
Washington State2762.17%564.4821,782.22137,796.796,405.954,084.79855.60621.34362.98171,489.84
Oregon2813.20%667.817,322.00150,332.796,522.004,158.79871.10604.54323.99169,874.50
Nebraska3735.90%1,437.039,203.29131,283.868,657.325,520.391,156.30421.60171.96157,258.20
Alabama37817.99%6,138.0121,504.7481,842.828,773.375,594.391,171.80330.34219.30125,025.12
Minnesota32815.55%1,543.639,279.4494,019.657,612.874,854.391,016.80360.75190.85118,326.78
Louisville3437.87%1,814.239,356.0792,166.147,961.025,076.391,063.30342.38189.87117,437.16
Oklahoma31521.27%7,627.739,318.5385,483.067,311.144,661.99976.50364.79187.59115,378.96
Tennessee35614.89%5,866.3411,129.2172,691.178,262.755,268.791,103.60293.04213.71104,321.87
Texas A&M32746.18%15,370.2818,162.7656,491.357,589.664,839.591,013.70314.50223.19103,467.35
Washington25122.31%2,329.335,626.1481,958.425,825.713,714.80778.10399.33250.68100,232.49
Ohio State30126.58%7,676.345,596.1469,715.536,986.204,454.79933.10316.82223.4995,362.10
Arkansas32515.38%4,563.0713,222.0664,152.597,543.244,809.991,007.50293.23166.9795,298.46
Indiana27315.38%3,307.9013,293.6462,085.006,336.324,040.39846.30329.34169.1389,909.57
Florida33328.53%8,658.696,782.7656,932.067,728.924,928.391,032.30258.45179.7686,063.13
Miami30139.53%3,920.904,735.8663,566.686,986.204,454.79933.10280.01263.3084,597.54
Florida State31714.20%3,929.026,600.1159,945.347,357.564,691.59982.70262.44182.3083,506.32
Auburn31335.14%9,914.5514,971.1242,452.127,264.724,632.39970.30256.25161.7180,205.21
Georgia27033.70%6,709.3215,279.3141,800.056,266.693,995.99837.00276.78196.7074,888.37
Penn State25429.53%9,210.7316,663.5938,471.375,895.343,759.20787.40289.49192.6574,787.61
Mississippi State29356.31%17,224.2017,413.1326,745.766,800.524,336.39908.30250.61176.9073,428.32
Kentucky27419.71%4,331.997,066.4249,895.236,359.534,055.19849.40264.81146.5772,557.76
LSU30038.00%7,465.215,491.1341,783.316,962.994,439.99930.00222.53146.7367,072.63
South Carolina26332.70%7,526.485,361.7631,921.566,104.233,892.40815.30211.49128.8055,621.71
Total7,04523.57%137,797.27255,161.411,633,532.66163,514.32104,265.8721,839.48328.91203.012,316,111.00
Note: Car, Flight, Hotel, Food, Waste, Mean, Standard Deviation, and Total are in kg CO2eq

To explore the second research question, “Did RVCF totals increase or decrease over time?” the dataset needed to be segmented. During this time period college football programs did not get an entirely new roster of players each year; consequently, examining each year’s change would yield varying results based on how many recruits the school needed that year. Rosters typically turn over every 4 to 5 years. Therefore, with eight years of data available, the dataset was subdivided into two four-year periods (2013-2016 and 2017-2020) to better examine changes over a longer period of time.

Table 3 shows the schools with the largest changes in their mean RVCFs. Fifteen schools experienced an increase in mean RVCF between the two time periods. Ohio State had the largest increase in mean difference (MD = 74.77 kg CO2eq), with its mean RVCF rising from 280.80 kg CO2eq in 2013-2016 to 355.57 kg CO2eq in 2017-2020. Oregon saw the largest overall increase in total RVCF, increasing 29,617.65 kg CO2eq during the latter period. Figure 2 utilizes GIS maps to depict all recruiting visits to Ohio State for each period, highlighting an expanded recruiting range that targeted more players from Texas and the Western United States. Conversely, eight schools showed a reduction in mean RVCF between the two time periods, with the University of Miami experiencing the largest decrease in mean difference (MD = -61.96 kg CO2eq). Although Washington State’s mean reduction was not as considerable as the bottom three schools, it had the largest total reduction in RVCF, decreasing by 19,562.28 kg CO2eq between the two periods.

Table 3 Largest Mean Difference in RVCF between the two time periods
 2013-2016 2017-2020 
SchoolsnTotalM nTotalM DifferenceMD
Ohio State15643,804.86280.80 14551,557.24355.57 7,752.3974.77
Penn State10225,847.68253.41 15247,683.11313.70 21,835.4460.30
Oregon12270,128.43574.82 15999,746.07627.33 29,617.6552.51
Florida St.16546,656.85282.77 15236,535.52240.37 -10,121.32-42.40
Arkansas15349,311.35322.30 17245,987.11267.37 -3,324.24-54.93
Miami15146,944.88310.89 15037,339.37248.93 -9,605.51-61.96
Note: Totals and Means are in kg CO2eq

Wuerzer et al. (38) identified county-level geographical hotspots in the United States overproducing elite college football talent, necessitating migration to other states to find available roster spots on Power-5 football teams. Consequently, Power-5 schools in regions with minimal elite talent and far from these hotspots must expand their recruiting efforts, increasing their RVCF. Schools that rely heavily on air travel for recruiting will naturally have a larger carbon footprint, as air travel is the primary contributor to total RVCF. This is evident from the top three schools with the highest total RVCF also have the lowest percentages of recruits visiting within driving distance to their campuses (Washington State (2.17%), Oregon (3.20%), and Nebraska (5.90%)). Despite this, schools still make strategic choices in their recruiting practices. For example, as shown in Figure 1, Washington State invited several recruits from Florida, a state with prominent county-level recruiting hotspots, instead of focusing on nearby regions or closer recruiting hotspots in California and Texas.

A Pearson correlation coefficient analysis was conducted to address research question three: “Were there any correlations between schools’ financial, recruiting, and performance variables and their RVCF?” The analysis identified two significant correlations, both negative: Total RVCF and Athletic Department Total Annual Revenue [r(176) = -.202, p = .007] and Athletic Department Total Annual Expenses [r(176) = -.198, p = .008]. These findings suggest that athletic departments with higher revenues and expenses tend to have lower RVCFs. This could be attributed to the fact that Power-5 programs with substantial financial resources often have well-established and highly regarded football programs, enabling them to attract top recruits from within a closer geographical range. Consequently, these programs would be less dependent on long-distance recruiting, which typically requires greater air travel, the primary contributor to a school’s RVCF, thereby lowering their overall RVCF.

Overall, these findings highlight the multifaceted nature of college football recruiting, shaped by a complex interplay of positional needs, recruits’ availability, and recruits’ geographical location. The competitive nature of Power-5 college football recruiting requires substantial time and effort to build top-tier recruiting classes, prompting many schools to expand their recruiting reach over time, which subsequently increases their RVCF. The findings show that 15 of the 23 schools increased their RVCF over the two periods. Given the fierce competition for elite talent, it is unlikely that any football program would willingly reduce its recruiting-related carbon emissions if it jeopardizes on-field performance. This creates a significant challenge for universities wanting to adopt more sustainable operations.

CONCLUSION 

This study provides a substantial initial assessment of the carbon footprint associated with college football recruiting. By utilizing GIS for recruits’ hometowns, college locations, and nearest major airports to calculate travel distances, the researchers provided an approximation of each school’s RVCF Recruiting Visit Carbon Footprint (RVCF). The findings reveal substantial variability in RVCF among schools, highlighting the different levels of environmental impact of recruiting. The study also found that higher-spending athletic departments tended to have lower RVCFs, suggesting that successful programs may not need to extend their recruiting reach as widely.

However, several limitations must be acknowledged. The data for this study came from a third-party recruiting website, thus allowing for only an approximate carbon footprint for each official visit. Also, various models and formulas can be used to estimate CO2eq emissions from travel, but each carries assumptions and biases. Moreover, policy changes during the study period, such as the NCAA’s 2016 rule change allowing schools to cover travel costs for up to two parents or guardians accompanying a recruit, could result in a higher actual carbon footprint than the reported RVCF from this study (33). More detailed research is essential for a more accurate and comprehensive understanding of the carbon emissions associated with college football recruiting. Unfortunately, without a governing body mandating standardized reporting of recruiting carbon emissions using consistent formulas, it will remain difficult to fully assess and compare the carbon emissions of different athletic departments.

APPLICATIONS IN SPORT

For universities aiming to reduce their athletic department’s carbon footprint, including all recruiting activities in their calculations is crucial. A comprehensive approach would enable the development of effective strategies that promote sustainability without sacrificing athletic success. Athletic departments can better incorporate sustainability into their planning and decision-making processes by understanding the true carbon footprint generated by each sport, school, and conference.

ACKNOWLEDGMENTS
This research was supported by a college-level seed grant focused on sustainability issues from the Huizenga College of Business and Entrepreneurship’s Societal Impact Seed Grant program.

REFERENCES 

1. Andris, C., Measuring geographic pull power: A case study of US college athletic teams. The Professional Geographer, 2018. 70(3): p. 476-490.

2. BlueSkyModel.org. 1 Air Mile. 2024; Available from: https://blueskymodel.org/air-mile.

3. CarbonFund.org. Calculation Methods. 2024; Available from: https://carbonfund.org/calculation-methods/.

4. Collins, A., et al., Assessing the environmental consequences of major sporting events: The 2003/04 FA Cup Final. Urban studies, 2007. 44(3): p. 457-476.

5. Collins, A., C. Jones, and M. Munday, Assessing the environmental impacts of mega sporting events: Two options? Tourism management, 2009. 30(6): p. 828-837.

6. Collins, A., M. Munday, and A. Roberts, Environmental consequences of tourism consumption at major events: An analysis of the UK stages of the 2007 Tour de France. Journal of Travel Research, 2012. 51(5): p. 577-590.

7. Cooper, J.A., Making orange green? A critical carbon footprinting of Tennessee football gameday tourism. Journal of Sport & Tourism, 2020. 24(1): p. 31-51.

8. Cooper, J.A. and D.H. Alderman, Cancelling March Madness exposes opportunities for a more sustainable sports tourism economy. Tourism Geographies, 2020. 22(3): p. 525-535.

9. Čuček, L., J.J. Klemeš, and Z. Kravanja, A review of footprint analysis tools for monitoring impacts on sustainability. Journal of Cleaner Production, 2012. 34: p. 9-20.

10. Daddi, T., et al., Environmental management of sport events: a focus on European professional football. Sport, Business and Management: An International Journal, 2022. 12(2): p. 208-232.

11. Death, C., ‘Greening’the 2010 FIFA World Cup: Environmental sustainability and the mega-event in South Africa. Journal of Environmental Policy & Planning, 2011. 13(2): p. 99-117.

12. Dimengo, N., Crazy Sports Recruiting Stories. Bleacher Report, 2014.

13. Dolf, M. and P. Teehan, Reducing the carbon footprint of spectator and team travel at the University of British Columbia’s varsity sports events. Sport Management Review, 2015. 18(2): p. 244-255.

14. El Hanandeh, A., Quantifying the carbon footprint of religious tourism: the case of Hajj. Journal of cleaner Production, 2013. 52: p. 53-60.

15. EPA. Greenhouse Gases Equivalencies Calculator – Calculations and References. US EPA 2024; Available from: https://www.epa.gov/energy/greenhouse-gases-equivalencies-calculator-calculations-and-references.

16. ESPN.com. College Football Stats. 2024; Available from: https://www.espn.com/college-football/stats/_/view/team.

17. Filimonau, V., et al., Reviewing the carbon footprint analysis of hotels: Life Cycle Energy Analysis (LCEA) as a holistic method for carbon impact appraisal of tourist accommodation. Journal of cleaner production, 2011. 19(17-18): p. 1917-1930.

18. Franchetti, M.J. and D. Apul, Carbon footprint analysis: concepts, methods, implementation, and case studies. 2012: CRC press.

19. Getz, D., Policy for sustainable and responsible festivals and events: Institutionalization of a new paradigm. Journal of Policy Research in Tourism, Leisure and Events, 2009. 1(1): p. 61-78.

20. Glasspiegel, R., Texas spent $280,000 on Arch Manning recruitment weekend. New York Post, 2022.

21. Huml, M.R., et al., If we build it, will they come? The effect of new athletic facilities on recruiting rankings for power five football and men’s basketball programs. Journal of Marketing for Higher Education, 2019. 29(1): p. 1-18.

22. IEA. The world’s top 1% of emitters produce over 1000 times more CO2 than the bottom 1% – Analysis – IEA. 2024 2024/08/31/; Available from: https://www.iea.org/commentaries/the-world-s-top-1-of-emitters-produce-over-1000-times-more-co2-than-the-bottom-1.

23. Jeyarajah, S., College football recruiting expenses by conference: SEC stands out as Georgia spends big for success in 2022, in CBSSports. 2023: CBSSports.com.

24. Jude, A., Ice carvings and lavish dinners: A look at the cost of football recruiting at Washington. Spokesman, 2020.

25. Knight-Newhouse. Knight-Newhouse College Athletics Database 2024; Available from: https://knightnewhousedata.org/.

26. Macaulay, C., J. Cooper, and S. Dougherty, High school football and the athletic-market economy: Recruiting, producing, and manufacturing talent. Sociology of Sport Journal, 2019. 36(3): p. 203-212.

27. Mankin, J.A., J. Rivas, and J.J. Jewell, The effectiveness of college football recruiting ratings in predicting team success: A longitudinal study. Mankin, J., Rivas, J. & Jewell, J.(2021) The Effectiveness of College Football Recruiting Rankings in Predicting Team Success: A Longitudinal Study. Research in Business and Economics Journal, 2021. 14: p. 4-22.

28. Natural Resources Defense Council, Collegiate game changers: How campus sport is going green. 2013, Natural Resources Defense Council.

29. NCAA. Recruiting – Eligibility Center. 2024; Available from: https://www.ncaa.org/sports/2014/10/8/recruiting.aspx.

30. NCSA. Official Visits | How Does an Official Visit Work? 2024; Available from: https://www.ncsasports.org/ncaa-eligibility-center/recruiting-rules/official-visits.

31. Pandey, D., M. Agrawal, and J.S. Pandey, Carbon footprint: current methods of estimation. Environmental monitoring and assessment, 2011. 178(1): p. 135-160.

32. Raynor, G., Behind the scenes of Clemson’s biggest recruiting weekend. New York Times, 2020.

33. VanHaaren, T., NCAA allows schools to pay for two guardians to accompany recruits on official visits, in ESPN. 2016.

34. Wackernagel, M. and W. Rees, Ecological footprint: Reducing human impact on the earth. New Society, Gabriola Island, BC, Canada, 1996.

35. Watkins, J. and K. Slater, Talent level and major distribution in “Power Five” conference football programs. Journal for the Study of Sports and Athletes in Education, 2021. 15(2): p. 150-170.

36. Wiltfong, S., New LSU 2024 QB commit Colin Hurley takes you inside an epic weekend. 247Sports, 2022.

37. Wittry, A., An Analysis Of College Football Recruiting Costs, in Athletic Director U. 2022: https://athleticdirectoru.com.

38. Wuerzer, T., J.J. Fountain, and P.S. Finley, An Analysis of the Geographic Origins and Migration Patterns of Elite College Football Players. Geographical Review, 2023. 114(2): p. 157-179.

2025-10-13T15:18:13-05:00May 27th, 2026|Contemporary Sports Issues, Research, Sports Studies, Sports Studies and Sports Psychology|Comments Off on An Analysis of Carbon Emissions from College Football Recruiting Visits

Total Goalkeeper Performance (TGP): A Comprehensive Metric for Evaluating Modern Soccer Goalkeepers

Authors: Daniel J. Marcolongo1, Bret R. Myers2

1Graduate of Sports Industry Management Program, Georgetown University, Washington DC, USA

2Department of Management and Operations, Villanova University, Villanova, PA, USA

 

Corresponding Author:

Bret R, Myers, Ph.D.

Department of Management and Operations

Villanova School of Business

800 E Lancaster Avenue

Villanova, PA 19085

[email protected]

Daniel J. Marcolongo is a 2025 graduate of Georgetown University’s Sports Industry Management masters program. His focus is in soccer analytics, which he developed as a collegiate soccer player and lifelong student of the game.

Bret R. Myers, Ph.D. is a Professor of Practice in the Department of Management and Operations in the Villanova School of Business. His research interests focus on sports analytics, specifically, in the areas of team evaluation and managerial decision-making. He also is an active Analytics Consultant with 10+ years of experience working with professional teams and other sports organizations.

ABSTRACT 

The purpose of this study was to develop and validate a comprehensive metric for evaluating modern soccer goalkeepers that accounts for both defensive and offensive responsibilities. Total Goalkeeper Performance (TGP) was constructed using publicly available data from the English Premier League, incorporating shot-stopping, cross-stopping, sweeping, and distribution metrics. Analysis of 70 observations of goalkeeper performance revealed a moderate positive correlation between TGP and team success (r = 0.474, p < 0.001), with TGP explaining 22.5% of variance in expected team points per game (3 points for win/1 point for draw/0 points for loss). A one-unit increase in TGP corresponded to 1.75-4.64 additional expected points over a 38-match season. Year-over-year analysis showed moderate consistency in goalkeeper performance as measured by TGP. These findings suggest TGP effectively captures goalkeeper contribution to team success while accounting for the evolving multidimensional nature of the position. TGP provides a data-driven framework for recruitment, talent identification, and tactical planning that aligns with the demands of modern soccer.

Key Words: soccer analytics, goalkeeper metrics, player evaluation, player development

INTRODUCTION 

The goalkeeper is a unique position in sports. The player is often isolated from the rest of the team as the last line of defense. They even have different equipment from the rest of the team. In hockey, soccer, and more, one can immediately distinguish a goalkeeper from their teammates (3, 5). It leads to a feeling that the goalkeeper is isolated from the rest of the team. But no one is an island. There have been times when goalkeepers have done more than just protect their goal, which has been seen prominently in soccer (13).

This has been supported by several studies into the position. A study using machine learning algorithms showed that the difference between what they called elite and sub-elite goalkeepers was their ability with their feet (11). This suggests that the position has evolved so much that shot stopping is not even a goalkeeper’s main priority at the world’s best clubs. This was far from the only study to suggest that goalkeepers have seen an increase in their responsibilities. Goalkeepers are asked to do a lot more than just save shots in today’s game (13, 19). Soccer is not the only sport where this has occurred.

From roughly the mid-1990s to the mid-2000s, certain hockey goaltenders had a similar task. The most notable was Martin Brodeur. The New Jersey Devils, Brodeur’s team, employed a strategy called a neutral zone trap. The trap utilized a goalkeeper’s ability outside of the net to limit their opponent’s scoring chances. The Devils won three Stanley Cup titles from 1994 to 2003 with this strategy before the NHL introduced a new rule which severely limited what a goalkeeper could do outside of making saves (5).

In soccer, a goalkeeper is the only player on the team who can touch the ball with their hands and the player can only do this inside the 18-yard box (2). Historically, this led to goalkeepers not playing with their feet at all. But as the game evolved, this began to change, helped along by a rule change after the 1990 World Cup. The 1990 World Cup is considered one of the worst World Cups of all time due to the boring play. Part of the reason for the dullness was the goalkeepers who would waste time by holding the ball as long as legally allowed (14).

To combat this, the back-pass rule was introduced. This established the rule that goalkeepers could not pick up the ball if a player on their team passed it to them (14). With the change introduced to the game, it made goalkeepers’ ability to play with their feet more important (1). Following the success of goalkeepers like Manuel Neuer and Ederson, a goalkeeper’s distribution has become an essential skill (16-17). It is to the point that in some development teams, such as Chelsea, goalkeepers are judged more for their passing than their saves (4). Goalkeepers need to do so much more than just stop shots. However, that idea still hasn’t taken hold.

There is no way to rank soccer goalkeepers in a way that accounts for what they do with the ball. There isn’t even a statistic to accurately rank a goalkeeper defensively. Some statistics individually look at saves, cross-stopping, and sweeping, but no statistics takes all those aspects into account (6). In fact, some awards recognize players for a single performance statistic. For example, the Premier League Golden Glove award is given to the goalkeeper who has had the most ‘clean sheets’ (i.e., a game where they did not allow a goal) (10). The way goalkeepers are ranked has not kept up with the times. There should be a new statistic that accurately rates a goalkeeper based on everything they have to do, their Total Goalkeeper Performance.

The purpose of this study is to develop and advance a comprehensive metric for evaluating modern soccer goalkeepers that captures both their defensive and offensive responsibilities. First, the study outlines the methodology, including the acquisition of player performance data from the English Premier League for men’s professional soccer. Second, the offensive and defensive statistics used to construct the Total Goalkeeper Performance (TGP) metric are defined, and the procedures for calculating these statistics are detailed. Third, the data analysis plan—featuring correlation analysis and regression modeling—is described. The results are then presented and interpreted, followed by the study’s conclusions. Finally, the practical applications of this metric within sports analytics, particularly in organized soccer, are discussed.

METHODS 

Dataset and Sampling

The data used in the paper spans eight seasons from the Premier League (2017-2018 through 2024-2025), a period where modern goalkeeper statistics are publicly available. All of the data comes from FBRef.com, with the exception of the data on punches which came from the Premier League’s website. In all, 70 goalkeepers were examined.

TGP features several different parts of a goalkeeper’s responsibilities. These can be divided into defensive and offensive statistics. Based on the data available the following performance statistics are used to construct TGP.

Defensive Statistics

Defensively, the main responsibility a goalkeeper has is shot stopping, making saves to prevent goals, but that’s not their only task. Goalkeepers also must defend the goal when balls come into their area. This can be from either crosses that a goalkeeper must deal with inside the 18-yard box or passes that force a goalkeeper to leave the box (18). These skills will be called cross-stopping and sweeping.

  1. Shot Stopping

Shot stopping will be measured with expected goals (xG). xG tracks how likely a goal is to be scored from the moment it is struck on a scale of 0-1 with a better shot being ranked closer to 1 (8). To make this stat useful for a goalkeeper, one must track how much xG a goalkeeper faces and then subtract the total number of goals allowed to get the post-shot (PS) expected goals minus goals allowed (GA). In order to standardize this for all goalkeepers, the statistic will be converted into a per 90 minutes through using the minutes played by each goalkeeper (PSxG-GA/90).

2. Cross Stopping

Cross Stopping is another skill needed to be quantified. Goalkeepers typically face several crosses being put into their box during a game. The best way to stop a cross is to claim it, catching the cross before the opposition can get to it. Punching a cross away can also be beneficial but is not preferable to catching as the ball could go back to the opposition but it is still preferable to leaving the cross to the opposition (9). When making cross-stopping into a statistic, one must factor in both crosses claimed (CC), crosses punched (CP), and total crosses faced (TC), but claiming and punching crosses are not equal.. Because of that, TGP weighs a punch as half of a claim when measuring cross-stopping. The final stat to measure cross stopping for TGP is: Cross Stopping = (CC+.5xCP)/TC.

3. Sweeping

Sweeping is the easiest of the three defensive skills to quantify. The statistic – defensive actions outside of the penalty area – measures sweeping well. This tracks how often a goalkeeper comes outside of his goal to help his team (6). The more often a goalkeeper does this, the better they are at sweeping. To fairly measure these statistics in comparison, it must be looked at on a per-90-minute basis as well. TGP will use defensive actions outside the penalty area per 90 (DAOP/90) minutes to measure sweeping.

From an interview with US International goalkeeper Tyler Miller, shot-stopping is the most important, followed by cross-stopping, then sweeping (12). TGP will weigh the skills 3:2:1 in that order. The stats will then be added together to make a defensive score.

Offensive Statistics

Days 1-4 focused on primary lift progression (Front Squat, Bench Press, Deadlift, Overhead Press) with integrated plyometric, conditioning, and movement quality components. Day 5 emphasized pulling strength and unilateral work. Day 6 focused on coordination, explosive power, and metabolic conditioning. All days included Tabata rowing (20 seconds work/10 seconds rest × 8 rounds) for conditioning stimulus and mental toughness development.

  1. Pass Completion Percentage (in buildup)

Offensive skills are more difficult to track due to the limitations of data on goalkeepers’ offensive abilities. One skill to track is a goalkeeper’s ability in buildup, making short passes to help his team up the field. In addition, a goalkeeper’s ability to make decisive passes that can start an attack on his team is important as well. The best widely available statistic to track a goalkeeper’s ability in buildup is completion percentage (PC) , how accurate they are as a passer.

2. Long Pass Completion Percentage

Similarly, long pass completion percentage (LP) shows how effective a goalkeeper is with long passes that are more likely to lead to an attack (6). Both statistics, completion percentage and long pass completion percentage, will be weighed equally to make an offensive score.

Component Weighting and Possession-Based Adjustments

The TGP metric is a weighted average of offensive and defensive scores.  However, weights are conditionally applied based on possession characteristics of the team. Teams with more possession tend to take more advantage of the offensive skills of the goalkeeper through more time with the ball. Meanwhile, teams with less of the ball have much less of a use for an offensively minded goalkeeper. (18). Because of that, possession, the statistic for how much of the ball a team has per game, is a way to weigh how important a goalkeeper’s offensive skills are for a team (6).

To ensure that the statistic is applicable across different seasons, a player’s score in different statistics will be weighed against the league’s average score. This includes the league average scores on shot-stopping (μPSxG – GA/90), cross stopping (μ(CC + .5CP)/TC)), sweeping (μDAOP/90), pass completion percentage(μPC), and long pass completion percentage (μLP).

For a team with 62.5 percent possession (P) or more, a number chosen for ease of calculations though it is a number only the most ball dominant teams can reach, the offensive and defensive scores will be weighted equally. For a team with 37.5 percent possession or below, a number chosen for the same reasons but for the least ball dominant teams, it will be weighted 3:1 defensive score to the offensive score (7). For example a player on a team with 62.5% possession or more his defensive and offensive scores would remain the same for calculations. In a team with 37.5% possession or below his defensive score would be multiplied by 1.5 and his offensive score multiplied by 0.5. For a team with between 37.5 and 62.5 percent possession the weight would slide between those ratios. For example, in a team with .531 percent possession a goalkeeper would have his defensive score multiplied by 1.188 and his offensive score multiplied by .812.

The overall TGP formula for a goalkeeper per match can be expressed as follows:

 TGP = (DS*(2 – (2P – 0.25))) + (OS*(2P – 0.25))) / 2

where:
 
DS = 1.47*(((PSxG – GA/90 + 0.52)/(μPSxG – GA/90 + 0.52)*5.09) + 0.97*(((CC + .5CP)/TC)/μ(CC + .5CP)/TC)*5.15) + 0.62*((DAOP/90)/μ(DAOP/90)*4.02) OS = (0.90*((PC*/μPC*)/10.12) + 1.01*((LP*/μLP*)/9.92)/(4/3)

*μ represented the mean levels of the performance metric by league.

Here is a sample calculation for a high performing goalkeeper with the following measures:

Nick Pope 2023-24: TGP=(19.37*1.206 + 16.05*0.794) / 2 = 18.03

DS=1.47*(.58/0.44)*5.09 + (0.97*0.98)/0.83)*5.15) + 0.62*(1.87/1.29)*4.02=19.37

OS=(0.99(76.9/72.9)*10.12+1.01*(47.9/44.27)*9.92)/(4/3)=16.05

Here is a sample calculation for a low performing goalkeeper with the following measures:

James Trafford 2023-24: TGP (14*1.302 + 12.09*.698) / 2 = 13.36

DS=1.47*(0.31/0.44)*5.09 + 0.97*(0.95/0.83)*5.15) + 0.62*(1.54/1.29)*4.02=14.00

OS=(0.99(65.5/72.9)*10.12+1.01*(32/44.27)*9.92)/(4/3)=12.16

The formula is created with a score of 15 to be the league average for every season. The numbers each individual statistic is multiplied by is there to ensure that no stat is weighted more than any other.

Analyses and Visualizations

Three key areas in this study are explored: 1) Relationship between TGP and Team Success, 2) Individual TGP Rankings and Year-Over-Year Repeatability, 3) TGP vs. Player Market Value. In order to evaluate Team Success, Team EPL Points per Game (PPG) will be used (3 points for team 1, 1 point for team draw, 0 points for team loss). Python is used to carry out correlation and regression analyses exploring the relationship between TGP and Team Success based on n = 70 qualifying goalkeepers from the EPL. Specifically, the scipy library is used for correlation analysis and statsmodels library is used for regression analysis. Furthermore, data visualization is carried out using Python’s matplotlib library. Correlation analysis and data visualization (also from Python) are also used to explore year-over-year repeatability of TGP scores based on n = 10 qualifying goalkeepers, Similar methods are also used to help examine the relationship between TGP and Player Market value.

RESULTS

Relationship between TGP and Team Success

In order to assess the relationship between TGP and team success, a Pearson correlation analysis was performed comparing TGP to PPG for 70 observations across the 2022-2023, 2023-2024, and 2024-2025 English Premier league seasons. The data set is representative of 39 distinct goalkeepers that qualify by having played at least 10 matches.

The analysis revealed a moderative positive correlation between TGP and PPG (r = 0.474 and p < 0.001). This indicates that goalkeepers with higher TGP scores tend to play for teams that earn more points per match. Figure 1 displays the scatterplot with a fitted regression line which demonstrates the positive, linear trend between TGP and team performance. While correlation does not imply causation, the statistically significant relationship suggests that the multidimensional TGP metric captures aspects of goalkeeper performance that contributes directly to winning.

Figure 1

Note. Scatterplot depicting relationship Points per Game and TGP across 2022-2023, 2023-2024, and 2024-2025 seasons in the English Premier League.

Furthermore, a simple linear regression was performed to help understand the magnitude of the contribution to team success. The analysis was performed using the statsmodels library in Python and the results are included in Figure 2.

Figure 2

Note. Ordinary Least Squared Regression Results for TGP vs. Team Performance

The model was statistically significant, F(1,68)=19.74, p<0.001, and explained 22.5% of the variance in PPG (R² = 0.225). The resulting regression equation was:

PPG=0.143+0.084×TGP

The TGP coefficient was positive and significant (β=0.084, t=4.44, p1<0.001), with a 95% confidence interval ranging from 0.046 to 0.122. To put it more in practical terms, every 1 unit increase in TGP is expected to increase points per game from 0.046 to 0.122. In the context of a 38 match EPL season, a 1 unit increase in TGP exhibited by GKs would lead to 1.75 to 4.64 additional points.

Individual TGP Rankings and Year-over-Year Analysis

The Total Goalkeeper Performance (TGP) results for the 2024–2025 Premier League season are summarized in Table 2 below. The top-performing goalkeeper was Guglielmo Vicario of Tottenham Hotspur, who achieved a TGP score of 19.93 across 24 league appearances. Based on the established regression model, this corresponds to an expected points-per-game (PPG) value of approximately 1.81. In contrast, Alphonse Areola of West Ham recorded the lowest TGP score of 11.50 over 26 matches, translating to an expected PPG of roughly 1.10. When extrapolated over a full 38-match season, the difference in expected point contribution between a high-performing and low-performing goalkeeper equates to 26.98 points (68.78 vs. 41.80). While overall team success depends on multiple factors—including defensive structure and attacking capabilities—this analysis demonstrates that goalkeeper performance, as captured by TGP, is a significant independent driver of team outcomes.

Table 2

2024-2025 TGP Rankings in the EPL

RankingPlayerClubEffective Matches played (per 90)TGP
1Guglielmo VicarioTottenham2019.93
2EdersonMan City21.818.95
3Nick PopeNewcastle2318.28
4Robert SánchezChelsea2717.92
5Arijanet MuricIpswich1816.93
6David RayaArsenal3316.78
7AlissonLiverpool22.916
8Mark FlekkenBrentford31.415.87
9Kepa ArrizabalagaBournemouth2615.7
10Emiliano MartínezAston Villa3115.57
11Jordan PickfordEverton3314.98
12Łukasz FabiańskiWest Ham11.914.54
13Mads HermansenLeicester25.514.43
14Stefan OrtegaMan City11.214.12
15Bart VerbruggenBrighton3114.1
16André OnanaMan United3213.53
17Bernd LenoFulham3313.45
18Dean HendersonCrystal Palace3313.38
19José SáWolves2512.37
20Aaron RamsdaleSouthampton2512
21Matz SelsNottingham Forest3211.68
22Alphonse AreolaWest Ham21.111.5

There is also good evidence of the repeatability of TGP ratings year over year. That is – there is slight to moderate positive correlation between seasons. Table 3 represents the TGP performance of 10 GK who had qualifying minutes in the 2022-2023, 2023-2024, and 2024-2025 seasons.

Table 3

Year-over year TGP performances of qualifying Goalkeepers

Player24-25 TGP23-24 TGP22-23 TGP
Emiliano Martínez15.5720.9819.42
Ederson18.9519.5316.57
Nick Pope18.2818.0317.41
Alisson16.0016.0620.72
David Raya16.7816.6717.72
Robert Sánchez17.9217.3313.96
Bernd Leno13.4514.4718.67
Jordan Pickford14.9816.5514.88
José Sá12.3717.4312.96
Dean Henderson13.3811.7012.90

To accompany this table, Figure 3 below shows a correlation matrix that summarizes the strength of the pairwise association between each of the last three seasons in terms of TGP performance.

Figure 3

Note. Correlation matrix of TGP performance for 2022-2023, 2023-2024, and 2024-2025 seasons

TGP vs. Player Market Value

Player evaluators and scouts need to be in tune with the market value of players. One common method is to use Transfermkt (https://www.transfermarkt.com/), a highly reputable site used to estimate player market value based on performance, potential, age, and other market trends. Accordingly, the player market values from the recent 2024-2025 season were collected and paired against TGP values. The relationship between the two variables is depicted in Figure 4.

Figure 4

Note. TGP vs. Player Market Value for the 2024-2025 season.

As you can see, there is a baseline positive relationship between TGP and player market value. The scatterplot also labels the player with a color-coding system such that players above the expectation of performance by salary are in green, while those at expectation are in yellow, and those below expectation in red. Given the typical club operates on player budgets for wages, it is a common goal to try to acquire players that deliver at or above expectations in terms of performance.

DISCUSSION

Interpretation of Findings

The results of this study provide compelling evidence for the utility of the Total Goalkeeper Performance (TGP) metric as a comprehensive evaluation tool for modern soccer goalkeepers. The correlation (r = 0.474, p < 0.001) between TGP and PPG is evidence of a moderate, positive association between goalkeeping performance (as measured by TGP) and team performance. Furthermore, it can be said that 22.5% of the variation in PPG can be explained by the TGP metric. Given that there are 11 players on the field that contribute to team performance, 22.5% in the goalkeeping position signifies how critical the position is to team success.

The regression model also indicates that a single unit increase in TGP corresponds to an additional 1.75 to 4.64 points over a 38-match season. This finding quantifies the tangible impact a high-performing goalkeeper can have on a team’s league position. The substantial 26.98-point difference in expected contribution between the highest and lowest TGP scores in our sample (Vicario at 19.93 vs. Areola at 11.50) underscores the potential competitive advantage teams can gain through goalkeeper selection and development.

The year-over-year analysis reveals moderate consistency in goalkeeper performance as measured by TGP, suggesting that while the metric captures some stable aspects of goalkeeper ability, performance also fluctuates due to contextual factors such as team defensive structure, managerial approach, and opposition quality. This temporal stability adds credibility to TGP as a metric that identifies genuine skill rather than merely capturing random variation.

Tactical or Practical Implications

The TGP metric offers several practical applications for soccer professionals. First, it provides a data-driven framework for recruitment and talent identification that aligns with the multidimensional demands of the modern goalkeeper position. Clubs can use TGP to identify goalkeepers whose specific skill profiles match their tactical approach, rather than relying on traditional metrics that may not capture relevant abilities.

For teams with high possession percentages, our findings suggest that investing in goalkeepers with strong distribution skills yields tangible benefits. Conversely, teams that typically have less possession might prioritize shot-stopping and cross-claiming abilities. This contextual approach to goalkeeper evaluation enables more nuanced decision-making in the transfer market. Our analysis shows how TGP can be paired with player valuations, which can enable front offices to make smarter decisions.

The year-over-year analysis provides insights for player development specialists. The moderate temporal stability of TGP scores suggests that while goalkeeper performance has a skill component that persists across seasons, there is also room for improvement through targeted training. Development programs could use TGP component scores to identify specific areas for improvement in young goalkeepers.

From a tactical perspective, managers can use TGP to inform game strategy. Understanding the relative strengths of opposition goalkeepers across different dimensions could influence pressing approaches, crossing strategies, and shot selection. Similarly, awareness of one’s own goalkeeper’s TGP profile might influence defensive organization and build-up patterns.

Limitations

Several limitations must be acknowledged when interpreting these results. First, while our dataset includes three seasons of Premier League data, it represents only one league.. Goalkeeper requirements may differ substantially across leagues with different tactical tendencies, and the TGP weightings established here may not generalize perfectly to other contexts.

Second, our reliance on publicly available data limits the granularity of our analysis. More sophisticated tracking data could provide additional insights into goalkeeper positioning, command of area, and communication—aspects that are difficult to quantify with event data alone. The offensive component of TGP is particularly constrained by data availability, as metrics like pass completion percentage do not fully capture the quality and tactical significance of goalkeeper distribution.

Third, while we adjusted for team possession, other contextual factors like defensive structure, opposition quality, and score state may influence goalkeeper performance in ways not fully accounted for in the TGP metric. A goalkeeper playing behind a well-organized defense may face fewer high-quality shots, potentially affecting their PSxG-GA/90 component.

Finally, our weighting system, while informed by intuitive insight, introduces a subjective element to the metric. Different experts might propose alternative weightings based on their philosophical approach to the position. Future research could explore the sensitivity of TGP to different weighting schemes or develop data-driven approaches to component weighting. Despite these limitations, TGP represents a significant advancement in goalkeeper evaluation methodology and provides a foundation for future refinements as data availability and analytical techniques continue to evolve.

CONCLUSION 

This study demonstrates that the Total Goalkeeper Performance (TGP) metric is a robust and comprehensive tool for evaluating modern goalkeepers. By integrating both defensive and offensive contributions into a single, possession-adjusted framework, TGP captures the multidimensional nature of the position more effectively than existing measures. The results show a clear and statistically meaningful relationship between TGP and team success, as well as moderate year-over-year consistency, establishing TGP as a credible and practical benchmark for goalkeeper performance.

TGP should be recognized as a new standard for goalkeeper evaluation. It provides clubs, coaches, and analysts with a powerful framework for recruitment, player development, and tactical decision-making. The metric moves beyond traditional, outdated statistics such as clean sheets and instead delivers a data-driven, holistic assessment that reflects the modern demands of the position.

While future refinements—particularly improved offensive data, expanded league coverage, and longitudinal tracking—will further strengthen its utility, the evidence presented here is clear: TGP represents a decisive advancement in goalkeeper analytics. By adopting this framework, the soccer industry can better align evaluation practices with the realities of today’s game and gain a competitive edge in identifying and developing top goalkeepers.

APPLICATIONS IN SPORT

TGP provides practical value for multiple stakeholders in professional soccer. For technical directors and recruitment teams, it offers a multidimensional framework for goalkeeper evaluation that aligns with modern tactical demands, enabling more informed transfer decisions by identifying goalkeepers whose specific skill profiles match a team’s playing style. For coaches and tactical analysts, TGP components can inform game strategy by highlighting opposition goalkeeper weaknesses across different dimensions. Teams might adjust pressing approaches against goalkeepers with poor distribution or increase crossing volume against those who struggle with aerial control. Player development specialists can utilize TGP component scores to create targeted training programs addressing specific goalkeeper weaknesses, allowing youth academies to track development progress across all relevant goalkeeper skills rather than focusing exclusively on traditional shot-stopping metrics.

This type of expanded analysis has proven transformative in other sports. In American football, quarterback evaluation has evolved far beyond simple counting statistics such as touchdowns or interceptions. Advanced metrics like Expected Points Added (EPA), Completion Percentage Over Expectation (CPOE), and QBR now provide a multidimensional assessment of quarterback decision-making, efficiency, and contextual performance. In baseball, the introduction of Wins Above Replacement (WAR) revolutionized how players are valued, combining offensive, defensive, and baserunning contributions into a single comprehensive number. These examples illustrate the power of moving past one-dimensional measures to holistic frameworks that better reflect player impact.

Soccer goalkeepers are a natural candidate for this type of approach, but they are not alone. Other sports positions that blend defensive and offensive responsibilities—such as catchers in baseball, liberos in volleyball, or goaltenders in lacrosse and hockey—could benefit from similar metrics that capture their multifaceted roles. Expanding evaluation frameworks in this way allows teams across sports to more accurately quantify player value, align talent acquisition with tactical systems, and design targeted development programs that reflect the true demands of the position.

REFERENCES 

  1. Alencar, M. (2024, Mar 20). Higuita: Scorpion kick changed football forever: Rene Higuita’s  acrobatic clearance helped usher in new era of ball-playing goalkeepers, he tells Mauricio Alencar. City A.M. https://proxy.library.georgetown.edu/login?url=https://www.proquest.com/newspapers/higuita-scorpion-kick-changed-football-forever/docview/2968727496/se-2?accountid=11091
  2. Bate, A. (2021, May 26). Stanley Menzo interview: Goalkeeping pioneer who changed the game under Johan Cruyff at Ajax. Sky Sports. https://www.skysports.com/football/story-telling/11946/12311915
  3. Bugda, G., & Swann, S. (2024, July 9). Who is the “goalkeeper” for your organization? Medium. https://twodummies.medium.com/who-is-the-goalkeeper-for-your-organization-aa6b53838f9f
  4. Carmichael-Brown, S. [Hashtag United]. (2024, January 9). CHELSEA GOALKEEPER TED CURD RECALLED! [Video]. Youtube. https://www.youtube.com/watch?v=BLCTnNQfWPY
  5. Diamos, J. (2005, September 16). Hockey; new rule will take a weapon away from Brodeur. HOCKEY – New Rule Will Take a Weapon Away From Brodeur – NYTimes.com. http://web.archive.org/web/20131101025522/https://select.nytimes.com/gst/abstract.html?res=F10616F835550C758DDDA00894DD404482
  6. FBRef. (2025). Premier League goalkeeper stats. FBref.comhttps://fbref.com/en/comps/9/keepersadv/Premier-League-Stats#all_stats_keeper_adv
  7. FBRef. (2025). Premier League stats. FBRef.com https://fbref.com/en/comps/9/Premier-League-Stats
  8. FBRef. (2019). XG explained. FBref.com.  https://fbref.com/en/expected-goals-model-explained/
  9. FIFA. (2025, January 7). Goalkeeping: Dealing with crosses from corners. FIFA Training Centre. https://www.fifatrainingcentre.com/en/game/tournaments/fu17wwcfu17wwc-24/2024/goalkeeping-dealing-with-crosses.php
  10. Garrick, O. (2024, May 19). Arsenal’s David Raya wins 2023-24 premier league golden         glove award. The New York Times .https://www.nytimes.com/athletic/5495358/2024/05/19/david-raya-arsenal-golden-glove/
  11. Jamil, M., Phatak, A., Mehta, S., Beato, M., Memmert, D., & Connor, M. (2021, November 22). Using multiple machine learning algorithms to classify elite and sub-elite goalkeepers in professional men’s football. Nature News. https://www.nature.com/articles/s41598-021-01187-5#Sec7
  12. Marcolongo, D., & Miller, T. (2025, March 27). Interview with Tyler Miller. personal.
  13. Obetko, M., Peráček, P., Mikulič, M., & Babic, M. (2022). Technical–tactical profile of an elite soccer goalkeeper. Journal of Physical Education and Sport, 22(1), 38-46.             https://doi.org/10.7752/jpes.2022.01005
  14. Patrikarakos, D. (2009). Defining Moment: The back-pass rule livens up football, 1992. FT.Com, https://proxy.library.georgetown.edu/login?url=https://www.proquest.com/trade-journals/defining-moment-back-pass-rule-livens-up-football/docview/229164255/se-2?accountid=11091
  15. Premier League. (2025). Most punches – premier league goalkeeper stats. https://www.premierleague.com/stats/top/players/punches
  16. Rose, G. (2025, January 8). Manchester City News: How Pep Guardiola has influenced goalkeeper evolution in English football. BBC Sport. https://www.bbc.com/sport/football/articles/clyvw8xlv4lo
  17. Schmidt, C., & Hopkins, O. (2021, April 7). Manuel Neuer: Record-chaser and revolutionary. Opta Analyst. https://theanalyst.com/2021/04/manuel-neuer-record-chaser-and-revolutionary
  18. Smith, A. (2023, August 15). Best goalkeeper in the Premier League revealed: Andre Onana, David Raya and Robert Sanchez transfers analysed. Sky Sports. https://www.skysports.com/football/news/11661/12933578/best-goalkeeper-in-the-premier-league-revealed-andre-onana-david-raya-and-robert-sanchez-transfers-analysed
  19. Yam, D. (2019). A data driven goalkeeper evaluation framework. MIT Sloan Sports Analytics Conference. https://www.sloansportsconference.com/research-papers/a-data-driven-goalkeeper-evaluation-framework

2025-12-03T09:26:11-06:00May 20th, 2026|General, Sport Training, Sports Coaching, Sports Exercise Science|Comments Off on Total Goalkeeper Performance (TGP): A Comprehensive Metric for Evaluating Modern Soccer Goalkeepers

BOOK REVIEW: Moawad, T (2020). It takes what it takes: How to think neutrally and gain control of your life. HarperOne.

Author: Barrett Snyder

Corresponding Author:

Barrett Snyder

[email protected]

EDITOR’S NOTE: This article was written while the author was a student. The author has since graduated. The author holds an M.S. Exercise Science degree from West Chester University of Pennsylvania

Before his name appeared on a bestselling book, Trevor Moawad was already shaping champions behind the scenes. Dubbed “The World’s Best Brain Trainer” by Sports Illustrated in 2017, he spent years redefining mental conditioning at the highest levels of sport. From IMG Academy to a nine-season run with Nick Saban at Alabama, his résumé spanned elite college programs, pro teams like the Memphis Grizzlies and Miami Dolphins, and military units such as the U.S. Navy SEALs.

Moawad gained wider recognition through his work with NFL quarterback Russell Wilson, whom he met in 2012. He soon became a core member of Wilson’s performance team, and their relationship evolved into a close friendship and business partnership. In 2018, they co-founded Limitless Minds, a company focused on building sustainable mindset habits.

Despite years of working with world-class athletes, Moawad didn’t publish his first book until 2020: It Takes What It Takes: How to Think Neutrally and Gain Control of Your Life—a work I consider essential reading for anyone interested in mental performance, both in sports and in life. The book is divided into twelve chapters, covering topics such as planning, visualization, self-awareness, handling pressure, and leadership. While every chapter offers valuable insight, this review highlights the three that best capture Moawad’s message and resonated most with me: Chapter 1, “It Takes Neutral Thinking”; Chapter 3, “It Takes Hard Choices”; and Chapter 4, “It Takes a Verbal Governor.”

Chapter 1 introduces the cornerstone of Moawad’s philosophy: neutral thinking. Rather than leaning into overly positive or negative mindsets, it centers on the present and what can be controlled in the moment. Neutral thinking accepts that the past is irrevocable—it can’t be changed with mantras or platitudes. Moawad warns of a common bias in performance: “We elevate the past. We give it too much importance. We serve the past when we should be giving it berth.” That line stayed with me. Like many, I often overanalyze past decisions and dwell on mistakes. Moawad’s perspective challenged me to let go of that habit. Neutral thinking encourages us to move forward without being anchored by what came before. The past may be real, but it’s not predictive. In a culture often drawn to blind optimism, Moawad’s approach felt both grounding and liberating.

In Chapter 3, Moawad poses a powerful question: is choice an illusion? Often, he argues, it is. Success isn’t about what you feel like doing—it’s about what must be done. “A lot of times we feel as if we have choices to make about where we want to go and what it takes to get there. The reality is that what it takes to succeed is not really a choice,” he writes. He illustrates this idea with everyday decisions: sleep or binge Breaking Bad? Jack and Coke or water? Time with your kids or video games? These moments reveal how easily we confuse comfort with choice—and how small, daily decisions shape long-term outcomes. To illustrate this further, Moawad offers one of the book’s most memorable lines: “When I started working with the Alabama football team, I would hold a bag of Doritos in one hand and an apple in the other. ‘Do you really need a nutritionist to tell you which of these things is better for you?’” What first sounds like a joke lands as one of the book’s most honest truths: we usually know the better option—we just don’t always choose it.

Chapter 4 offers the most immediately actionable advice: “What if we could get people to just stop saying stupid sh— out loud?” The brain absorbs negativity more easily than positivity, and voicing our struggles makes them more harmful than merely thinking them. As someone prone to verbalizing self-doubt, I found Moawad’s message powerful—what we say out loud can reinforce the very negativity we’re trying to overcome. Moawad draws on research to show how negative self-talk can undermine performance, citing the infamous error by Red Sox first baseman Bill Buckner in the 1986 World Series. Nineteen days earlier, Buckner had said aloud, “The dreams are that you’re gonna have a great series and win. The nightmares are that you’re gonna let the winning run score on a ground ball through your legs.” Whether or not it affected the outcome, Moawad argues the fear was already present—and that’s the point: stop saying stupid sh— out loud.

As someone who values academic literature, I’ll note the book includes little scholarly research beyond a few selected studies. Rooted mainly in Moawad’s anecdotal experience, it isn’t meant as an academic work—and while it may lack empirical depth, that feels secondary to its purpose. In terms of content and accessibility, the book’s heavy use of sports examples may be a barrier for some. Readers less interested in athletics might find the frequent game and athlete references less relatable. While Moawad includes a few business and everyday examples, the book is firmly rooted in sports. Still, its core lessons extend well beyond the field, which is why I ultimately recommend it to all audiences.

Moawad concludes the book by returning to his core principle of neutral thinking, offering a memorable metaphor: “The idea of living neutral is putting a comma at the end of the event…and knowing that the next words will determine how the sentence continues.” That image reshaped how I view setbacks—a reminder that the story isn’t over unless I decide it is. Moawad’s message is clear: we hold the pen, and with it, the power to shape what comes next.

2025-10-10T11:45:06-05:00May 13th, 2026|Book Reveiws, Contemporary Sports Issues, Leadership|Comments Off on BOOK REVIEW: Moawad, T (2020). It takes what it takes: How to think neutrally and gain control of your life. HarperOne.

Identifying Self-Awareness of Leadership Abilities Using 360 Degree Feedback Method: A Case Study of Collegiate Rowers

Authors: Stephen Cadoux1, Kimberly Shaffer2

1Department of Clinical Psychology, Antioch University New England, Keene, NH, USA

2Department of Sport & Exercise Science, Barry University, Miami, FL, USA

 

Corresponding Author:

Kimberly Shaffer

[email protected]

Stephen Cadoux, MS, is a Clinical Psychology Doctoral student at Antioch University New England. His research interests focus on sports neuropsychology, leadership development, and neurocognitive effects of stress.

Kimberly Shaffer, Ph.D., CMPC is an Associate Professor and program director of the Sport, Exercise & Performance Psychology Program at Barry University. Kimberly’s areas of research interest include athlete identity, transition from sport & core values of performers. 

ABSTRACT 

Self-awareness is one of the most vital characteristics to effective leadership, yet it is a trait rarely measured within leaders. Without self-awareness, leaders place themselves in a position of weakness that can negatively impact their team’s performance. Using a Female NCAA Division II Rowing Team (n= 7), and their coaches (n=2) this study aimed to identify if captains of a collegiate athletic team are self-aware of their leadership abilities. The study was conducted via the Multifactor Leadership Questionnaire (MLQ) and a research technique known as the 360-degree feedback method. Teammates and coaches completed the MLQ about their team captain(s) leadership abilities. Meanwhile, the captain(s) rated their own leadership using the MLQ. Results from the three participant groups were compared to evaluate self-awareness (S-A) of the captain(s). S-A was determined if the Captain(s) self-reported scores are within the standard deviation of the scores of the Coaches and Teammates. Results suggested differences in the S-A of the two captains is, such that Captain X scores were found to be outside the SD of either the Coaches and/or the teammates in six of the twelve leadership subcategories, while Captain Y self-reported scores outside of their coaches and/or teammates SD on 4 different subcategories. The applied nature of this study is valuable for creating leadership programs within collegiate athletic departments and provides a quantitative model for assessing self-awareness in leadership.

Keywords: coaching, NCAA athletics, peer assessment

INTRODUCTION 

Leaders are critical to the functioning of any group, team, or organization. For teams to be successful, they require motivation, hard work, social and task cohesion, and swift decision making (1, 3, 8, 17). Over the past 60 years, there have been over 60 different leadership theories; each aiming to define leadership into distinct and unique concepts (8, 17). 

Presently, the most validated and widely used theory is the Transactional and Transformational Leadership Theory (TTLT) (3). The TTLT involves dividing leadership into two areas: Transactional leadership and Transformational leadership (3). Avolio and Bass modified TTLT to include Passive/ Avoidance behavior (1).

According to TTLT Avoidant/ Passive leaders are more passive and reactive. Avoidant leaders “avoid specifying agreements, clarifying expectations, and providing goals and standards” (1). Individuals with this style can negatively impact those around them and hurt the team’s overall performance. Within Avoidant behavior, are two sections: Management-by-Exception: Passive (MBEP) and Laissez-Faire (LF). Individuals who are high in MBEP wait until an issue arises before acting while leaders high in LF go one step further and fail to ever intervene in issues (1).

The second major category of leadership within the TTLT is Transactional leadership. Transactional leadership is based on exchanging rewards for goal completion, good performances, and desired behavior (3). These leaders clearly lay out the expectations they have for their subordinates, and they encourage their followers to perform to the best of their abilities (1). Transactional leadership is based on Contingent Reward (CR) and Management-by- Exception: Active (MBEA) (Avolio & Bass, 2004). Leaders’ high in CR offer rewards in exchange of one’s service; celebrating the accomplishments of their team and its members to reinforce positive behavior teams accomplishments Conversely, MBEA minded leaders focus on past failures, mistakes, and irregularities. These leaders set a specific standard that all members must meet and any deviation from this standard is confronted (1).

The third category of leadership in the TTLT is Transformational. Transformational leaders are viewed as the highest level of leaders (3). These individuals “connect with followers and appeal to their strengths in order to best challenge them to be more productive” (14, p. 62).

Avolio and Bass added “5 I’s” under the Transformational leadership (1). The 5 I’s are Idealized Attributes (IA), Idealized Behaviors (IB), Inspirational Motivation (IM), Intellectual Stimulation (IS), and Individual Consideration (IC; 1). Both IA and IB fall under the subset of Idealized Influence. Leaders with high Idealized Influence are leaders who consider others needs before their own and are people who others want to emulate (1). Inspirational Motivation (IM) leaders use their leadership to motivate those around them. Intellectual Stimulation (IS) leaders help fuel their follower’s intellectual mental efforts. They help their followers to be more innovative and creative as well as stimulate new ideas, thoughts, and solutions. Lastly, Individual Consideration (IC) leaders focus on their groups need for achievement and growth. They accomplish this by acting as a peer mentor and coaching figure to those around them (1).

The increase in leadership research has been primarily led by Industrial-Organizational psychology (I/O), focusing on improving for-profit businesses, personnel, and staff (5). In contrast, the field of sports has not received comparable levels of research attention or financial investment (16). This disparity has created several gaps in sport leadership research, particularly within the sub-category of leader self-awareness.

Self-awareness is arguably the most important aspect of leadership (9). Despite extensive leadership research in sports, self-awareness is rarely measured (7). Most leaders are not self-aware of their own abilities or talents (7). Without self-awareness, captains are at a disadvantage when it comes to leading their teams to victory. With the amount of money, time, and energy put into these sports teams, captains cannot have large flaws in their leadership.

While there are many ways to measure self-awareness, the 360 Degree Feedback method is not as widely used as it requires more empirical evidence. The 360 Degree Feedback method was designed for the use of providing business managers and executives more accurate feedback on their performance (5). This method involves having the leader (ratee) score their abilities on a survey or questionnaire. The organization then has several staff, peers, and supervisors anonymously complete that same survey about the ratee. This provides the organization with not only how the leader views themselves, but also how the rest of the organization and team view them. The organization can then provide the leader with structured feedback. Using 360-degree feedback has been found to provide more accurate feedback, enhance self-awareness, and can increase self-perceptions in individuals (4).

While the 360 Degree Feedback model is being utilized within the business world, the use of this method has also branched into other academic areas, including sport psychology.  Consultant groups, such as Amplos, have applied the method to identify development within coaches and athletes at various Power 5 athletic institutions (15). Although the method has proven successful in applied settings, it lacks validity in the scientific community and needs empirical evidence to further support its success.

The purpose of the proposed study is to use the Multifactor Leadership Questionnaire (MLQ; 1), and the 360 Degree Feedback Method (6) to identify if collegiate team captains are self-aware (S-A) of their leadership abilities. This study explored three hypotheses: (1)Captains would rate themselves as having higher Transformational and Transactional Leadership as compared to the scores of the coaches and teammates. (2) Captains would rate themselves as having lower Avoidant Leadership as compared to the scores of the coaches and teammates. (3) Captains would have an inverse relationship between the scores of MBEA and MBEP.

METHODS 

Participants

Participants consisted of both male (n=1) and females (n=8) involved in a NCAA Division II rowing team located in South Florida. Ages varied within the three participant categories as both young collegiate athletes and older coaches participated in this study. The Coaches (n=2) had a mean age of 33.50 (SD= ±12.02), the Captains (n=2) had a mean age of 21.50 (SD= ±2.12), and the Teammates (n=5) had a mean age of 21.60 (SD= ±2.30). The Teammate group consisted of 5 participants; however, each Captain rated the other captain and were thus included in the “Teammate” participant group during data collection. With the captains included in the Teammate participant group, the Teammates (n=7) had a mean age of 21.14 (SD= ±2.03).

Procedures

The study began with participant recruitment. Recruitment was conducted via email. Upon recruitment of the rowing team, individual athletes, captains, and coaches were recruited as well. Once recruitment had completed, the study was conducted virtually via an online video call explanation session in which participants received all directions verbally. The PI gave a brief explanation of the purpose of the study, following initial instructions, the PI explained the directions for the consent form, the demographic questionnaire and the MLQ questionnaire (all of which were provided via an online Qualtrics survey link). Participants were instructed to complete one MLQ questionnaire form for each of their participating team captains. After completion of the study, participants were thanked for their time.

 Instruments

Demographic Questionnaire

Demographic questionnaires were created by the PI and were administered to all study participants. Each participant group had its own distinct demographic questionnaire. These questionnaires were used to gather additional data about the participants that the MLQ does not specifically ask for. This data included both personal and athletic information.

Multifactor Leadership Questionnaire

The shortened version of the Multifactor Leadership Questionnaire (MLQ) was used (11). This 45-item self-reporting questionnaire is designed to assess an individual’s leadership abilities, leadership style, and the outcomes of their leadership (11).

The MLQ measures leadership by dividing the subject into three categories: Transactional Leadership, Transformational Leadership, and Passive/ Avoidant Leadership Within these three categories, the MLQ measures these styles using twelve subcategories. Transactional Leadership is divided into CR and MBEA (11). Transformational Leadership is made up of IA, IB, IM, IS, and IC (1). Passive/ Avoidant Leadership is divided into MBEP and LF (1). The last area that the MLQ measures is the outcomes of leadership; this is separated into Extra Effort (EE), Effectiveness (EFF), and Satisfaction (SAT). The MLQ uses a five point-Likert scale ranging from zero (Not at all) to four (Frequently, if not always). The questionnaire’s Cronbach’s coefficient alphas range from 0.63 to 0.92 with an internal consistency above 0.70.

Data Analyses

All data was analyzed using the IBM SPSS Statistics program. A descriptive analysis was conducted to find the means and standard deviations of the self-reported scores. S-A is determined if the captain’s self-reported scores are within the standard deviation of the scores collected from their Coaches and Teammates (1, 11).

RESULTS

Captains
The two captains tested in this study will be labelled as “Captain X” and “Captain Y”. Captain X is an American citizen who has been rowing for 10 years. She has been Captain of her team for 1 year and was also the Captain of her High School rowing team. She believes that her team is highly successful and believes that she has directly influenced the performances of her team. She also describes herself as self-aware of her abilities. Captain Y is an international student studying in the United States. Captain Y has been rowing for only two years, not having rowed in high school. Captain Y also believes her team is highly successful and her leadership abilities directly influence the team’s overall results. She also describes herself as self-aware of her leadership abilities.


Coaches
The coaching staff consisted of a male, American head coach with 12 years of coaching experience and a female, Eastern European assistant coach with four years’ experience. Both Coaches have Coached Captain X for three years and Captain Y for two years. Both Coaches also believe that their team is having a successful season and that their Team Captains are a direct result of that success.


Captain X
As seen below in table 1, Captain X’s self-reported scores were found to be outside the SD range of the scores of their Coaches and/or Teammates in six of twelve leadership subcategories. The first is IM. Captain X (m=4, ±0) self-reported themselves as higher than the scores of the teammates (m=3.30, ±0.48), while the Coaches (m=3.12, ±1.24) rated Captain X between the two groups. Within Intellectual Stimulation, Captain X (m=3.75, ±0) rated themselves higher than both the Coaches (m=2.87, ±0.53) the Teammates (m=3.30, ±0.44). In CR, Captain X (m=3.50, ±0) rated themselves as higher than the Coaches (m=2.25, ±0) while their teammates (m=3.05, ±0.51) scored between them. In MBEA, Captain X (m=2.25, ±0) ranked themselves as higher than the Coaches (m=1.87, ±0.17) but were not outside the scores provided by the Teammates (m=1.65, ±1.16). In EE, Captain X (m=4.00, ±0) scored higher than the rankings of the Teammates (m=3.13, ±0.69) while the Coaches (m=3.16, ±1.17) scored between both of the groups. The last category is EFF, where Captain X (m=4.00, ±0) rated themself higher than the SD of the Teammates (m=3.30, ±0.48). The Teammates scores were not outside the SD range of the Coaches (m=3.37, ±0.88).

Table 1

Mean scores and Standard Deviation’s for Captain X’s MLQ 360-Degree Feedback Test

 IA (SD)IB (SD)IM* (SD)IS* (SD)IC (SD)CR* (SD)MBEA* (SD)MBEP (SD)LF (SD)EE* (SD)EFF* (SD)SAT (SD)
Captain X3.50 (0)3.50 (0)4.00 (0)3.75 (0)2.75 (0)3.50 (0)2.25 (0)1.00 (0)0.25 (0)4.00 (0)4.00 (0)4.00 (0)
Coaches (n=2)3.12 (0.88)3.37 (0.88)3.12 (1.24)2.87 (0.53)2.75 (0.70)2.25 (0)1.87 (0.17)1.25 (1.76)1.00 (1.41)3.16 (1.17)3.37 (0.88)3.25 (1.06)
Teammates (n=6)3.35 (0.57)3.50 (0.46)3.30 (0.48)3.30 (0.44)3.30 (0.77)3.05 (0.51)1.65 (1.16)1.08 (0.61)0.60 (0.57)3.13 (0.69)3.30 (0.48)3.40 (0.65)
Note: *Captains scores are outside the SD for one or both groups

Table 2

Mean scores and Standard Deviation’s for Captain Y’s MLQ 360-Degree Feedback Test

 
 IA (SD)IB* (SD)IM (SD)IS (SD)IC (SD)CR (SD)MBEA* (SD)MBEP* (SD)LF (SD)EE (SD)EFF (SD)SAT* (SD)
Captain Y2.75 (0)4.00 (0)3.50 (0)3.00 (0)3.00 (0)2.75 (0)2.25 (0)0.25 (0)0.75 (0)3.00 (0)3.25 (0)4.00 (0)
Coaches (n=2)3.25 (0.70)3.37 (0.53)3.50 (0.70)3.12 (0.17)3.12 (0.17)3.25 (0.70)2.87 (0.53)0.75 (1.06)0.50 (0.70)3.50 (0.70)3.50 (0.70)3.00 (2.00)
Teammates (n=6)2.91 (0.54)3.33 (0.30)3.08 (0.78)2.70 (0.96)3.33 (0.46)3.04 (0.88)2.54 (1.30)1.00 (0.61)0.62 (0.41)3.27 (0.57)3.33 (0.43)3.08 (0.37)
Note: *Captains scores are outside the SD for one or both groups

Figure 1

Captain X 360-Degree Feedback Data

Figure 2

Captain Y 360-Degree Feedback Data

Captain Y

As seen in Table 2, Captain Y’s self-reported scores are outside the SD range of the reported scores of the Coaches and/or Teammates in only four of twelve leadership subcategories. The first is IB. Captain Y (m=4, ±0) rated themselves higher than both their Teammates (m=3.33, ±0.30) and Coaches (m=3.37, ±0.53). In MBEA, Captain Y (m=2.25, ±0) rated themselves below the SD of the Coaches (m=2.87, ±0.53). Another category of difference is MBEP. Captain Y (m=0.25, ±0) rated themselves lower than the SD of the teammates (m=1.00, ±0.61). Neither group’s scores were outside the SD provided by the Coaches (m=0.75, ±1.06). The last difference is in the subcategory of SAT. Captain Y (m=4.00, ±0) self-reported scores higher than the SD of both the Coaches (m=3.00, ±0) and Teammates (m=3.08, ±0.37).

DISCUSSION

The collected data suggests Captain Y and Captain X differ in their leadership strengths and level of S-A. Captain X scores were found to be outside the SD of either the Coaches and/or the teammates in six of the twelve leadership subcategories, while Captain Y self-reported scores outside of their coaches and/or teammates SD on 4 different subcategories. Captain X’s scores were outside the SD of both the Coaches and Teammates for only one subcategory, Leadership. While Captain Y had two subcategories, Idealized Behavior and Satisfaction, that were outside the SD range of both the Teammates and Coaches scores.

Most interesting is the evaluation of SD of scores. The SD for several Coach and Teammate scores varied greatly. An example of this wide-ranging SD can be found on Table 1 with the Coaches having a SD of 1.76 (m=1.25) on MBEP and on Table 2 with the Teammates having a SD of 1.30 (m= 2.53) on MBEA. These wide-ranging SD display a divide in the perspective the Coaches and Teammates have on the Captains. Captain X and Y scored different than the mean scores both the Coaches and Teammates in almost all of the Leadership subcategories. However, the large SDs kept the Captains within the range to be labeled “self-aware” according to Avolio and Bass (1). These large SDs argue neither the Coaches or Teammates were unified in their beliefs of the Captains. Some participants within their groups believed that their captains were excellent leaders who provided crucial support to their team. While some participants saw their captains as less effective and, sometimes, borderline detrimental to their teams. It furthers interest that the Coaches, with a group size of 2, were also divided on their Captains in several categories. While the data suggests that these Captains are self-aware of their leadership, this self-awareness does not come without scrutiny. This can be best seen in Figures 1 and 2.

Another interesting point is within Captain X and Y’s belief in the Outcomes of their Leadership. Represented in the MLQ as EE, EFF, and SAT, Captain X rated herself as a “4” for all three categories, while Captain Y rated herself as the following: 3 (EE), 3.25 (EFF), and 4 (SAT). While Captain X has stronger belief that their leadership causes more positive outcomes for their team than Captain Y, they each rated themselves as a “4” in satisfaction. Meaning, they each believe their Teammates and Coaches are satisfied with their leadership abilities. However, this cannot be the case due to the wide-ranging SD’s found in many subcategories. It can be inferred, even without major differences from both their Teammates and Coaches in the SAT category, Captains may be incorrect about their teammate’s opinions of their leadership. They believe their team celebrate their leadership, while there is not a unified belief on their abilities. In addition, a high level of perceived satisfaction may inhibit captains’ motivation to grow or further develop their leadership abilities, as they may mistakenly believe their current performance is sufficient. This tendency aligns with patterns of social loafing, where individuals reduce effort or avoid self-improvement when they perceive their contributions as adequate and unchallenged (2, 10).

While the MLQ does not label the leadership style of Captains, it does infer trends and likelihoods. Within the scores collected, Captain X views themselves as a Transformational leader who directly, and positively, influences their teams’ performances. While Captain Y does not fit directly into Transformational, Transactional, or Avoidant Leadership. Captain Y rated herself as an amalgamation of both transformational and transactional leadership styles, specializing in having a strong moral code who may occasionally act as a parental figure to many of their teammates (IB).

As stated previously, this study had three hypotheses. The first hypothesis was that the Captains would rate themselves as having higher Transformational and Transactional Leadership when compared to the scores of the Coaches and Teammates. This hypothesis was not true with either Captains. The second hypothesis was the Captains would rate themselves as having lower Avoidant Leadership when compared to the scores of the Coaches and Teammates. This hypothesis was true only for Captain Y. The last hypothesis was that Captains will have an inverse relationship between the scores of MBEA and MBEP. This was found to be true in both Captains.

Limitations & Future Directions

While this study had several strengths, the main being the first empirical test of the 360 Feedback method, it of course is not without weakness. The first being a small sample size. While the MLQ does not give a specific sample size to use to make it effective, merely using one team (n=9) is small nonetheless. Future studies of this nature should look to include various teams from different sport types, genders, age and experience levels. To ensure validity, the items of the MLQ were not re-worded for each distinct participant group. All items of the MLQ were phrased “I am…”. While the items were worded correctly for the captains, all coaches and teammates had to reword the items in their heads as they were not responding to these questions about themselves. Furthermore, the MLQ is not a sport specific questionnaire. While it is a statistically valid and reliable questionnaire, it was designed to be used with a general population base. It was not specifically designed for athletes.

Other limitations to consider, are the social pressures of collegiate teammates. Despite the confidential and anonymous nature of the study, teammates may have felt unconscious pressure to identify their captains as having higher levels of positive leadership to avoid drama, feelings of guilt, or confrontations from the team (2).

Outside of adjustments to sample size, and inclusion of a sport specific questionnaire, future research should include a qualitative component to capture nuances of leadership, as well as a debriefing session with both coaches and captains. This level of transparency about how the captain is doing in the coaches and teammates eyes could provide a mechanism for change and promote open dialogue between all parties.

Lastly, the population used in this study were proficient in the English language, it was not their first language. With many international students and coaches used in this study, it is unknown if there were any difficulties understanding, reading, or comprehending the items they were tasked with completing.

CONCLUSION 

This study provides an empirical look at leadership and perceptions of different stakeholders about how team captain’s lead. Ultimately, one of the biggest takeaways is the large variance in opinions about the captains. Not just the difference in perception from the captains themselves to the ratings of the athletes and coaches, but the differences of how each individual teammate viewed the ability of the captain.  While the goal was to analyze the self-awareness of collegiate sport captains, the take home was more centered around the unique perception and individual nature to each athlete of what makes a great leader. This is supported in various studies regarding the notion that there is no one-size-fits-all approach to leadership (9, 12, 13, 17) Simply because an individual is elected, or selected, as a captain, that does not automatically make them an excellent leader and unanimously beloved.

APPLICATIONS IN SPORT

Applied implications of this study are vast within the realms of research and consulting practices. First, it provides a framework for future 360-Degree Feedback Method studies to take place. As previously stated, this method of research is underutilized in the realm of Sport Psychology research. Additionally, the data collected from this study may be used to update leadership education programs, creating importance for Self-Awareness training and identification within students, athletes, and leaders. Use of this data can also be used to stress the importance of team building and team cohesion. This study’s data found that the team’s coaches and teammates had dramatically different opinions on the leadership of their captains. This dramatic difference within the groups can be harmful to a team’s cohesion and performance, stressing the importance of this research study.

REFERENCES 

  1. Avolio, B. J., & Bass, B. M. (2004). Multifactor leadership questionnaire. Mind Garden.
  2. Bratton, V. K., Dodd, N. G., & Brown, F. W. (2011). The impact of emotional intelligence on accuracy of self‐awareness and leadership performance. Leadership & Organization Development Journal, 32(2), 127–149. https://doi.org/10.1108/01437731111112971
  3. Burns, J. M. (1978). Leadership. Harper & Row.
  4. Carlson, M. S. (1998). 360-degree feedback: The power of multiple perspectives. Popular Government, 63(2), 38–49.
  5. Carson, M. (2006). Saying it like it isn’t: The pros and cons of 360-degree feedback. Business Horizons, 49(5), 395–402. https://doi.org/10.1016/j.bushor.2006.01.004
  6. Drew, G. (2009). A “360” degree view for individual leadership development. Journal of Management Development, 26(7), 581–592. https://doi.org/10.1108/02621710910972698
  7. Eurich, T. (2017, September). Increase your self-awareness with one simple fix [Video]. TEDxMileHigh. https://www.ted.com/talks/tasha_eurich_increase_your_self_awareness_with_one_simple_fix
  8. Fleishman, E. A., Mumford, M. D., Zaccaro, S. J., Levin, K. Y., Korotkin, A. L., & Hein, M. B. (1991). Taxonomic efforts in the description of leader behavior: A synthesis and functional interpretation. The Leadership Quarterly, 2(4), 245–287. https://doi.org/10.1016/1048-9843(91)90016-U
  9. George, B., Sims, P., McLean, A. N., & Mayer, D. (2007). Discovering your authentic leadership. Harvard Business Review, 85(2), 1–8.
  10. Ghaleb, B. (2024). Social loafing: Understanding, mitigating, and enhancing group performance. International Journal of Scientific Multidisciplinary Research, 2(9), 1321-1328. https://doi.org/10.55927/ijsmr.v2i9.10975
  11. Muenjohn, N., & Armstrong, A. (2008). Evaluating the structural validity of the Multifactor Leadership Questionnaire (MLQ), capturing the leadership factors of transformational-transactional leadership. Contemporary Management Research, 4(1), 3–14. https://doi.org/10.7903/cmr.704
  12. Northouse, P. G. (2016). Leadership: Theory and practice (7th ed.). SAGE Publications.
  13. Pienaar, J., & Nel, P. (2017). A conceptual framework for understanding leader self-schemas and the influence of those self-schemas on the integration of feedback. SA Journal of Human Resource Management, 15, 1–11. https://doi.org/10.4102/sajhrm.v15i0.772
  14. Robbins, J. E., & Madrigal, L. (2017). Sport, exercise, and performance psychology: Bridging theory and application. Springer Publishing Company.
  15. Shaffer, J. (2018). 360 review: Self, teammate, and coach evaluation for personal development. Synergy Performance: A Division of Synergy Group.
  16. Wagstaff, C. R. D., Fletcher, D., & Hanton, S. (2012). Positive organizational psychology in sport: An ethnography of organizational functioning in a national sport organization. Journal of Applied Sport Psychology, 24(1), 26-47. https://doi.org/10.1080/10413200.2011.589423
  17. Warrick, D. (2011). The urgent need for skilled transformational leaders: Integrating transformational leadership and organization development. Journal of Leadership, Accountability, and Ethics, 8(5), 11–26.
2026-04-15T11:28:29-05:00May 6th, 2026|General, Sport Education, Sports Coaching, Sports Studies|Comments Off on Identifying Self-Awareness of Leadership Abilities Using 360 Degree Feedback Method: A Case Study of Collegiate Rowers

Over-promised, under-delivered: Does position in the National Football League draft matter?

Authors: Dennis M. Shaffer1 and Ryanne E. Shaffer

1Department of Psychology, The Ohio State University Mansfield, Mansfield, Ohio, USA

 

Corresponding Author:

Dennis M. Shaffer, PhD

1760 University Drive

Mansfield, OH 44906

[email protected]

Dennis M. Shaffer, PhD, is a Full Professor Psychology at The Ohio State University in Mansfield, Ohio. His research interests focus on how athletes use visual information to pursue and induce collisions with targets in the environment in domains such as Frisbee catching, American football, and baseball, and how cognition and systems of perception and action interact.

Ryanne E. Shaffer is currently a senior at Twinsburg High School in Twinsburg, Ohio.

ABSTRACT 

Purpose. We investigated whether players drafted higher in the National Football League (NFL) over a ten-year period performed better in their first four years in the league, consistent with the trade value charts and rookie wage scale the NFL uses to value players. The purpose was to see whether how the NFL intuits draft values is connected to player performance.

Methods. In Study 1, we collected draft position data for each of the seven rounds of the draft over a ten-year period as well as the values for each of two different trade charts and the salaries in the rookie wage scale. We then coded data by round, third of round (top, middle, bottom), years in the league, and Pro Football Focus (PFF) grades.

Results. We found no correlation between performance and the way the NFL values draft positions and no difference in player performance and years in the league between draft positions in rounds 4 and 5. There were also no differences in player performance or years played in the among top, middle, or bottom thirds of rounds. We also found a distinct advantage in player performance for teams trading down for draft picks compared to those trading up for draft picks, contrary to the way the NFL values draft positions.

Conclusions. Our work shows several player performance-based results that contradict well-established beliefs concerning the value of draft picks in the NFL.

Applications in Sport. Trade values and rookie wage salaries are used as baselines by the NFL. The importance of drafting better players higher in the draft order have important implications for greater success for teams, executives, and players. Our work may inform strategies that might be best to use in drafting prospective players in the National Football League.

Key Words: NFL draft, trade value, intuitive beliefs, player performance.

INTRODUCTION 

The work here tested whether draft position predicts player performance once they are drafted into the National Football League (NFL). The NFL draft is set up so the team with the worst regular season record picks first, followed by the team with the second worst regular season record picking second, and so on. Picking first, though it means you finished with the worst record in the league the previous year, is an enviable position to be in at draft time as you have your pick of ~250-~275 players. The NFL draft consists of seven rounds(since 1994) of draft picks where, at least originally, every team has one pick in every round. Prevailing wisdom in this field and even if you are picking teams for any game whether athletic or not is that higher picks should be valued more than lower picks and that over time the data should bear this out.

The intuitive beliefs that the NFL, individual teams, and executives have about draft order or draft pick position can be measured in two ways. First, this may be measured by what are called ‘trade value charts,’ that define values for each one of the draft picks (3). There are a few different types of trade value charts, but most teams follow one of these versions if they want to trade draft picks with any other team. The classic version of a trade value chart is the Jimmy Johnson (JJ) chart. A more recent chart is the Rich Hill (RH) trade value chart. These charts basically provide teams with a framework or baseline from which to trade draft picks (14). The trade value charts are similar in several ways—(1) values increase exponentially in the first round from about pick 4 to pick 1 and (2) values for picks decrease for each subsequent pick. For instance, Pick #10 in the JJ chart this year was given a value that was 46.3% of Pick#1 (RH chart = 36.9%); Pick #20 was given a value that was 28.33% of Pick #1 (RH chart = 26.9%), and Pick #30 was given a value that was 20.67% of Pick #1 (RH chart = 19.6%). Consistent with this, the ratio of values in the top third : middle third of the first round is 1.824 in the JJ chart, (RH chart = 2); middle third : bottom third is ~1.49 (RH = 1.55); top third : bottom third is ~2.71 (RH = 3.13), and bottom third of 1st : top third of 2nd is 1.3 (RH = 1.5).

The second way the NFL’s intuitive beliefs about draft order may be measured is by the ‘rookie wage scale’ put forth by the player’s labor union and the NFL in 2011, which defines the parameters for what every drafted player will earn in his first four years in the league (3). For instance, this year, the number one overall draft pick will earn $48,757,500 in total value over his first 4 years; the number 10 pick will earn a little less than 55% of that (a difference of over $22M over the first four years); the number 20 pick will earn about 37% of that (a difference of almost $30M), while the number 30 pick will earn about 31.25% of that (a difference of ~$33.5M). While the percentage and salary difference is less in subsequent rounds among those picks (picks 1, 10, 20, and 30 in rounds 2-7), the importance of drafting better players higher in the draft order have important implications for building the best team, paying players the proper amount for their performance, the amount of money that is charged to a team’s salary cap, and the livelihood of the NFL executives who have a hand in drafting these players.

Previous work has investigated several avenues regarding characteristics that affect draft value, that are related to performance once in the NFL (5, 9, 15, 17, 20). The results of some of this work show how draft value does very little to affect probabilities of teams making the playoffs (9, 15), Other work has shown that college performance is a better predictor of performance once in the NFL than tests measuring physical ability (11, 20-21). While other work has shown that total yards gained by running backs in college and overall speed has been shown to be a primary predictor of both draft status and higher salaries once in the NFL over tests of physical ability and combine tests (6, 11, 16-17, 20). Additionally, predicting success based on results of athletic testing including the NFL Combine can yield complicated and somewhat mixed results (16).

The primary focus of this paper was to investigate whether what teams intuit of draft value based on grades in these trade value charts and rookie wage scales matches actual performance data for the players chosen in those spots. More specifically, our primary investigative foci in Study 1 were to: (1) analyze whether differences in player value (as given by trade value charts and the rookie wage scale) from pick-to-subsequent-pick were correlated with differences in player performance from pick-to-subsequent-pick, (2) analyze whether player performance, as measured by Pro Football Focus (PFF) grades and years spent in the league, was different among and within rounds (14), (3) analyze whether PFF grades and years in the league were different among thirds of rounds across and within rounds, (4) and analyze how the NFL valuation and PFF grades for last twelve picks of the first round compared to the first twelve picks of the second round.

STUDY 1

METHODS

Data Sets

We first used the Pro Football Reference site (20) to gather and download data for every player drafted from 2011-2020. We then used the Pro Football Focus (14) site to gather the overall season grades for each player across their first four years in the NFL. This resulted in 2,544 drafted players across 10 years. This study was approved by The Ohio State University Behavioral and Social Sciences Institutional Review Board (Study Number: 2023B0282).

Procedure

Evaluating a Player’s First Four Years in the NFL

Since we were interested in evaluating the success of teams in drafting, we evaluated player performance over the player’s first four years. This is because four years is the length of all rookie (1st year player) contracts. Additionally, the first four years provides a very good indicator of what the teams think the player can do for their team in terms of performance.

Understanding the Pro Football Focus Grading System

PFF analyzes every player on every snap, with each play receiving a grade on a scale from -2 to +2. A score of 0 represents an average or the expected execution of the player’s responsibilities, while a +2 denotes an outstanding play and a -2 indicates a critical error. These assessments are adjusted for factors such as difficulty of assignment and game context. PFF’s system includes tracking over 200 data points per play using the All-22 coaches’ film, including such aspects as player alignment, assignment, and outcome of the play from every aspect of the field (1, 14). PFF then converts these evaluations into a normalized score on a 0–100 scale.

Calculation and Coding of PFF Grades and Years in the League

While PFF normalizes plays to values ranging from 0-100, the overall grades across an entire season of plays are far more restricted, ranging from ~high 40’s-low 90’s (for the requirement of at least 10 games played per season as described below). For every player, we calculated a mean for their overall PFF grade across their first four years after being drafted. For players with a missing grade, we found which of the four years there was a missing grade for and why. If the player was injured and missed the entire year (for any year), we did not count that year for their average and averaged across their other years. For players at most positions, we used the offense or defense overall grade for the given year. Only for punters and kickers did we use the special team grade. Our threshold for counting a PFF grade for the year, was at least ten games played. Additionally, if the drafting team waived the player they drafted, we assigned that player a value of 35, as that is below the lowest grade anyone on a team who played earned across an entire year of play (with a minimum of 10 games played). If they played on a team after they were waived, we filled in the four years with the grade(s) they earned in the remaining year(s) on the subsequent team. We wanted to penalize the drafting team, but we also did not want to assign a 0 as waiving the player was an act but does not represent their PFF grade over an entire year. Additionally, this happened far less often in earlier rounds and since we were calculating means, we did not want these outliers to dramatically influence the results. We assigned a value of 45 for a player who was on an NFL roster, but not active for the minimum number of games (or did not have enough snaps to be graded by PFF). This is a lower grade than any player we graded who played during the season for at least ten games and gave us a baseline for someone who is good enough to be on the team but may not be good enough/needed to dress on game day(s). We did not gather PFF data for rounds 6 and 7 as fewer players in these rounds were active for enough games (i.e., played enough snaps) for which PFF could assign grades.

Finally, we also analyzed the number of years players were in the league. Again, in the interests of evaluating how well teams draft, we were really focused on years in the league of these players over their first four to five years. Therefore, we coded years played in the league in categories of less than two years, two to three years, four years, five years, and more than five years, and then analyzed this coded data.

Availability of Data and Material

Data may be accessed at: https://osf.io/pf5hq/?view_only=28f7350c720f430b92270c76e5b48080

RESULTS

We performed Bayesian analyses throughout the Results sections for all experiments to properly identify and balance the same evidence in favor of as we did evidence opposed to differences, in line with the recommendations of both Dienes (4) and Kruschke (7). The primary independent variables were draft round and position within the round (top, middle, or bottom third), while the primary dependent variables were years in the league and PFF mean overall grade for players’ first four years in the league. We outline each set of analyses below.

Testing for Correlations Between Differences in PFF Grades and Differences in Jimmy Johnson and Rich Hill Trade Chart Values and Rookie Salaries for Each Subsequent Pick

 If players drafted with picks 1-10 are better than players drafted with picks 10-20, and so on, then both the difference in trade chart values and rookie wage scale salaries from pick #1 to pick #2 and pick #2 to #3 and so on through the first five rounds of the draft should be highly correlated with PFF grades. Bayesian correlational analyses showed substantial to strong evidence that there was close to zero correlation between PFF grades and trade chart values, RH trade value chart: Bayes Factor in favor of the null hypothesis (BF01)= 5.594, r = .086, JJ trade value chart: BF01= 10.485, r = .014, and PFF grades and rookie wage scale salaries: BF01= 9.047, r = .043. The Bayes factors may be interpreted that it is 5.594, 10.485, and 9.047 times as likely that there is no correlation between PFF grades and RH trade chart values, JJ trade chart values, and rookie wage scale salaries, respectively, than there is a correlation (12, 24). Values of BF01 or BF10 of 0-1 = no evidence, 1-3 = anecdotal evidence, 3-10 = substantial evidence, 10-30 = strong evidence, 30-100 = very strong evidence, and >100 = decisive evidence in favor of whatever hypothesis is being tested (null (BF01) or alternative (BF10) (12, 24).

Analyzing How the NFL Values Draft Positions Based on the Rookie Wage Scale

We first established how the NFL values draft position across rounds and thirds of rounds. We used the rookie wage scale salaries for the first five rounds of the draft (the same rounds for which we calculated PFF grades for players—picks 1-165). Bayesian analyses showed decisive evidence of differences in salaries across rounds, BF10= 2.806 x 10+37, F(4, 150) = 448.83, p < .001, h2 = 0.71, Cauchy Prior with a scale of .707. Post hoc tests also indicated decisive evidence for differences among all rounds. Bayesian analysis showed substantial evidence of differences in salaries across thirds of rounds, BF10= 4.99, F(2, 150) = 88.19, p < .001, h2 = 0.07, Cauchy Prior with a scale of .707. Post hoc tests confirmed between anecdotal to substantial evidence for differences among all thirds of rounds (top, middle, and bottom). Virtually identical results were found when performing these same analyses using each trade value chart in lieu of the rookie wage scale.

Analyzing Differences in Rounds for Coded Years in League and PFF Overall Mean Grade

 Coded Years in League

 A Bayesian one-way ANOVA analyzing whether there were differences in years played in the league showed that there were: BF10= 2.806 x 10+111 (decisive evidence), > Test value F(6, 2536) = 101.93, p < .001, h2 = 0.19, Cauchy Prior with a scale of .707. Post hoc tests indicated that there was moderate to strong evidence that players drafted in round 1 remained in the league somewhat longer than players drafted in round 2, BF10= 4.312 (moderate to substantial). Players in almost all subsequent rounds remained in the league for less time than the previous round. One exception was that there was no difference in years played in the league between rounds 4 and 5, BF01 = 3.83 in favor of no difference, indicating moderate to substantial evidence in favor of no difference in years played in the league for 4th and 5th round draft picks.

 PFF Overall Mean Grade

A Bayesian one-way ANOVA analyzing whether there were differences in PFF overall mean grade showed that there were: BF10= 2.673 x 10+59 (decisive evidence), > Test value F(6, 2536) = 80.95, p < .001, h2 = 0.16, Cauchy Prior with a scale of .707. Additionally, again almost all post hoc test BF10 evidence showed decisive evidence for differences among all five rounds with values ranging from BF10 = 188.192 to 2.724 x 10+38. The one exception was that there was no difference in PFF overall mean grade between rounds 4 and 5, BF01 = 4.51 in favor of no difference, indicating moderate to substantial evidence in favor of no difference. Figure 1 shows a graph of pick position (x-axis) by PFF grade (y-axis) for picks across all ten years.

Figure 1.

Shown is a plot of the pick number by overall mean PFF grade for the first 4 years. Each symbol represents the average PFF grade across 10 years for a particular position in the draft (picks1-179).

Differences in Thirds of Rounds for Coded Years in League and PFF Overall Mean Grade

A Bayesian one-way ANOVA analyzing whether there were differences in years played in the league in terms of whether a player was chosen at the top, in the middle, or at the bottom third of the round showed decisive evidence that there is no difference: BF01= 20.998, Cauchy Prior with a scale of .707.

PFF Overall Mean Grade

A Bayesian one-way ANOVA analyzing whether there were differences in PFF overall mean grade showed decisive evidence that there also is no difference: BF01= 30.292, Cauchy Prior with a scale of .707.

Differences Among Top, Middle, and Bottom of Rounds for Coded Years in League and PFF Overall Mean Grade Round-by-Round

While our previous analyses show no differences among player longevity and PFF overall mean grade across the top, middle, and bottom of rounds, it could be that any potential differences were washed out by combining rounds for the analysis. For instance, later rounds that have lesser talented players overall may see no differences, or have more talented players at the middle and bottom of rounds than the tops of rounds, whereas earlier rounds may have more talented players at the tops of rounds than at the middle and bottom of rounds. These effects or patterns may cancel each other out by combining all rounds. Therefore, we again tested for differences among the top, middle, and bottom third of rounds, but this time did so within each round. Table 1 shows the results of these analyses.

Table 1.

Shown are Bayes factor in favor of the null hypothesis (BF01) for one-way ANOVAs testing for differences in coded years in the league and PFF overall mean grades for draft picks in rounds 1-7 (for years) and round 1-5 (for PFF grades) from 2011-2020. Values of BF01 of 1-3 = anecdotal evidence, 3-10 = substantial evidence, 10-30 = strong evidence, 30-100 = very strong evidence, and >100 = decisive evidence in favor of the null hypothesis) (12, 23).

Round 1Round 2Round 3Round 4Round 5Round 6Round 7
Years in League2.2373.2847.1034.856.1791.28510.313
PFF Grades8.54318.6256.6032.5219.475  

Predicting PFF Grades from Trade Value Charts and Evaluating Whether Trading Up into the First Round from the Second Round Warranted

Many times, draft experts will argue that some teams may “trade up” into the bottom third of the first round—that is, the last ~twelve to fifteen (~picks 18-32 or so)–from the top of the second round in order to draft a second player for whom they will have the 5th year option (1st round pick). Many teams have done this in the past. In fact, according to the values themselves, the NFL views the bottom 10-15 picks in round 1 as over 7008 times greater in value than the top 10-15 picks in round 2 , BF10= 7008.114, Cauchy Prior with a scale of .707). When we analyzed PFF grades across ten years for the “bottom third” of the first round (12 picks—21-32) and compared them to PFF grades across ten years for the “top third” of the second round (12 picks—33-44), a Bayesian one-way ANOVA showed substantial evidence that there is no difference: BF01= 6.854, Cauchy Prior with a scale of .707 (MBottom10of1st = 63.835; SDBottom10of1st = 10.642, MTop10of2nd= 64.387, SDTop10of2nd = 9.975).

STUDY 2

In Study 1 we found that the NFL values draft positions in the top of a round far more than those in the middle or bottom of a round and those in the middle of the round far more than those in the bottom of a round. However, there was substantial to decisive evidence of no differences in PFF grades across and within rounds, respectively, for top, middle, and bottom thirds of draft positions in rounds. We also found that there was no difference in PFF grades for draft positions in the top third of the 2nd round compared to the bottom third of the 1st round. This predicts that teams should not move up in a draft for players as the player performance will not be better for players drafted even 20-30 picks higher. When teams move up in the draft, they give up more draft capital for at least two reasons. First, while the values in the trade value chart might be even, typically the team moving back must give up more than one pick to do that in order to even out the trade value. Second, the team trading down must be incentivized in some ways to trade down. Sometimes, that team simply needs more players for the values to be even. Other times, the team trading down will ask for more as they are giving up an attractive draft pick. In Study 2, we sought to investigate whether the findings from the ten year period we investigated in Study 1 would predict the outcome of pick-for-pick trades in the 2021 NFL draft.

METHODS

We identified each of the draft pick-for-draft pick trades in the draft immediately after the last season for which we analyzed draft picks (the 2021 NFL draft). We evaluated each of the twenty-nine trades that are listed by the NFL that occurred during the 2021 draft (15). We did this because the 2021 draft was the first draft after the last year for which player performance data was collected so it allowed us to test whether our findings predicted future patterns and because it was the last year that would still allow us to analyze player performance in the first four years of the player’s career. We only looked at trades involving draft picks for draft picks.

Raters

We had two high school football players (MAge = 18 years, MExperiencePlayingCompetitiveFootball = 6 years) who have a strong knowledge of not only the workings of football but also a strong knowledge of the NFL, the NFL draft, and grading players.

Procedure     

We gave raters the series of trades with round(s), pick number(s), and PFF grades across years played listed for each pick. We removed the draft year and team and player names from the list. Raters were also blind to which part of the trade was the “trade up” and which part was the “trade down.” Our list consisted of Team A on one side and Team B on the other. We randomly assigned which team (“trade up” or “trade down”) was Team A and which team was Team B. We instructed raters on how PFF grades are set up and general cutoffs for what PFF grades are generally considered elite, good, above average, average, and poor. We also instructed them to decide which team “won” each trade. While they were instructed that they should use all the information available in making their decisions, they were told that the performance of the players (i.e. PFF grades) should be paramount in making their decisions.

Raters made their judgments independently from one another. They were initially seated in two different areas at the same time while they made their judgments. They then came together, compared their judgments, and went over the judgments that were different to see whether they could come to a consensus on the judgments that were different.

RESULTS

The raters initially agreed on 20/29 trades. After discussing the trades they had originally disagreed on, they came to a consensus on all 29. Of the 9 on which they initially had different answers, neither rater favored the “trade up” team; each of the 9 consisted of one rater deciding on the “trade down” team and the other deciding on “neither.” For 6/9 of the trades they eventually decided on “neither,” and in the other 3 trades they decided on “trade down.” Finally agreed-upon frequencies for each group were: Trade Up: 4, Trade Down: 19, and Neither: 6.

Since we were only interested in testing whether trading up resulted in better player performance, we were most interested in a comparison where we split the categories into the following two groups: Group 1: Trade Up and Group 2: Trade Down and Neither. We performed a Bayesian binomial test on frequencies of what the raters judged as “wins” in each category. Raters judged that there were significantly more wins in terms of better performing players for teams who traded down than for teams who traded up, BF10  = 753.471, Proportion = .832, Prior Distribution with α and β = 1. This is decisive evidence that trading down led to better performing players than trading up and may be interpreted that it is more than 753 times more likely that trading down led to better performing players than trading up. In a second analysis, we compared only trade downs versus trade ups and removed any trade that resulted in a judgement of “neither.” Raters again judged that there were significantly more wins in terms of better performing players for teams who traded down than for teams who traded up, BF10  = 39.472 , Proportion = .826, Prior Distribution with α and β = 1. This indicates very strong evidence that trading down led to better performing players than trading up. Therefore, even when solely comparing trade ups versus trade downs, it is still over thirty nine more times likely that trading down led to better performing players than trading up.

DISCUSSION 

The way the NFL values draft positions in terms of trade values and rookie salaries is not correlated at all with player performance consistent with previous work (14). While time spent in the league and overall PFF grade during their rookie contracts did, for the most part, gradually decline in subsequent rounds as trade value charts, fans, and NFL executives would all predict, this pattern was not straightforward. One large deviation from this gradual decline between rounds was between the  4th and 5th rounds, where there was moderate to substantial evidence in favor of no difference in both years played and overall PFF grades between those rounds, contradicting how the NFL values draft position. In fact, if we use trade values as a representation of people’s intuitions then we should expect far greater value out of 4th round picks than we do from 5th round picks, as a Bayesian independent-samples t-test analyzing trade values in round 4 versus round 5 found decisive evidence that round 4 values are significantly greater than round 5 values, BF10= 7.294 x 10+11, Cauchy Prior with a scale of .707. According to the values themselves, the NFL views round 4 picks as 7.294 x 10+11 greater in value than 5th round picks.

There were several other counterintuitive findings. First, there was decisive evidence of no difference among the top, middle, and bottom thirds of rounds across all rounds for both years in the league and PFF grades. Second, this evidence of no difference for both years in the league and PFF grades among the top, middle, and bottom thirds of rounds was a regularity for every round when evaluating each round individually. Third, we found that the performance of players taken in the top third of the second round was no different from the performance of players taken in the bottom third of the first round across all ten years, contradicting what the trade values tell us—that there is decisive evidence in favor of differences in those respective trade values.

When we looked at trades that occurred in 2021, we found that, in terms of quality of player(s), the large majority—over 82%–and significant number of trades did not favor the team who traded higher up in the draft where that pick had greater value. This result is consistent with what was found with the analyses finding no differences across thirds of rounds across and within rounds across our data. This has important implications for not only player performance, but also because it has been recently shown that teams getting the better end of trades increase their probability of making the playoffs (9).

There were limitations to this study. First, while PFF grades are used by NFL teams, NFL analytics sites, and content creators to assess player performance and are seen as the best tool for doing this, they are not perfect. However, while there may be an argument as to what goes into creating the absolute grades, we analyzed the grades relatively for players so the shortcomings of the grades themselves would apply to all players. A second limitation is that we used trade values and rookie wage salaries to as a measure of how general managers (GMs) of teams and the NFL as a whole assess player quality without directly asking them about how they value draft positions. However, there was a collective bargaining agreement between the NFL and player’s union that put in place the rookie wage scale in 2011 (3), and owners, GMs, and players all had their input into how this would be created. Additionally, every team in the NFL uses these trade values as the standard way to barter before, during, and after the draft. Thus, we feel this was a fair way to assess the way the NFL values draft positions.

Further research should be conducted to see how the player performance grades in the first four years connect with second contracts of players in their next 2-5 years. It may be that it takes certain players four years to blossom in the league. However, the average tenure of a player who makes the opening-day roster is ~6 years (about half of that if you include drafted players who do not make the opening day roster). Other work should focus on directly assessing the executives in charge of the teams who draft players and the intuitions they have regarding trading up or down in the draft and what goes into these decisions. It may be that player performance is not the only factor that drives this decision making.

CONCLUSION 

Differences in the way the NFL values draft positions are not associated with player performance in those respective draft positions. This occurs whether you analyze differences in values assigned to draft positions, different positions within a round, or adjacent positions across rounds. These patterns from our findings also predict the outcomes of future drafts in terms of the assets a team trading up gets compared to a team trading down.

APPLICATIONS IN SPORT

Trade values and trade value charts in the NFL are used as the baselines by which to trade draft picks. These charts serve at least two purposes. First, they give general managers a common mechanism that they generally agree on to trade draft picks. Second, it prevents desperate teams from trading away too much and prevents overly greedy teams from demanding too much. These values assign an assumed or perceived worth of the player picked in that position. The idea is that, while picks that are very close together may result in players that are of equal talent, picks that are several positions away from one another should result in better players for those picks that are higher in the draft order. This is reflected in the draft assets that teams are willing to give up to move up in the draft order. Our work shows that there are several performance-based patterns that contradict these naïve beliefs stemming from values given to players. While some teams might argue that they needed a player at a specific position over the best available player, one would expect that if teams stayed with their original pick, that across a general manager’s tenure they would be better off picking someone who provides better performance and not positional need. One should also expect that over one’s tenure the player performance is more valuable as a better player is more valuable as a trade asset. Our work may inform strategies that might be best to use in drafting prospective players in the National Football League.

ACKNOWLEDGEMENTS

Dennis M. Shaffer conceptualized the studies. For Study 1, the author oversaw investigation, methodology, and data curation. Formal analysis in the paper was conducted by this author. 

Ryanne E. Shaffer contributed to both studies, assisting with data curation for Study 1, and conducting Study 2. Dennis M. Shaffer supervised Study 2. 

The paper was drafted by Dennis M. Shaffer; the paper was reviewed and edited by Ryanne E. Shaffer. 

The authors would like to thank JD Okuma and Gavin Davis for their work as raters.

REFERENCES 

1. ALL-22 (2025). https://www.all-22.com/#/

2. All Access Football (2025). https://www.allaccessfootball.com/p/nfl-draft-pick-trade-value-chart

3. Deubert, C., Wong, G. M., & Howe, J. (2012). All four quarters: A retrospective and analysis of the 2011 collective bargaining process and agreement in the National

Football League. UCLA Entertainment Law Review, 19, 1-78. https://doi.org/10.5070/LR8191027149

4. Dienes, Z. (2024). Use one system for all results to avoid contradiction: Advice for using significance tests, equivalence tests, and Bayes factors. Journal of Experimental

Psychology: Human Perception and Performance. 50, 531-534. doi: 1        0.1037/xhp0001202.

5. Hadley, L., Poitras, M., Ruggiero, J., & Knowles, S. (2000). Performance evaluation of National Football League teams. Managerial And Decision Economics, 2, 63-70.

http://www.jstor.org/stable/3108334

6. Hartman, M. (2011). Competitive performance compared to combine performance as a valid predictor of NFL draft status. Journal of Strength & Conditioning Research, 25, S105-S106. DOI:10.1097/01.JSC.0000395746.03546.e8

7. Kruschke, J. K. (2021). Bayesian analysis reporting guidelines. Nature Human Behaviour, 5, 1282-1291. https://doi.org/10.1038/s41562-021-01177-7

8. Lakens, D. (2022). Sample size justification. Collabra: Psychology, 8, 1-28. https://doi.org/10.1525/collabra.33267

9. Lapre, M., A., & Palazzolo, E. M. (2024). Does draft currency promote competitive balance? An empirical investigation of the National Football League 2002–2021

Journal of Sports Economics, 25, 779-801. https://doi.org/10.1177/15270025241264238

10. Lee, M. D., & Wagenmakers, E.-J. (2013). Bayesian cognitive modeling: A practical course. Cambridge University Press.

11. Lyons, B. D., Hoffman, B. J., Michel, J. W., & Williams, K. J. (2011). On the predictive efficiency of past performance and physical ability: The case of the National Football League. Human Performance, 24, 158-172. https://doi.org/10.1080/08959285.2011.555218

12. Massey, C., and Thaler, R. (2013). The loser’s curse: Decision making and market efficiency in the National Football League draft. Management Science, 59, 1479-1495. http://dx.doi.org/10.1287/mnsc.1120.1657

13. NFL (2025). https://www.nfl.com/news/2021-nfl-draft-trade-tracker-full-details-of-every-move

14. PFF football news and analysis (June, 2025). https://www.pff.com/.

15. Reynolds, Z., Bonds, T., Thompson, S., LeCrom, C. (2015). Deconstructing the draft: An evaluation of the NFL draft as a predictor of team success. Journal of Applied Sport Management, 7, https://doi.org/10.7290/jasm07bj7q.

16. Rishis, E., Johnston, K., & Baker, J. (2023). On the predictive validity of the National Football League combine: Does it forecast future success? Journal of Sports Sciences, 41,

217-231. https://doi.org/10.1080/02640414.2023.2207853

17. Simmons, R., & Berri, D. (2009). Gains from specialization and free Agency: The story from the gridiron. Review Of Industrial Organization, 34, 81-98. https://doi.org/10.1007/s11151-009-9200-9

18. Sports Reference © (June, 2025). Pro Football Reference. https://www.pro-football-reference.com

19. Spotrac © (September, 2025). https://www.spotrac.com/nfl/cba/rookie-scale.

20. Terry, N. (2007). Investing in NFL prospects: Factors influencing team winning percentage. International Advances In Economic Research, 13, 117. https://doi.org/10.1007/s11294-006-9071-x

21. Tucker, R., Lee, C., & Black, W. J. (2024). The predictive ability of the physical skills used at the NFL combine to predict draft status. The Sport Journal. https://thesportjournal.org/article/the-predictive-ability-of-the-physical-skills-used-at-the-nfl-combine-to-predict-draft-status/

22. Wetzels, R., Matzke, D., Lee, M. D., Rouder, J. N., Iverson, G. J., & Wagenmakers, E. J. (2011). Statistical evidence in experimental psychology: An empirical comparison using 855 t tests. Perspectives on Psychological Science, 6, 291–298. http://dx.doi.org/10.1177/1745691611406923

.

2025-10-13T10:22:32-05:00April 29th, 2026|Research, Sports Coaching, Sports Management|Comments Off on Over-promised, under-delivered: Does position in the National Football League draft matter?
Go to Top