Skip to content

corti00104/NFL-4th-Down-Modeling

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Leveraged 2021-2023 NFL play-by-play database and utilized Python for data scraping from PFF and Pro Football Reference, statistical modeling, and machine learning techniques. Implemented data pre-processing, feature selection, and cross-validation for predictive accuracy. • Developed models to estimate 4th down conversion probabilities, combining play-by-play data, team rankings, and situational variables. Created decision-making framework that considers various scenarios, conditions, and micro variables to deliver a verdict on whether a conversion attempt will be successful. • Leveraged logistic regression, feature importance selection, and random forests to predict binary outcomes, using football context to optimize model features, and understanding team dynamics, field conditions, and strategy.

This analysis uses the nfl_data_py library. nfl_data_py: https://github.com/cooperdff/nfl_data_py

My Contributions

This project was completed as part of a collaborative team effort.

My individual contributions included:

  • Assisting with data scraping from sources like PFF and Pro Football Reference
  • Cleaning and processing data to ensure quality and readiness for modeling
  • Reviewing and verifying visualizations for accuracy, clarity, and presentation quality
  • Collaborating with the team on analysis interpretation and delivery

This fork serves as my personal record of the project and highlights my applied experience with data scraping, data quality assurance, visualization review, and collaborative data analysis.

About

Files used in code to generate findings, code, and associated visuals and research report

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%