Skip to content

preston-thomas/nfl_advanced_statistics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NFL Advanced Statistics Data Pipeline Project

This will hone my data engineering skills while exploring some of my greatest passions in life - the NFL and advanced metrics.

This is still a work in progress, more coming soon...

Anticipated Requirements

  • DBeaver or another universal database manager of your choice
  • A local PostgreSQL installation (or another SQL database of your choice, SQLite is another good, lightweight option)
  • Python 3.13.0
  • SQLAlchemy Python library for SQL connection to PostgreSQL database, Pandas for data manipulation, PyQt and Matplotlib for vizualization

Run Instructions

  • You will need to load the CSVs into SQL tables
  • You will then need to establish a connection to your local PostgreSQL database, using a connection string. An example can be found at the top of the data_ingestion.py file

Citations

All data currently in the database contributed from the CSVs found from this GitHub profile: nflverse. This repository contains the source code responsible for scraping data from multiple sources and is refreshed on a week-to-week basis during the NFL season.

About

NFL Advanced Statistics Data Pipeline

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages