This project is a personal initiative to build a predictive model for NBA Finals outcomes based on historical player and team data. The goal is to integrate full rosters, regular season performance, and advanced statistics into a machine learning pipeline to forecast future playoff results.
I’m combining and cleaning data from two major Kaggle datasets:
Since both datasets were raw and unstructured, I wrote a Python script that uses SQLite to:
This groundwork allows for future steps like feature engineering, win prediction modeling, and visual dashboarding using libraries like Pandas, Scikit-learn, and Matplotlib.
This project reflects my interest in sports analytics, end-to-end data pipelines, and actionable insights through modeling.