🚀 SpaceX Falcon 9 First Stage Landing Prediction

This project tackles a real-world challenge faced by SpaceX: Can we predict whether a Falcon 9 first-stage booster will land successfully, before launch, using historical mission data?

Leveraging data from the SpaceX API and public sources, I built an end-to-end data science pipeline—from automated data collection and rigorous cleaning, to in-depth exploratory analysis, interactive mapping, dashboard creation, and machine learning modeling.

Unified, analysis-ready dataset combining API and web data on launches, sites, orbits, payloads, and recovery outcomes.
Interactive dashboards and geospatial maps enable rapid exploration of launch patterns and success rates by site, orbit, and payload.
ML-powered predictions: Trained, tuned, and compared several classifiers—including Logistic Regression, SVM, KNN, and Decision Tree—to predict landing success with high accuracy.
Actionable insights: Identified which launch sites and mission profiles drive the highest landing success, and discovered how payload mass and orbit impact outcomes.

This capstone demonstrates my ability to solve end-to-end business problems with data engineering, analytics, visualization, and predictive modeling—all skills directly transferable to industry roles in data science and analytics.

Repository Contents

jupyter-labs-spacex-data-collection-api.ipynb
Collects SpaceX launch data via API, merging with Wikipedia data for comprehensive features.
jupyter-labs-webscraping.ipynb
Scrapes web data for SpaceX payloads and booster recovery details.
labs-jupyter-spacex-Data wrangling.ipynb
Cleans, merges, and prepares the data for analysis.
jupyter-labs-eda-sql-coursera_sqlite (2).ipynb
Performs exploratory data analysis (EDA) and SQL-based analysis to uncover trends and correlations.
edadataviz.ipynb
Creates visualizations to understand launch outcomes by site, orbit, and payload mass.
lab_jupyter_launch_site_location (1).ipynb
Maps launch sites using Folium to visualize site locations and nearby infrastructure.
Lab Dashboard Application with Plotly Dash.pdf
Interactive dashboard for exploring launch outcomes and success ratios by site and payload.
SpaceX_Machine Learning Prediction_Part_5.ipynb
Builds and evaluates machine learning models (Logistic Regression, SVM, Decision Tree, KNN) to predict landing success. Includes model comparison, accuracy, and confusion matrix.

Project Highlights

End‑to‑end pipeline: Data collection (SpaceX API + Wikipedia scraping) → data wrangling → EDA (plots & SQL) → interactive Folium maps → Plotly Dash dashboard → ML model training & evaluation.
Created a unified, clean dataset of Falcon 9 launches, outcomes, and site/orbit/payload details.
Built dashboards and maps to enable rapid exploration and visualization of launch trends.
Trained and compared multiple classification models (Logistic Regression, SVM, Decision Tree, KNN) to predict first‑stage landing outcomes.

Key Findings

Top‑performing launch sites: In this dataset, CCAFS SLC‑40 and KSC LC‑39A consistently achieved the highest first‑stage landing success rates.
Payload mass effect: Most landing successes cluster below ~6,000 kg; heavier payloads show more variable outcomes.
Orbit impact: SSO (polar), GEO, and HEO orbits show near‑perfect recovery rates in this sample, while GTO lags at ~50%.
Best model: The Decision Tree classifier achieved the highest cross‑validated accuracy (≈ 0.89). On the hold‑out test set, all models achieved ~0.83 accuracy.

📄 Full Project Report

A comprehensive slide deck walks through every step of this project, from data collection and wrangling all the way through EDA, interactive maps, dashboards, and machine-learning results.

👉 SpaceX Falcon 9 Landing Prediction – Full Report

The report includes:

Data collection & wrangling flow
Visual EDA (plots + SQL results)
Interactive analytics (Folium + Plotly Dash)
Predictive modeling (LogReg, SVM, Decision Tree, KNN), hyperparameter tuning, accuracy, and confusion matrix
Conclusions and next steps

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
AnalyzingHistoricalStockRevenueData&BuildingaDashboard.ipynb		AnalyzingHistoricalStockRevenueData&BuildingaDashboard.ipynb
DataScienceEcosystem.ipynb		DataScienceEcosystem.ipynb
Lab Dashboard Application with Plotly Dash.pdf		Lab Dashboard Application with Plotly Dash.pdf
README.md		README.md
SpaceX Falcon 9 Landing Prediction – Full Report.pdf		SpaceX Falcon 9 Landing Prediction – Full Report.pdf
SpaceX_Machine Learning Prediction_Part_5 (2).ipynb		SpaceX_Machine Learning Prediction_Part_5 (2).ipynb
edadataviz.ipynb		edadataviz.ipynb
jupyter-labs-eda-sql-coursera_sqllite (2).ipynb		jupyter-labs-eda-sql-coursera_sqllite (2).ipynb
jupyter-labs-spacex-data-collection-api (1).ipynb		jupyter-labs-spacex-data-collection-api (1).ipynb
jupyter-labs-webscraping (1).ipynb		jupyter-labs-webscraping (1).ipynb
lab_jupyter_launch_site_location (1).ipynb		lab_jupyter_launch_site_location (1).ipynb
labs-jupyter-spacex-Data wrangling.ipynb		labs-jupyter-spacex-Data wrangling.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🚀 SpaceX Falcon 9 First Stage Landing Prediction

Repository Contents

Project Highlights

Key Findings

📄 Full Project Report

About

Uh oh!

Releases

Packages

Languages

RC-15-coder/Spacex-Falcon9-landing-prediction

Folders and files

Latest commit

History

Repository files navigation

🚀 SpaceX Falcon 9 First Stage Landing Prediction

Repository Contents

Project Highlights

Key Findings

📄 Full Project Report

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages