etl_titanic

ETL Data Pipeline Project This project demonstrates the creation of an ETL (Extract, Transform, Load) data pipeline using Python. The pipeline extracts raw data from various sources, processes and transforms it to clean and structured formats, and finally loads it into a relational database for further analysis. The pipeline ensures data consistency, handles missing values, and performs transformations like categorizing age groups. It uses libraries such as Pandas for data manipulation and SQLAlchemy to interact with a MySQL database. The final dataset is optimized for analytics and reporting.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.ipynb_checkpoints		.ipynb_checkpoints
.gitattributes		.gitattributes
ETL_Data_Pipeline.ipynb		ETL_Data_Pipeline.ipynb
ETL_Data_Pipeline.pdf		ETL_Data_Pipeline.pdf
README.md		README.md
tested.csv		tested.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

etl_titanic

About

Uh oh!

Releases

Packages

Languages

rohit-ashva900/etl_titanic

Folders and files

Latest commit

History

Repository files navigation

etl_titanic

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages