Skip to content

ELT pipeline transforming raw CSV data into a structured MySQL dataset for potential analysis and visualization

Notifications You must be signed in to change notification settings

KwiecinskiMateusz/Project1_ELT_Pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Project 1: ELT Pipeline

This project is my first portfolio piece in Data Analysis, providing hands-on experience with MySQL. The goal was to extract, load, and transform data from three raw CSV files into a structured, clean dataset ready for potential exploratory data analysis (EDA) and visualization.

Data Sources:

  • Population data (255 countries and regions, 1960–2023)
  • Average temperature data (170 countries, 1901–2022)
  • Electricity production data (251 countries and regions, 1985–2023)

Key Transformations & Feature Engineering:

  • Merged datasets into a final table covering 160 countries (2000–2022) with population, temperature, and electricity production data.
  • Created additional features, including yearly changes (absolute & percentage) for all metrics.
  • Added electricity production per capita as a new calculated column.
  • Performed basic data aggregation for enhanced code usability.

The final dataset is export-ready in CSV format for further potential analysis and visualization.

About

ELT pipeline transforming raw CSV data into a structured MySQL dataset for potential analysis and visualization

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published