Skip to content

fatmaNashat/Incremental-ETL-pipeline-to-synchronize-data-from-PostgreSQL-to-MySQL-using-Airflow-Docker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ChatGPT Image Nov 5, 2025, 08_51_03 AM

Incremental ETL Pipeline with Apache Airflow

This project demonstrates a production-style ETL workflow using Apache Airflow to incrementally synchronize data from PostgreSQL to MySQL.

Key Features

Timestamp-based incremental extraction (last_update)

Idempotent upsert logic in MySQL

Airflow Variables for persistent state tracking

Task orchestration using PythonOperators

Fully containerized environment using Docker

Tech Stack

-Apache Airflow

-PostgreSQL

-MySQL

-Python

-Docker

Use Case

Efficiently process only new or updated records, reducing load, improving performance, and ensuring reliable data synchronization.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages