This repo contains my notes and homework assignments for the 2024 Data Talks Data Engineering Zoomcamp plus additional notes on orchestrators from other years - Airflow (2022), Prefect (2023), and Kestra (2025) The final project for the 2024 course can be found in this repo NYC Collisions Analytics
MODULE 1: Docker & SQL • Terraform & GCP
MODULE 2A: Orchestration with Mage
MODULE 2B: Orchestration with Airflow
MODULE 2C: Orchestration with Prefect
MODULE 2D: Orchestration with Kestra
MODULE 3: Data Warehouses with BigQuery
MODULE 4: Analytics Engineering with dbt
MODULE 5: Batch with Spark
MODULE 6: Streaming with Kafka
GIT
- GIT Beginner
- GIT Cheatsheet
- GIT Markdown Cheatsheet
- GIT Extended Special commit cases
- GIT Readme Adding HTML and CSS to GIT Readme
- GIT Merge Divergent Branches
Terraform
PostgreSQL
- Postgres PostgreSQL official documentation