A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Updated Apr 23, 2025 - TSQL
End-to-end Data Lakehouse project built on Databricks, following the Medallion Architecture (Bronze, Silver, Gold). Covers real-world data engineering and analytics workflows using Spark, PySpark, SQL, Delta Lake, and Unity Catalog. Designed for learning, portfolio building, and job interviews.
A cloud-native data pipeline and visualization project analyzing Formula 1 racing data using Azure, Databricks, Delta Lake, Tableau, and Python for insightful EDA and interactive dashboards.
🦆 Batch data pipeline with Airflow, DuckDB, Delta Lake, Trino, MinIO, and Metabase. Full observability and data quality.
A production-ready PySpark project template with medallion architecture, Python packaging, unit tests, integration tests, CI/CD automation, Databricks Asset Bundles, and DQX data quality framework.
Databricks Data Engineer Associate Certification Lab: End-to-end hands-on project covering Auto Loader, Medallion Architecture, SCD Type 2, Unity Catalog governance, and Databricks Jobs orchestration. Build a production-grade pipeline on Databricks Free Edition.
Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from Blizzard’s Hearthstone API. Focused on card statistics and attributes, this project reveals detailed insights into card mechanics, strengths, and trends to support BI and strategic analysis.
Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes synthetic data, step-by-step guide, and certification prep.
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
Revolutionary AI ETL with Medallion Architecture: Zero-touch autonomous & HITL pipelines on Databricks
'Talk to Your Factory' demo leveraging Edge (Azure IoT Operations), Cloud (Microsoft Fabric), and a Factory Agent (Azure OpenAI), to streamline factory operations. It allows real-time, natural language communication with factory systems, helping operators quickly identify issues, boost efficiency, and minimize downtime.
Unified Data Foundation with Microsoft Fabric with Options to Integrate with Azure Databricks and Microsoft Purview
This project implements a Lakehouse Medallion Architecture using modern data stack tools such as Fivetran, Snowflake, and dbt. The fictitious organization is an e-commerce company.
Building a modern data warehouse with Microsoft SQL Server, including ETL processes across the Bronze, Silver, and Gold layers, as well as data modeling and analytics.
Building a modern data warehouse with SQL Server, including ETL processes, data modeling and analytics
This repo provides a step-by-step approach to building a modern data warehouse using PostgreSQL. It covers the ETL (Extract, Transform, Load) process, data modeling, exploratory data analysis (EDA), and advanced data analysis techniques.
End-to-end Azure data engineering pipeline ingesting real-time earthquake data from the USGS API. Implements a Bronze–Silver–Gold lakehouse using Azure Data Factory, Databricks, ADLS Gen2, and Synapse Analytics, with both manual execution and fully automated daily-triggered workflows.
Enterprise-grade Data Platform for NYC Taxi Analytics. Orchestrated with Airflow (Astro) & dbt, served via FastAPI & Power BI. Features Medallion Architecture, Data Quality Observability (Slack), and Star Schema modeling.
Development scaffold for test-driven PySpark Structured Streaming with fast local testing.