You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+63-17Lines changed: 63 additions & 17 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -27,15 +27,15 @@
27
27
28
28
---
29
29
30
-
## 📌 Project Overview
30
+
## 📌 1. Project Overview
31
31
32
32
This project demonstrates my ability to build a **scalable, production-grade data pipeline** using industry-standard tools. From raw data ingestion and transformation to CI/CD and visualization, this project simulates the daily responsibilities of a Data Engineer.
@@ -48,7 +48,7 @@ This project demonstrates my ability to build a **scalable, production-grade dat
48
48
49
49
---
50
50
51
-
## 🧱 Architecture Diagram
51
+
## 🧱 3. Architecture Diagram
52
52
53
53
This project follows a modular and automated data engineering architecture on Google Cloud.
54
54
Raw synthetic healthcare data is generated and stored in GCS, externalized into BigQuery, transformed via DBT models, and deployed through CI/CD using GitHub Actions.
@@ -59,21 +59,67 @@ Raw synthetic healthcare data is generated and stored in GCS, externalized into
59
59
60
60
---
61
61
62
-
## 🔁 Step-by-Step Workflow
62
+
## 🔁 4. Step-by-Step Workflow
63
63
64
64
### 4.1 GCP Setup
65
-
- Created GCP project
66
-
- Configured IAM roles and Service Accounts
67
-
- Enabled BigQuery & GCS
65
+
- Created a new Google Cloud project (`root-matrix-457217-p5`)
66
+
- Enabled **BigQuery**, **Cloud Storage**, and **IAM** APIs
67
+
- Created service accounts with proper IAM roles (`BigQuery Admin`, `Storage Admin`, etc.)
0 commit comments