debashisdash1999/README.md

🙏🏻 Greetings, I’m Choudhury Debashis Dash

💻 Cloud Data Engineer | ETL / ELT Developer | Data Analytics Enthusiast
📍 India  |  🌎 Open to roles across India (On-site & Remote) and abroad (Remote)  |  ⚡ Immediately Available


🚀 What I Do

I design and build data pipelines, cloud warehouses, and BI dashboards that help businesses move from raw data to reliable decisions.

Over three years, I've worked at both ends of the data stack: engineering Snowflake pipelines for a client project (Mercedes-Benz, USA & Canada) at Infosys, and owning the full BI function at Troy Consultancy, where I built dashboards that teams used daily.

My focus areas:

  • ELT pipeline development in Snowflake - from raw ingestion through Bronze → Silver → Gold layers
  • Automation using Snowpipe, Streams, and Tasks for CDC and scheduled workflows
  • BI & reporting with Power BI, connecting to SQL Server, Excel, and web sources
  • Query & warehouse optimization - I pay attention to compute cost and performance, not just to making things work

🔧 Tech Stack

🗄️ Data Warehousing & Databases

Snowflake Snowflake SQL MS SQL Server PostgreSQL MySQL

☁️ Cloud & Storage

Azure Data Factory Azure Data Lake AWS S3

📊 BI & Visualization

Power BI

💻 Programming & Data Processing

Python Pandas

🤝 Collaboration

GitHub


📂 Featured Projects (see the Repositories tab on my profile for all projects)


🏗️ End-to-End Data Engineering


Enterprise-style data warehouse built on Microsoft SQL Server, implementing a Bronze → Silver → Gold layered architecture with CRM and ERP source integration, stored procedure-based ETL, star schema modeling, and a Sales Data Mart with dim_customers, dim_products, and fact_sales.

MS SQL Server ETL Star Schema Bronze Silver Gold


Production-style Snowflake pipeline modeled on a food delivery platform, covering initial & delta loads, CDC using Streams, SCD Type 2 dimensions, a star schema fact table at order-item granularity, data governance with Tags & Masking Policies, and full automation via Stored Procedures and Tasks.

Snowflake Streams Tasks SCD2 Star Schema Data Governance
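
As a rough illustration of the SCD Type 2 idea mentioned above (not the project's actual Snowflake code), here is a minimal Python sketch. The `DimRow` class, the `apply_scd2` function, and the customer and city values are all hypothetical; the real pipeline does this with Streams, Stored Procedures, and Tasks.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class DimRow:
    """One version of a customer in a hypothetical SCD Type 2 dimension."""
    customer_id: int
    city: str
    valid_from: date
    valid_to: Optional[date] = None  # None means "still current"
    is_current: bool = True


def apply_scd2(dim: List[DimRow], customer_id: int,
               new_city: str, change_date: date) -> List[DimRow]:
    """Return a new dimension list with one change applied as SCD Type 2."""
    result = []
    changed = False
    found_current = False
    for row in dim:
        if row.customer_id == customer_id and row.is_current:
            found_current = True
            if row.city != new_city:
                # Close the old version instead of overwriting it.
                result.append(replace(row, valid_to=change_date, is_current=False))
                changed = True
                continue
        result.append(row)
    if changed or not found_current:
        # Open a new current version (also covers brand-new customers).
        result.append(DimRow(customer_id, new_city, change_date))
    return result


dim = [DimRow(1, "Pune", date(2024, 1, 1))]
dim = apply_scd2(dim, 1, "Mumbai", date(2024, 6, 1))  # attribute changed
dim = apply_scd2(dim, 1, "Mumbai", date(2024, 7, 1))  # no change, no new row
for row in dim:
    print(row)
```

The point of the pattern is that history is preserved: the old row is closed with an end date rather than updated in place, so facts can still join to the version that was valid at the time.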


Enterprise-scale retail analytics solution for an ecommerce company with 5M+ customers across 15 countries. Built on Snowflake with ADLS as the external stage, ingesting CSV, JSON, and Parquet data. Implements Bronze → Silver → Gold layers, CDC with Streams, data quality pipelines, Tasks, and Gold layer views for sales performance, customer segmentation, and product analytics.

Snowflake Azure Data Lake Streams Tasks Parquet Bronze Silver Gold


❄️ Snowflake Concept Projects


Change Data Capture implementation (INSERT / UPDATE / DELETE) using Snowflake Streams with AWS S3 integration.

Snowflake AWS S3 CDC


End-to-end automated data ingestion pipeline using Snowpipe — setup, configuration, and event-based triggering.

Snowflake Snowpipe


Querying and extracting nested JSON data in Snowflake using the VARIANT data type and the FLATTEN function.

Snowflake JSON VARIANT
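
To show what flattening a nested array means in practice, here is a small Python analogy (plain Python, not Snowflake SQL). The sample document, its field names, and the `flatten_items` helper are assumptions made up for this sketch; in Snowflake the same effect comes from FLATTEN over a VARIANT column.

```python
import json

# Hypothetical nested order document, similar in shape to semi-structured JSON
# that would be stored in a VARIANT column.
raw = (
    '{"order_id": 101, "customer": {"name": "Asha"}, '
    '"items": [{"sku": "A1", "qty": 2}, {"sku": "B7", "qty": 1}]}'
)


def flatten_items(doc: str) -> list:
    """Produce one flat row per element of the nested "items" array."""
    order = json.loads(doc)
    return [
        {
            "order_id": order["order_id"],
            "customer_name": order["customer"]["name"],
            "sku": item["sku"],
            "qty": item["qty"],
        }
        for item in order.get("items", [])
    ]


rows = flatten_items(raw)
for row in rows:
    print(row)
```

Each array element becomes its own row, with the parent-level fields repeated alongside it.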


🧹 SQL Projects


Real-world SQL data cleaning — handling nulls, duplicates, standardization, and data type corrections.

MS SQL Server Data Cleaning
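
A minimal sketch of the kind of cleaning steps listed above, using Python's built-in `sqlite3` as a stand-in for MS SQL Server so it runs anywhere. The `raw_customers` table, its columns, and the sample rows are hypothetical and not taken from the project.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE raw_customers (id INTEGER, name TEXT, country TEXT, signup_date TEXT)"
)
conn.executemany(
    "INSERT INTO raw_customers VALUES (?, ?, ?, ?)",
    [
        (1, "  Asha ", "india", "2024-01-05"),
        (1, "  Asha ", "india", "2024-01-05"),  # exact duplicate
        (2, "Ravi", None, "2024-02-10"),        # missing country
        (3, "Meera", " INDIA", None),           # inconsistent casing and spacing
    ],
)

# DISTINCT removes exact duplicates, TRIM/UPPER standardize text,
# and COALESCE replaces a NULL country with a placeholder value.
cleaned = conn.execute(
    """
    SELECT DISTINCT
        id,
        TRIM(name) AS name,
        UPPER(TRIM(COALESCE(country, 'unknown'))) AS country,
        signup_date
    FROM raw_customers
    ORDER BY id
    """
).fetchall()

for row in cleaned:
    print(row)
```

The same functions (`TRIM`, `UPPER`, `COALESCE`, `DISTINCT`) exist in SQL Server, though real cleaning usually also involves casts and de-duplication on business keys rather than exact row matches.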


Advanced SQL analytics on MLB player, team, and school data — window functions, aggregations, and performance insights.

SQL Analytics


Analyzing restaurant menu and order data to surface popular dishes, pricing trends, and customer spending patterns.

SQL Analytics


🐍 Python — Data Cleaning Projects


Real-world Airbnb dataset cleaned using Pandas — handling missing values, outliers, type conversions, and column normalization.

Python Pandas Data Cleaning


Amazon product data cleaned and preprocessed using Pandas — structured for downstream analytics or ML use.

Python Pandas Data Cleaning


📊 Power BI Reports


Interactive HR dashboard covering employee headcount, attrition analysis, departmental performance, and workforce KPIs.

Power BI HR Analytics Interactive


Visual analysis of personality survey data with dynamic slicers, trait distributions, and behavioral pattern breakdowns.

Power BI Survey Analytics Interactive


📫 Contact Me


Always learning, always building — data tells the story, I make it clear.

Popular repositories

  1. debashisdash1999 (Public)

    Greetings and welcome to my profile.

  2. sql_retail_sale_project (Public)

    Retail Sales Analysis Project. This repository hosts a PostgreSQL project analyzing retail sales data with a retail_sales table. It features SQL queries for data cleaning, exploration, and insights …

  3. sql_zepto_project (Public)

    A PostgreSQL project exploring Zepto’s product inventory - cleaning and analyzing data to find category trends, top discounts, out-of-stock high-MRP items, and category-wise revenue.

  4. sql_restaurant_order_analysis (Public)

    An SQL project where I dig into restaurant menu & order data to find popular dishes, price trends, and spending patterns.

  5. sql_major_league_baseball_proj (Public)

    SQL project analyzing MLB player, team, and school data from the Baseball Database with advanced queries, analytics, and insights.

  6. snowflake_proj1_warehouse_setup_and_basics (Public)

    The objective of this task is to create and configure a new virtual warehouse in Snowflake. Warehouses are crucial for query execution and data processing, as they provide the compute resources req…