B2B SaaS Analytics Pipeline

This project sets up an end-to-end analytics pipeline that loads data from Supabase into Snowflake, transforms it with dbt, and visualizes it in Apache Superset.

Prerequisites

Python 3.8+
PostgreSQL
Supabase account and credentials
Snowflake account and credentials

Setup Instructions

1. Set Up Python Environment

# Create a virtual environment
python -m venv venv

# Activate the virtual environment
# On Windows:
venv\Scripts\activate
# On Unix or MacOS:
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

2. Start PostgreSQL Database in Docker

# Start postgres in docker
docker compose up --build

Make sure you have a secrets.toml file in the .dlt folder with the following structure:

3. Set Up Dagster

# Start Dagster webserver
dagster dev

The Dagster UI will be available at http://localhost:3000

4. Run the Pipeline

In the Dagster UI:

Navigate to the Assets tab
Select all assets
Click "Materialize Selected"

This will:

Load data from CSV to PostgreSQL database using dlt
Run dbt transformations on the loaded data

6. Set Up Apache Superset

# Change directory to superset folder
cd superset

# Make scripts executable
chmod +x setup_db.sh run_superset.sh

# Set up the database
./setup_db.sh

# Start Superset
./run_superset.sh

When running run_superset.sh, you'll be prompted to create an admin user. Follow the prompts to set up your credentials.

Superset will be available at http://localhost:8088

7. Create Dashboard in Superset

Log in to Superset using your admin credentials
Go to Data → Databases and add your PostgreSQL connection
Create new datasets from your analytics table
Create charts using these datasets
Combine charts into a dashboard

Troubleshooting

If you encounter database connection issues, verify your credentials in secrets.toml and profiles.yml
For Superset connection issues, check superset_config.py settings
Make sure all required ports are available and not blocked by firewall

Security Note

Never commit files containing credentials (secrets.toml, profiles.yml, superset_config.py) to version control. Add them to your .gitignore file. For demonstration purpose, we have unblocked some of these files.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.dlt		.dlt
.tmp_dagster_home_rc7f9krf		.tmp_dagster_home_rc7f9krf
car-listing-project		car-listing-project
car_listing_project.egg-info		car_listing_project.egg-info
dbt_project		dbt_project
init-scripts		init-scripts
logs		logs
raw		raw
superset		superset
README.md		README.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
requirements-superset.txt		requirements-superset.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

B2B SaaS Analytics Pipeline

Prerequisites

Setup Instructions

1. Set Up Python Environment

2. Start PostgreSQL Database in Docker

3. Set Up Dagster

4. Run the Pipeline

6. Set Up Apache Superset

7. Create Dashboard in Superset

Troubleshooting

Security Note

Additional Resources

About

Uh oh!

Languages

taeefnajib/Car-Listing-ELT-Analytics

Folders and files

Latest commit

History

Repository files navigation

B2B SaaS Analytics Pipeline

Prerequisites

Setup Instructions

1. Set Up Python Environment

2. Start PostgreSQL Database in Docker

3. Set Up Dagster

4. Run the Pipeline

6. Set Up Apache Superset

7. Create Dashboard in Superset

Troubleshooting

Security Note

Additional Resources

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages