https://docs.google.com/presentation/d/1DMHxoBldgaMp8NiIAc_hAzJCv8LsYULI5M4n9KWSye0/edit?usp=sharing
Install dependencies:
- `pip install --extra-index-url https://test.pypi.org/simple/ datatrac==0.0.5`

Register a new dataset:
- `datatrac push --source "https://example.com/" path/to/data.csv`

Show dataset info or download:
- `datatrac fetch <dataset-hash>`
- `datatrac fetch --download <dataset-hash>`

List all datasets:
- `datatrac fetch --all`

--OR--

- `datatrac fetch -a`

View or create lineage links:
- `datatrac lineage <dataset-hash>`
- `datatrac lineage --parent <parent-hash> --child <child-hash>`

Deregister or delete a local dataset:
- `datatrac delete <dataset-hash>`
- `datatrac delete --local <dataset-hash>`

Start the web dashboard:
- `datatracweb`

Open your browser at: http://localhost:8000
The dashboard shows:
- Dataset browser with search
- Top Downloaded Datasets
- Dataset details (size, hash, source, downloads, last download, creation date)
- Upload support
- Lineage Tracking
Dataset Details and Lineage View
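
Lineage links relate a parent dataset to a child dataset. The sketch below only illustrates what such links represent and how a lineage view could walk them. It is not datatrac's internal model: the `LineageGraph` class, its methods, and the dataset names are all hypothetical.

```python
from collections import defaultdict


class LineageGraph:
    """Hypothetical in-memory store of parent -> child lineage links."""

    def __init__(self) -> None:
        # Maps each child to the set of its direct parents.
        self._parents: dict[str, set[str]] = defaultdict(set)

    def link(self, parent: str, child: str) -> None:
        """Record that `child` was derived from `parent`."""
        self._parents[child].add(parent)

    def ancestors(self, dataset: str) -> set[str]:
        """Return every dataset reachable by following parent links."""
        seen: set[str] = set()
        stack = [dataset]
        while stack:
            current = stack.pop()
            for parent in self._parents.get(current, ()):
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        return seen


graph = LineageGraph()
graph.link("raw", "clean")        # placeholder names, not real hashes
graph.link("clean", "features")
print(sorted(graph.ancestors("features")))
```

Storing parents per child keeps the "where did this dataset come from" query a simple upward walk; a real implementation would key the links by dataset hash.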

| Command | Description | Example |
|---|---|---|
| `push` | Register a new dataset | `datatrac push data.csv --source "url"` |
| `fetch` | Show dataset info or download | `datatrac fetch <hash>` |
| `fetch -a` | List all datasets | `datatrac fetch --all` |
| `lineage` | View or create lineage links | `datatrac lineage <hash>` |
| `delete` | Deregister or delete local dataset | `datatrac delete --local <hash>` |

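
The commands in the table can also be driven from a script. The following is a minimal sketch, assuming `datatrac` is installed and on `PATH`. It uses only the subcommands and flags listed above; the helpers `datatrac_argv` and `run` are hypothetical names introduced for illustration, and the hash is a placeholder.

```python
import shlex
import subprocess


def datatrac_argv(command: str, *args: str) -> list[str]:
    """Build the argument list for one datatrac invocation."""
    return ["datatrac", command, *args]


def run(argv: list[str]) -> str:
    """Run a command and return its stdout; raises on a non-zero exit code."""
    result = subprocess.run(argv, check=True, capture_output=True, text=True)
    return result.stdout


# Commands taken from the table; "<hash>" stands in for a real dataset hash.
dataset_hash = "<hash>"
push = datatrac_argv("push", "data.csv", "--source", "url")
fetch_all = datatrac_argv("fetch", "--all")
delete_local = datatrac_argv("delete", "--local", dataset_hash)

# Print the equivalent shell lines without executing anything.
for argv in (push, fetch_all, delete_local):
    print(shlex.join(argv))
```

Calling `run(fetch_all)` would execute the command for real. Passing arguments as a list avoids shell quoting problems with paths or source URLs that contain spaces.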
Datasets can be referenced by:
- Name: original filename (e.g., `data.csv`)

Notes:
- Default admin password is hardcoded as `admin` (for proof of concept).
- Works with Python 3.12+. It uses uv for package management, typer for the CLI commands, and FastAPI with ReactJS for the web dashboard.
- The React frontend ships prebuilt. No manual build is required, because FastAPI serves it as static HTML.
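
Most commands in this README address a dataset by its hash. This README does not state how that hash is computed, so the sketch below is only an illustration of a content hash, assuming SHA-256 over the file bytes. The function name `file_digest` and the sample file contents are hypothetical.

```python
import hashlib
import tempfile
from pathlib import Path


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()  # assumption: datatrac may use a different algorithm
    with path.open("rb") as handle:
        while chunk := handle.read(chunk_size):
            digest.update(chunk)
    return digest.hexdigest()


# Hash a small throwaway CSV to show the shape of the result.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "data.csv"
    sample.write_bytes(b"id,value\n1,42\n")
    print(sample.name, file_digest(sample))
```

Reading in chunks keeps memory use flat for large datasets. A content hash also means that two files with identical bytes get the same identifier regardless of filename.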

