This document describes the organization of the catecholamine research repository.
catecholamine/
├── data/ # All data files
│ ├── raw/ # Immutable raw data files with metadata
│ └── derived/ # Generated datasets (rebuildable from raw)
├── figures/ # Output figures (generated, not committed)
├── notebooks/ # Exploratory analysis notebooks
├── paper/ # Paper manuscript and related files
│ └── sections/ # Paper sections (if split)
├── protocol/ # Research protocols and methodologies
├── refs/ # Bibliography and citation files
├── reports/ # Generated reports from scripts
├── schemas/ # Data validation schemas
├── scripts/ # Pipeline scripts
│ ├── scrape/ # Data scraping utilities
│ ├── validate_raw.py # Validate raw data
│ ├── build_dataset.py # Build master dataset
│ └── build_reports.py # Generate reports
└── src/ # Reusable Python modules
└── catecholamine/ # Main package
README.md- Project overview and quickstartrequirements.txt- Python dependenciespyproject.toml- Python package configurationLICENSE- MIT licenseCONTRIBUTING.md- Contribution guidelines.gitignore- Git ignore patterns
- Raw Data (
data/raw/) - Immutable source data with metadata - Validation (
scripts/validate_raw.py) - Ensure data quality - Master Dataset (
scripts/build_dataset.py) - Build normalized dataset - Reports (
scripts/build_reports.py) - Generate analysis reports - Paper (
paper/paper.qmd) - Render final manuscript
- Every raw data file must have a
.meta.yamlcompanion file - Never edit raw data in place
- All derived data must be rebuildable from raw data
- All figures generated from code
- Paper rendered from Quarto documents
- Complete dependency specification
- Document all inclusion/exclusion decisions
- Log extraction methods
- Use quality flags for questionable data
See README.md for setup instructions and workflow overview.