Skip to content

Commit 16b459e

Browse files
committed
update
1 parent 244fe59 commit 16b459e

File tree

1,519 files changed

+13834
-191368
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

1,519 files changed

+13834
-191368
lines changed

AGENTS.md

Lines changed: 3 additions & 62 deletions
Original file line numberDiff line numberDiff line change
@@ -694,68 +694,9 @@ Synthetic data generation for sequences, ecosystems, and biological systems.
694694

695695
## Documentation
696696

697-
For detailed documentation about AI contributions to specific modules, organized by category:
698-
699-
### Repository-Level Documentation
700-
- **Main Source Development**: [`src/metainformant/AGENTS.md`](src/metainformant/AGENTS.md) - Overall AI assistance in source code development
701-
- **Source Organization**: [`src/AGENTS.md`](src/AGENTS.md) - Source code infrastructure and organization
702-
- **Configuration Management**: [`config/AGENTS.md`](config/AGENTS.md) - Configuration system development
703-
- **General Documentation**: [`docs/AGENTS.md`](docs/AGENTS.md) - AI assistance in documentation development
704-
705-
Note: The `output/` directory is **strictly ephemeral** and contains **only program-generated analysis results**.
706-
707-
**CRITICAL**: Never create documentation, reports, test scripts, or planning documents in `output/`. These belong in `docs/`, `tests/`, or `scripts/` respectively. See `output/.cursorrules` for the complete policy.
708-
709-
### Source Module Documentation (Implementation)
710-
- **Core Utilities**: [`src/metainformant/core/AGENTS.md`](src/metainformant/core/AGENTS.md) - Core infrastructure and shared utilities
711-
- **DNA Analysis**: [`src/metainformant/dna/AGENTS.md`](src/metainformant/dna/AGENTS.md) - DNA sequence analysis and genomics
712-
- **RNA Analysis**: [`src/metainformant/rna/AGENTS.md`](src/metainformant/rna/AGENTS.md) - RNA transcriptomic analysis and workflow orchestration
713-
- **RNA Workflow Steps**: [`src/metainformant/rna/steps/AGENTS.md`](src/metainformant/rna/steps/AGENTS.md) - Modular RNA workflow step implementations
714-
- **Protein Analysis**: [`src/metainformant/protein/AGENTS.md`](src/metainformant/protein/AGENTS.md) - Protein sequence and structure analysis
715-
- **Epigenome**: [`src/metainformant/epigenome/AGENTS.md`](src/metainformant/epigenome/AGENTS.md) - Epigenetic modification analysis
716-
- **Ontology**: [`src/metainformant/ontology/AGENTS.md`](src/metainformant/ontology/AGENTS.md) - Functional annotation and ontologies
717-
- **Phenotype**: [`src/metainformant/phenotype/AGENTS.md`](src/metainformant/phenotype/AGENTS.md) - Phenotypic trait analysis
718-
- **Ecology**: [`src/metainformant/ecology/AGENTS.md`](src/metainformant/ecology/AGENTS.md) - Ecological metadata and community analysis
719-
- **Mathematical Biology**: [`src/metainformant/math/AGENTS.md`](src/metainformant/math/AGENTS.md) - Mathematical and theoretical biology
720-
- **GWAS Module**: [`src/metainformant/gwas/AGENTS.md`](src/metainformant/gwas/AGENTS.md) - Genome-wide association studies implementation
721-
- **Information Theory**: [`src/metainformant/information/AGENTS.md`](src/metainformant/information/AGENTS.md) - Information-theoretic analysis
722-
- **Life Events**: [`src/metainformant/life_events/AGENTS.md`](src/metainformant/life_events/AGENTS.md) - Life course event analysis
723-
- **Visualization**: [`src/metainformant/visualization/AGENTS.md`](src/metainformant/visualization/AGENTS.md) - Plotting and visualization utilities
724-
- **Simulation**: [`src/metainformant/simulation/AGENTS.md`](src/metainformant/simulation/AGENTS.md) - Synthetic data generation and modeling
725-
- **Single-Cell Genomics**: [`src/metainformant/singlecell/AGENTS.md`](src/metainformant/singlecell/AGENTS.md) - Single-cell RNA sequencing analysis
726-
- **Quality Control**: [`src/metainformant/quality/AGENTS.md`](src/metainformant/quality/AGENTS.md) - Data quality assessment
727-
- **Network Analysis**: [`src/metainformant/networks/AGENTS.md`](src/metainformant/networks/AGENTS.md) - Biological network analysis
728-
- **Machine Learning**: [`src/metainformant/ml/AGENTS.md`](src/metainformant/ml/AGENTS.md) - Machine learning for biological data
729-
- **Multi-Omics**: [`src/metainformant/multiomics/AGENTS.md`](src/metainformant/multiomics/AGENTS.md) - Multi-omic data integration
730-
731-
### Documentation Module Files (User Documentation)
732-
- **Core Documentation**: [`docs/core/AGENTS.md`](docs/core/AGENTS.md) - Core utilities documentation development
733-
- **DNA Documentation**: [`docs/dna/AGENTS.md`](docs/dna/AGENTS.md) - DNA analysis documentation development
734-
- **RNA Documentation**: [`docs/rna/AGENTS.md`](docs/rna/AGENTS.md) - RNA analysis documentation development
735-
- **Amalgkit Documentation**: [`docs/rna/amalgkit/AGENTS.md`](docs/rna/amalgkit/AGENTS.md) - Amalgkit integration documentation
736-
- **Amalgkit Steps Documentation**: [`docs/rna/amalgkit/steps/AGENTS.md`](docs/rna/amalgkit/steps/AGENTS.md) - Amalgkit workflow steps documentation
737-
- **Protein Documentation**: [`docs/protein/AGENTS.md`](docs/protein/AGENTS.md) - Protein analysis documentation development
738-
- **Epigenome Documentation**: [`docs/epigenome/AGENTS.md`](docs/epigenome/AGENTS.md) - Epigenome documentation development
739-
- **Ontology Documentation**: [`docs/ontology/AGENTS.md`](docs/ontology/AGENTS.md) - Ontology documentation development
740-
- **Phenotype Documentation**: [`docs/phenotype/AGENTS.md`](docs/phenotype/AGENTS.md) - Phenotype documentation development
741-
- **Ecology Documentation**: [`docs/ecology/AGENTS.md`](docs/ecology/AGENTS.md) - Ecology documentation development
742-
- **Mathematical Biology Documentation**: [`docs/math/AGENTS.md`](docs/math/AGENTS.md) - Mathematical biology documentation development
743-
- **GWAS Documentation**: [`docs/gwas/AGENTS.md`](docs/gwas/AGENTS.md) - GWAS module documentation development
744-
- **Information Theory Documentation**: [`docs/information/AGENTS.md`](docs/information/AGENTS.md) - Information-theoretic analysis documentation development
745-
- **Life Events Documentation**: [`docs/life_events/AGENTS.md`](docs/life_events/AGENTS.md) - Life course event analysis documentation development
746-
- **Visualization Documentation**: [`docs/visualization/AGENTS.md`](docs/visualization/AGENTS.md) - Visualization documentation development
747-
- **Simulation Documentation**: [`docs/simulation/AGENTS.md`](docs/simulation/AGENTS.md) - Simulation documentation development
748-
- **Single-Cell Documentation**: [`docs/singlecell/AGENTS.md`](docs/singlecell/AGENTS.md) - Single-cell documentation development
749-
- **Quality Control Documentation**: [`docs/quality/AGENTS.md`](docs/quality/AGENTS.md) - Quality control documentation development
750-
- **Network Analysis Documentation**: [`docs/networks/AGENTS.md`](docs/networks/AGENTS.md) - Network analysis documentation development
751-
- **Machine Learning Documentation**: [`docs/ml/AGENTS.md`](docs/ml/AGENTS.md) - Machine learning documentation development
752-
- **Multi-Omics Documentation**: [`docs/multiomics/AGENTS.md`](docs/multiomics/AGENTS.md) - Multi-omics documentation development
753-
754-
### Scripts and Testing
755-
- **RNA Scripts**: [`scripts/rna/AGENTS.md`](scripts/rna/AGENTS.md) - RNA workflow scripts development
756-
- **Amalgkit Scripts**: [`scripts/rna/amalgkit/AGENTS.md`](scripts/rna/amalgkit/AGENTS.md) - Amalgkit script development
757-
- **Core Scripts**: [`scripts/core/AGENTS.md`](scripts/core/AGENTS.md) - Core utility scripts development
758-
- **Test Data**: [`tests/data/AGENTS.md`](tests/data/AGENTS.md) - Test data organization documentation
697+
AI contributions are documented per module in `src/metainformant/<module>/AGENTS.md`. Each module-level AGENTS.md describes AI assistance for that specific domain.
698+
699+
Note: The `output/` directory is **strictly ephemeral** and contains **only program-generated analysis results**. Never create documentation in `output/`.
759700

760701
## Contact and Support
761702

CHANGELOG.md

Lines changed: 53 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,53 @@
1+
# Changelog
2+
3+
All notable changes to METAINFORMANT are documented in this file.
4+
5+
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/),
6+
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
7+
8+
## [0.2.0] - 2025-12
9+
10+
### Added
11+
- Complete UV package management integration (replaces pip)
12+
- Enhanced RNA workflow with automated genome indexing
13+
- Comprehensive GWAS visualization suite (manhattan, QQ, regional plots)
14+
- Information theory module with syntactic and semantic measures
15+
- Life events analysis with sequence modeling
16+
- Single-cell analysis with Leiden clustering
17+
- Multi-omics integration workflows
18+
- Progress tracking with real-time TUI
19+
- Intelligent caching for expensive computations
20+
21+
### Changed
22+
- Python 3.11+ minimum requirement (previously 3.10+)
23+
- Reorganized configuration system with environment variable overrides
24+
- Improved error handling with context-aware messages
25+
- Enhanced parallel processing with better thread management
26+
27+
### Fixed
28+
- Import errors reduced from ~225 to ~63 (72% improvement)
29+
- Test collection success rate improved to 87%
30+
- Memory optimization for large datasets
31+
32+
### Removed
33+
- pip-based installation (use `uv` exclusively)
34+
- Legacy configuration formats
35+
36+
## [0.1.0] - 2025-06
37+
38+
### Added
39+
- Initial release with core bioinformatics modules
40+
- DNA analysis: sequences, alignment, population genetics, phylogeny
41+
- RNA analysis: amalgkit integration, workflow orchestration
42+
- Protein analysis: sequences, structures, AlphaFold integration
43+
- GWAS pipeline: association testing, QC, visualization
44+
- Network analysis: community detection, centrality measures
45+
- Visualization: 57+ plot types
46+
- Quality control: FASTQ analysis, contamination detection
47+
- Machine learning: classification, regression, feature selection
48+
- Mathematical biology: coalescent, population genetics theory
49+
50+
### Notes
51+
- NO_MOCKING_POLICY established: all tests use real implementations
52+
- UV-only package management policy established
53+
- Output directory isolation policy established

0 commit comments

Comments
 (0)