ClarID: A Human-Readable and Compact Identifier Specification for Biomedical Metadata Integration
📘 Documentation: https://cnag-biomedical-informatics.github.io/clarid-tools
📓 Google Colab tutorial: https://colab.research.google.com/drive/1smS42yzL7qYV1kCz34baTWLRr_AAap-r
🗂️ Use Cases I & II GDC Data: https://github.com/CNAG-Biomedical-Informatics/clarid-tools/tree/main/nb/data
📦 CPAN Distribution: https://metacpan.org/pod/ClarID::Tools
🐳 Docker Hub Image: https://hub.docker.com/r/manuelrueda/clarid-tools/tags
ClarID-Tools is a flexible, schema-driven toolkit for generating and parsing structured identifiers for subject and biosample data. The objective is to standardize how subject and biosample metadata are encoded into compact, informative IDs for downstream integration and tracking.
- 🧬 Biosample and Subject ID generation from structured metadata
- 🩺 Support for clinical and experimental metadata, including species, tissue, assay, condition, and more
- 📄 Human-readable and stub formats, for verbose or compact identifiers respectively
- 🧪 Bulk and single-record encoding/decoding
- ✅ Schema validation using JSON Schema and YAML codebooks
- 📦 Command-line interface
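To make the codebook-driven encoding and decoding described above more concrete, here is a minimal Python sketch. It is not the ClarID-Tools implementation and does not reproduce the real ClarID codebook or identifier layout: the `TOY_CODEBOOK` entries, the field order, the `-` separator, and the `encode`/`decode` function names are all assumptions made for illustration.

```python
# Illustrative only: this toy codebook and ID layout are assumptions,
# not the actual ClarID codebook or identifier format.
TOY_CODEBOOK = {
    "species": {"human": "HSA", "mouse": "MMU"},
    "tissue": {"liver": "LIV", "lung": "LNG"},
    "assay": {"rna_seq": "RNA", "wgs": "WGS"},
}
FIELD_ORDER = ["species", "tissue", "assay"]  # hypothetical field order
SEPARATOR = "-"                               # hypothetical separator


def encode(record: dict) -> str:
    """Map each metadata value to its code and join the codes in field order."""
    codes = []
    for field in FIELD_ORDER:
        value = record[field]
        try:
            codes.append(TOY_CODEBOOK[field][value])
        except KeyError:
            raise ValueError(f"unknown {field} value: {value!r}") from None
    return SEPARATOR.join(codes)


def decode(identifier: str) -> dict:
    """Split an identifier and map each code back to its metadata value."""
    parts = identifier.split(SEPARATOR)
    if len(parts) != len(FIELD_ORDER):
        raise ValueError(f"expected {len(FIELD_ORDER)} fields, got {len(parts)}")
    record = {}
    for field, code in zip(FIELD_ORDER, parts):
        # Invert the per-field mapping so codes look up their values.
        reverse = {c: v for v, c in TOY_CODEBOOK[field].items()}
        if code not in reverse:
            raise ValueError(f"unknown {field} code: {code!r}")
        record[field] = reverse[code]
    return record
```

In this sketch, bulk encoding is simply `[encode(r) for r in records]`. Values missing from the codebook raise an error instead of being passed through, which is one simple way to keep identifiers consistent with the schema.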
We offer two modes of installation; see the documentation linked above for installation instructions.
Use Cases:
If you use ClarID-Tools in your work, please cite:
Manuel Rueda and Ivo G. Gut (2025). ClarID: A Human-Readable and Compact Identifier Specification for Biomedical Metadata Integration. Submitted.
Written by Manuel Rueda, PhD. More information about CNAG is available at https://www.cnag.eu.
ClarID-Tools is released under the Artistic License. See the LICENSE file for details.