The Workspace RO-Crate Profile is a specification for describing Bio-OS workspaces. It builds on the RO-Crate standard and packages the computational provenance, dependencies, and environments of multi-stage bioinformatics analysis workflows in one uniform format.
This project aims to provide a complete Workspace RO-Crate Profile specification for:
- Unified description of multi-stage bioinformatics analysis in Bio-OS workspaces
- Standardized description of workflows and Interactive Execution Services (IES)
- Unified encapsulation of computational provenance, dependencies, and analysis environments
- Ensuring reproducibility and traceability of workspace data
The profile maps each kind of workspace activity to a profile from the Workflow Run RO-Crate family:
- Secondary Analysis: Workflow-based analysis, corresponding to Workflow Run Crate
- Tertiary Analysis: Jupyter/RStudio-based interpretive analysis, corresponding to Process Run Crate
- Submission Provenance: Provenance of workflow submissions, corresponding to Provenance Run Crate
The profile describes a workspace with the following core entities:
- Dataset: Root data entity of the workspace
- Workflow: Computational process used for secondary analysis
- IES App: Application definition for tertiary analysis
- IES Task: A specific execution of a tertiary analysis
- Dashboard: Project introduction and visualization of analysis results
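As a rough illustration of how these entities might sit together in an `ro-crate-metadata.json` file, here is a minimal sketch in Python. It is not the normative layout of this profile. The choice of `SoftwareApplication` for an IES App and `CreateAction` for an IES Task is an assumption borrowed from Process Run Crate conventions. The file name `workflows/align.wdl`, the `#jupyter` and `#ies-task-1` identifiers, and the `0.1` profile versions are hypothetical placeholders.

```python
import json

# Profile URIs from the Workflow Run RO-Crate family.
# The 0.1 version suffix is an assumption; use the version the spec targets.
PROFILES = {
    "secondary": "https://w3id.org/ro/wfrun/workflow/0.1",
    "tertiary": "https://w3id.org/ro/wfrun/process/0.1",
    "submission": "https://w3id.org/ro/wfrun/provenance/0.1",
}


def build_workspace_crate(name: str) -> dict:
    """Return a minimal JSON-LD document describing one workspace."""
    graph = [
        {   # Metadata descriptor required by RO-Crate 1.1
            "@id": "ro-crate-metadata.json",
            "@type": "CreativeWork",
            "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
            "about": {"@id": "./"},
        },
        {   # Dataset: root data entity of the workspace
            "@id": "./",
            "@type": "Dataset",
            "name": name,
            "conformsTo": [{"@id": uri} for uri in PROFILES.values()],
            "hasPart": [{"@id": "workflows/align.wdl"}],
            "mentions": [{"@id": "#ies-task-1"}],
        },
        {   # Workflow: secondary analysis (hypothetical file name)
            "@id": "workflows/align.wdl",
            "@type": ["File", "SoftwareSourceCode", "ComputationalWorkflow"],
            "name": "align",
        },
        {   # IES App: modelled here as a SoftwareApplication (assumption)
            "@id": "#jupyter",
            "@type": "SoftwareApplication",
            "name": "JupyterLab",
        },
        {   # IES Task: one execution of the app, modelled as a CreateAction (assumption)
            "@id": "#ies-task-1",
            "@type": "CreateAction",
            "name": "Exploratory notebook session",
            "instrument": {"@id": "#jupyter"},
        },
    ]
    return {"@context": "https://w3id.org/ro/crate/1.1/context", "@graph": graph}


if __name__ == "__main__":
    print(json.dumps(build_workspace_crate("demo-workspace"), indent=2))
```

The Dashboard entity is left out because the text above does not say how it is typed.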
Workspace RO-Crate integrates with the Digger central repository, providing:
- Archiving and sharing of workspace artifacts
- DRS (Data Repository Service) URL conversion
- CRC64 checksum validation to ensure data consistency
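The last two items can be sketched in Python as below. This is an illustration under stated assumptions, not Digger's actual implementation. The text does not say which direction the DRS conversion runs or which CRC64 variant is used. The sketch assumes the hostname-based `drs://` to HTTPS mapping from the GA4GH DRS specification and the CRC-64/XZ variant (reflected ECMA-182 polynomial). The function names and the `digger.example.org` host are hypothetical.

```python
from urllib.parse import urlparse

MASK64 = 0xFFFFFFFFFFFFFFFF
# Reflected form of the ECMA-182 polynomial, as used by CRC-64/XZ (assumed variant).
POLY_REFLECTED = 0xC96C5795D7870F42


def drs_to_https(drs_uri: str) -> str:
    """Map a hostname-based DRS URI to its HTTPS object endpoint."""
    parsed = urlparse(drs_uri)
    object_id = parsed.path.lstrip("/")
    if parsed.scheme != "drs" or not parsed.netloc or not object_id:
        raise ValueError(f"not a hostname-based DRS URI: {drs_uri!r}")
    return f"https://{parsed.netloc}/ga4gh/drs/v1/objects/{object_id}"


def crc64_xz(data: bytes, crc: int = 0) -> int:
    """Compute CRC-64/XZ bit by bit; pass a previous result as crc to stream chunks."""
    crc ^= MASK64  # undo the final XOR of the previous call (or apply the initial value)
    for byte in data:
        crc ^= byte
        for _ in range(8):
            crc = (crc >> 1) ^ POLY_REFLECTED if crc & 1 else crc >> 1
    return crc ^ MASK64


def verify_checksum(data: bytes, expected: int) -> bool:
    """Return True when the payload matches the recorded CRC64 value."""
    return crc64_xz(data) == expected


if __name__ == "__main__":
    print(drs_to_https("drs://digger.example.org/abc123"))
    print(hex(crc64_xz(b"123456789")))
```

The bitwise loop favours readability over speed. A table-driven implementation or a native library would be the usual choice for large files.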
Issues and Pull Requests are welcome to improve this specification.
This project is licensed under the MIT License - see the LICENSE file for details.