Skip to content

TableInfo: substance.tsv

Mano Maurya edited this page Nov 26, 2025 · 7 revisions

This table must not be manually created. Users should skip this (and all other controlled vocabulary term tables) when preparing the rest of their datapackage's TSV files for submission. Once the other tables are built, users should then use the CFDE C2M2 utilities to build their C2M2 package.

The substance.tsv table will have as many rows as the number of unique PubChem substance terms appearing in the substance column of either biosample_substance.tsv, subject_substance.tsv, or collection_substance.tsv.

Field Field Description Required? Field Value Type Extra Info
id A PubChem substance ID (SID) Required string Example: 5381226
name A short, human-readable, machine-read-friendly label for this PubChem SID Required string
description A human-readable description of this PubChem SID Optional string
synonyms A list of synonyms for this PubChem SID Optional array of strings
compound The (unique) PubChem compound ID (CID) associated with this PubChem SID Required string Example: 4534

Clone this wiki locally