Skip to content

Metadata File Format: Mutation Data

Mark Keller edited this page Dec 23, 2018 · 10 revisions

The iMuSE mutation data metadata file specifies where to find the mutation data files corresponding to each mutation data project.

It must be located at obj/meta-data.tsv

Rows correspond to projects (corresponding to a specific cancer type from a specific source).

It contains the following columns:

  • Project: An identifier for the project.
  • Project Source: The source of the project.
  • Project Name: The project name. Typically named by the cancer type.
  • Oncotree Code: A code mapping the cancer type to an Oncotree node.
  • Path to Extended SBS File: The path to the extended file corresponding to the SBS mutation type. Relative to the obj directory.
  • Path to Counts SBS_96 File: The path to the counts file corresponding to the SBS_96 category type. Relative to the obj directory.
  • Path to Extended DBS File: The path to the extended file corresponding to the DBS mutation type. Relative to the obj directory.
  • Path to Counts DBS_78 File: The path to the counts file corresponding to the DBS_78 category type. Relative to the obj directory.
  • Path to Extended INDEL File: The path to the extended file corresponding to the INDEL mutation type. Relative to the obj directory.
  • Path to Counts INDEL_Alexandrov2018_83 File: The path to the counts file corresponding to the INDEL_Alexandrov2018_83 category type. Relative to the obj directory.
  • Path to Clinical File: The path to the clinical data file. Relative to the obj directory.
  • Path to Samples File: The path to the sample-patient mapping file. Relative to the obj directory.
  • Path to Genes File: The path to the gene-alterations file. Relative to the obj directory.

Clone this wiki locally