
oncotree code inconsistency between different studies for TCGA sample #2226

@anqi970301

I noticed that the same sample is assigned different Oncotree codes in different studies. For example, for sample ID TCGA-2E-A9G8-01:

| Study ID | Oncotree code |
| --- | --- |
| ucec_tcga_pan_can_atlas_2018 | UEC |
| ucec_tcga | UCEC |
| ucec_tcga_gdc | UCEC |
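
To make this kind of comparison reproducible, here is a minimal Python sketch that looks up one sample's Oncotree code in several clinical files and flags a mismatch. It assumes each file is tab-separated, may start with `#`-prefixed metadata rows, and has `SAMPLE_ID` and `ONCOTREE_CODE` columns; the function names are hypothetical, and the column names should be adjusted if a study's file uses different ones.

```python
import csv


def read_oncotree_codes(clinical_text):
    """Map SAMPLE_ID -> ONCOTREE_CODE from the text of a clinical sample file.

    Assumption: tab-separated text whose optional metadata rows start with '#'
    and whose header row contains SAMPLE_ID and ONCOTREE_CODE.
    """
    # Drop blank lines and the '#'-prefixed metadata rows before the header.
    rows = [
        line
        for line in clinical_text.splitlines()
        if line.strip() and not line.startswith("#")
    ]
    reader = csv.DictReader(rows, delimiter="\t")
    return {
        row["SAMPLE_ID"]: row["ONCOTREE_CODE"]
        for row in reader
        if row.get("SAMPLE_ID") and row.get("ONCOTREE_CODE")
    }


def codes_across_studies(study_texts, sample_id):
    """Return {study_id: code} for every study whose file contains sample_id."""
    result = {}
    for study_id, text in study_texts.items():
        codes = read_oncotree_codes(text)
        if sample_id in codes:
            result[study_id] = codes[sample_id]
    return result


def is_inconsistent(codes_by_study):
    """True when the studies disagree on the code for the sample."""
    return len(set(codes_by_study.values())) > 1


if __name__ == "__main__":
    # Tiny inline stand-ins for the real files, using the values reported above.
    header = "SAMPLE_ID\tONCOTREE_CODE\n"
    study_texts = {
        "ucec_tcga_pan_can_atlas_2018": header + "TCGA-2E-A9G8-01\tUEC\n",
        "ucec_tcga": header + "TCGA-2E-A9G8-01\tUCEC\n",
    }
    found = codes_across_studies(study_texts, "TCGA-2E-A9G8-01")
    print(found, "inconsistent:", is_inconsistent(found))
```

With real data, `study_texts` would be built by reading each study's clinical file from a local checkout of the repository.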

I can think of the following possible causes for the difference:
1. GDC may have updated clinical annotations.
2. Different versions of the Oncotree code may have been applied.
3. The mapping strategy could have changed.

Update:

I checked the history of public/ucec_tcga_pan_can_atlas_2018/data_clinical.txt and public/ucec_tcga/data_bcr_clinical_data_sample.txt, and this discrepancy has persisted since the beginning of the file history in 2018.

I found several other issues raising related questions (e.g. #1405), but I could not find an answer in them.

Could you please point me to the documentation of release notes on:

Thank you for your help.
