Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions build/assets/custom-dictionary.txt
Original file line number Diff line number Diff line change
Expand Up @@ -58,6 +58,7 @@ eq
et
FASTQ
Foltz
ganglioglioma
GC
genomic
genotype
Expand Down Expand Up @@ -117,6 +118,7 @@ Nextflow
neuroblastoma
Nozomi
nozomione
oligodendrocyte
O'Malley
OMalley
openscpca
Expand Down Expand Up @@ -170,6 +172,7 @@ UMAP
UMAPs
UMI
UMIs
uncharacterized
unicode
unspliced
UTHealth
Expand Down
26 changes: 26 additions & 0 deletions content/03.results.md
Original file line number Diff line number Diff line change
Expand Up @@ -193,6 +193,32 @@ However, there is notable heterogeneity even within HGG and LGG subtypes (Figure
Figure {@fig:fig4}D shows the expression of cell-type specific markers for all immune cell types, validating the assignment of various immune cell types with consensus cell types.
A summary of all the consensus cell types observed in all other ScPCA samples can be found in Figure {@fig:figS6}.


### Augmenting cell type annotations for malignant cell identification

Because the consensus annotations were derived from automated methods that do not specifically consider tumor cell states, they provide limited information for distinguishing malignant from normal cells.
We therefore sought complementary avenues to increase the value of cell type annotations with information that can be leveraged for this purpose.

In parallel to developing the ScPCA Portal, we launched the OpenScPCA project [@url: https://openscpca.readthedocs.io], an open-science collaborative initiative to further characterize and analyze Portal data.
Thus far, we have added manual cell type annotations for two projects, `SCPCP000004` (neuroblastoma) and `SCPCP000015` (Ewing sarcoma), to the Portal based on analyses performed in the OpenScPCA project.
Figure {@fig:fig5}A displays, for example, a UMAP of all libraries in `SCPCP000004` highlighting this project's OpenScPCA annotations which were derived using the `NBAtlas` dataset as reference [@doi:10.1016/j.celrep.2024.114804].
Unlike the consensus cell type annotations, the OpenScPCA project annotations distinguish between normal and malignant cells and contain far fewer uncharacterized cells.
Indeed, for `SCPCP000004`, the consensus cell type procedure labeled only ~43% of cells, but the OpenScPCA project labeled ~91% of cells, thereby adding substantial value to the data.
Note that the Portal's summary cell type report will include comparisons between annotations made in `scpca-nf` to OpenScPCA annotations for relevant libraries.

In an effort to identify potential malignant cells across all samples in the Portal, we included a step in the `scpca-nf` pipeline to run `InferCNV` [@url:https://github.com/broadinstitute/inferCNV] to quantify copy-number alterations (Figure {@fig:fig2}A).
The estimates complement the consensus cell types by providing a proxy for a cell's malignant status, such that cells with high levels of CNV are more likely to be tumor than normal cells.
Indeed, there is broad correspondence between malignant cells (Figure {@fig:fig5}A) and the total per-cell CNV across libraries in `SCPCP000004` (Figure {@fig:fig5}B); malignant cells tend to have higher levels of CNV, whereas normal cells tend to have lower levels of CNV.
We probed this relationship further within a single neuroblastoma library, `SCPCL000130`, finding clear signatures of canonical neuroblastoma CNV events such as `1q` loss, `11q` gain, and `17p` loss [@doi:10.1038/nrdp.2016.78; @doi:10.1016/j.celrep.2024.114804; @doi:10.1158/2159-8290.CD-14-0622] within malignant cells (Figure {@fig:fig5}C).
By contrast, normal cells show very few CNV events, consistent with their annotations.
Most intriguingly, unknown cells show CNV event signatures more similar to the malignant cells than to the normal cells, suggesting many of these cells may indeed be malignant.

We also see traces of this relationship even when looking at the consensus cell types in conjunction with CNV events.
In Figure {@fig:fig5}D, we show the distributions of per-cell total CNV events for the most commonly-observed consensus cell types in the neuroblastoma library `SCPCL000130`.
Here, Unknown and neuron cells have distinctly higher total CNV values compared to other cell types, suggesting that they are likely to be malignant cells.
We see similar patterns for the ganglioglioma library `SCPCL000049` (Figure {@fig:figS4}B-C), where consensus T cells have low total CNV values, while other cell types including oligodendrocyte precursor cells, neuron associated cells, and Unknown cells have much higher total CNV values.
As such, joint information from consensus cell type annotations and `InferCNV` results may be used to identify malignant cells across libraries in the Portal, including those which do not yet have associated OpenScPCA project annotations.

## Analysis of bulk RNA-seq

Several projects in the ScPCA Portal contain bulk RNA-seq data in addition to single-cell/nuclei RNA-seq data.
Expand Down
Loading