Skip to content

Commit c3fe0e5

Browse files
authored
Merge pull request #1550 from atrigila/unify_summarised_experiment
Unify summarised experiment
2 parents 15c160f + 20294ca commit c3fe0e5

33 files changed

+250
-277
lines changed

CHANGELOG.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
1919
- [PR #1528](https://github.com/nf-core/rnaseq/pull/1528) - Improve JSON schema validation files
2020
- [PR #1523](https://github.com/nf-core/rnaseq/pull/1523) - Update preprocessing subworkflow to fix linting block on trimming
2121
- [PR #1521](https://github.com/nf-core/rnaseq/pull/1521) - Updated Perl conda package version for local module gtf2bed for Arm compatibility.
22+
- [PR #1550](https://github.com/nf-core/rnaseq/pull/1550) - Simplify `SummarizedExperiment` outputs.
2223
- [PR #1553](https://github.com/nf-core/rnaseq/pull/1553) - Make jobs automatically resubmit for exit code 175
2324

2425
# 3.18.0 - 2024-12-19

docs/output.md

Lines changed: 21 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -728,16 +728,33 @@ The principal output files are the same between Salmon and Kallisto:
728728
- `<pseudo_aligner>/`
729729
- `<pseudo_aligner>.merged.gene_counts.tsv`: Matrix of gene-level raw counts across all samples.
730730
- `<pseudo_aligner>.gene_tpm.tsv`: Matrix of gene-level TPM values across all samples.
731-
- `<pseudo_aligner>.gene_counts.rds`: RDS object that can be loaded in R that contains a [SummarizedExperiment](https://bioconductor.org/packages/release/bioc/html/SummarizedExperiment.html) container with the TPM (`abundance`), estimated counts (`counts`) and transcript length (`length`) in the assays slot for genes.
731+
- `all_samples_gene.SummarizedExperiment.rds`: RDS object that can be loaded in R that contains a [SummarizedExperiment](https://bioconductor.org/packages/release/bioc/html/SummarizedExperiment.html) container with the abundance TPM (`tpm`), estimated counts (`counts`) and gene length (`length`), estimated library size-scaled counts (`counts_scaled`), estimated length-scaled counts (`counts_length_scaled`) in the assays slot for genes.
732732
- `<pseudo_aligner>.merged.gene_lengths.tsv`: Matrix of average within-sample transcript lengths for each gene across all samples.
733733
- `<pseudo_aligner>.merged.gene_counts_scaled.tsv`: Matrix of gene-level library size-scaled estimated counts across all samples.
734-
- `<pseudo_aligner>.merged.gene_counts_scaled.rds`: RDS object that can be loaded in R that contains a [SummarizedExperiment](https://bioconductor.org/packages/release/bioc/html/SummarizedExperiment.html) container with the TPM (`abundance`), estimated library size-scaled counts (`counts`) and transcript length (`length`) in the assays slot for genes.
735734
- `<pseudo_aligner>.merged.gene_counts_length_scaled.tsv`: Matrix of gene-level length-scaled estimated counts across all samples.
736-
- `<pseudo_aligner>.merged.gene_counts_length_scaled.rds`: RDS object that can be loaded in R that contains a [SummarizedExperiment](https://bioconductor.org/packages/release/bioc/html/SummarizedExperiment.html) container with the TPM (`abundance`), estimated length-scaled counts (`counts`) and transcript length (`length`) in the assays slot for genes.
737735
- `<pseudo_aligner>.merged.transcript_counts.tsv`: Matrix of isoform-level raw counts across all samples.
738736
- `<pseudo_aligner>.merged.transcript_tpm.tsv`: Matrix of isoform-level TPM values across all samples.
739-
- `<pseudo_aligner>.merged.transcript_counts.rds`: RDS object that can be loaded in R that contains a [SummarizedExperiment](https://bioconductor.org/packages/release/bioc/html/SummarizedExperiment.html) container with the TPM (`abundance`), estimated isoform-level raw counts (`counts`) and transcript length (`length`) in the assays slot for transcripts.
740737
- `tx2gene.tsv`: Tab-delimited file containing gene to transcripts ids mappings.
738+
- `all_samples_transcript.SummarizedExperiment.rds`: RDS object that can be loaded in R that contains a [SummarizedExperiment](https://bioconductor.org/packages/release/bioc/html/SummarizedExperiment.html) container with the abundance TPM (`tpm`), estimated isoform-level raw counts (`counts`) and transcript length (`length`) in the assays slot for transcripts.
739+
740+
> **TIP:** You can access specific assay matrices from the `SummarizedExperiment` RDS object with the following R code:
741+
742+
```r
743+
library(SummarizedExperiment)
744+
745+
# Load the RDS object
746+
se <- readRDS("all_samples_gene.SummarizedExperiment.rds")
747+
748+
# View available assays
749+
assayNames(se)
750+
751+
# Access a specific assay, e.g., length-scaled counts
752+
counts_length_scaled <- assay(se, "counts_length_scaled")
753+
754+
# Print the matrix
755+
print(counts_length_scaled)
756+
```
757+
741758
</details>
742759

743760
An additional subset of files are distinct to each tool, for Salmon:

modules.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -373,7 +373,7 @@
373373
},
374374
"quantify_pseudo_alignment": {
375375
"branch": "master",
376-
"git_sha": "a1abf90966a2a4016d3c3e41e228bfcbd4811ccc",
376+
"git_sha": "1f008221e451e7a4738226c49e69aaa2eb731369",
377377
"installed_by": ["subworkflows"]
378378
},
379379
"utils_nextflow_pipeline": {

subworkflows/nf-core/quantify_pseudo_alignment/main.nf

Lines changed: 20 additions & 23 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

subworkflows/nf-core/quantify_pseudo_alignment/nextflow.config

Lines changed: 7 additions & 11 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

subworkflows/nf-core/quantify_pseudo_alignment/tests/main.nf.test

Lines changed: 17 additions & 9 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

0 commit comments

Comments
 (0)