Skip to content

Commit 763537a

Browse files
authored
Merge pull request #916 from drpatelh/fixes
Bump pipeline version to v3.10 and other small tweaks
2 parents c16fe7c + 92154a3 commit 763537a

File tree

5 files changed

+45
-21
lines changed

5 files changed

+45
-21
lines changed

CHANGELOG.md

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/)
44
and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0.html).
55

6-
## v3.10dev - [date]
6+
## [[3.10](https://github.com/nf-core/rnaseq/releases/tag/3.10)] - 2022-12-21
77

88
### Enhancements & fixes
99

@@ -18,6 +18,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
1818
- [[#900](https://github.com/nf-core/rnaseq/issues/900)] - Add `--recursive` option to `fastq_dir_to_samplesheet.py` script
1919
- [[#902](https://github.com/nf-core/rnaseq/issues/902)] - `check_samplesheet.py` script doesn't output optional columns in samplesheet
2020
- [[#907](https://github.com/nf-core/rnaseq/issues/907)] - Add `--extra_star_align_args` and `--extra_salmon_quant_args` parameter
21+
- [[#912](https://github.com/nf-core/rnaseq/issues/912)] - Add UMI deduplication before quantification in tube map
2122

2223
### Parameters
2324

@@ -31,6 +32,24 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
3132
> **NB:** Parameter has been **added** if just the new parameter information is present.
3233
> **NB:** Parameter has been **removed** if new parameter information isn't present.
3334
35+
### Software dependencies
36+
37+
Note, since the pipeline is now using Nextflow DSL2, each process will be run with its own [Biocontainer](https://biocontainers.pro/#/registry). This means that on occasion it is entirely possible for the pipeline to be using different versions of the same tool. However, the overall software dependency changes compared to the last release have been listed below for reference.
38+
39+
| Dependency | Old version | New version |
40+
| ----------------------------------- | ----------- | ----------- |
41+
| `bbmap` | 38.93 | 39.01 |
42+
| `bioconductor-dupradar` | 1.18.0 | 1.28.0 |
43+
| `bioconductor-summarizedexperiment` | 1.20.0 | 1.24.0 |
44+
| `bioconductor-tximeta` | 1.8.0 | 1.12.0 |
45+
| `fq` | 0.9.1 | |
46+
| `salmon` | 1.5.2 | 1.9.0 |
47+
| `samtools` | 1.15.1 | 1.16.1 |
48+
49+
> **NB:** Dependency has been **updated** if both old and new version information is present.
50+
> **NB:** Dependency has been **added** if just the new version information is present.
51+
> **NB:** Dependency has been **removed** if version information isn't present.
52+
3453
## [[3.9](https://github.com/nf-core/rnaseq/releases/tag/3.9)] - 2022-09-30
3554

3655
### Enhancements & fixes

README.md

Lines changed: 19 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -32,28 +32,29 @@ You can find numerous talks on the [nf-core events page](https://nf-co.re/events
3232
> The SRA download functionality has been removed from the pipeline (`>=3.2`) and ported to an independent workflow called [nf-core/fetchngs](https://nf-co.re/fetchngs). You can provide `--nf_core_pipeline rnaseq` when running nf-core/fetchngs to download and auto-create a samplesheet containing publicly available samples that can be accepted directly as input by this pipeline.
3333
3434
1. Merge re-sequenced FastQ files ([`cat`](http://www.linfo.org/cat.html))
35-
2. Read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))
36-
3. UMI extraction ([`UMI-tools`](https://github.com/CGATOxford/UMI-tools))
37-
4. Adapter and quality trimming ([`Trim Galore!`](https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/))
38-
5. Removal of genome contaminants ([`BBSplit`](http://seqanswers.com/forums/showthread.php?t=41288))
39-
6. Removal of ribosomal RNA ([`SortMeRNA`](https://github.com/biocore/sortmerna))
40-
7. Choice of multiple alignment and quantification routes:
35+
2. Sub-sample FastQ files and auto-infer strandedness ([`fq`](https://github.com/stjude-rust-labs/fq), [`Salmon`](https://combine-lab.github.io/salmon/))
36+
3. Read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))
37+
4. UMI extraction ([`UMI-tools`](https://github.com/CGATOxford/UMI-tools))
38+
5. Adapter and quality trimming ([`Trim Galore!`](https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/))
39+
6. Removal of genome contaminants ([`BBSplit`](http://seqanswers.com/forums/showthread.php?t=41288))
40+
7. Removal of ribosomal RNA ([`SortMeRNA`](https://github.com/biocore/sortmerna))
41+
8. Choice of multiple alignment and quantification routes:
4142
1. [`STAR`](https://github.com/alexdobin/STAR) -> [`Salmon`](https://combine-lab.github.io/salmon/)
4243
2. [`STAR`](https://github.com/alexdobin/STAR) -> [`RSEM`](https://github.com/deweylab/RSEM)
4344
3. [`HiSAT2`](https://ccb.jhu.edu/software/hisat2/index.shtml) -> **NO QUANTIFICATION**
44-
8. Sort and index alignments ([`SAMtools`](https://sourceforge.net/projects/samtools/files/samtools/))
45-
9. UMI-based deduplication ([`UMI-tools`](https://github.com/CGATOxford/UMI-tools))
46-
10. Duplicate read marking ([`picard MarkDuplicates`](https://broadinstitute.github.io/picard/))
47-
11. Transcript assembly and quantification ([`StringTie`](https://ccb.jhu.edu/software/stringtie/))
48-
12. Create bigWig coverage files ([`BEDTools`](https://github.com/arq5x/bedtools2/), [`bedGraphToBigWig`](http://hgdownload.soe.ucsc.edu/admin/exe/))
49-
13. Extensive quality control:
45+
9. Sort and index alignments ([`SAMtools`](https://sourceforge.net/projects/samtools/files/samtools/))
46+
10. UMI-based deduplication ([`UMI-tools`](https://github.com/CGATOxford/UMI-tools))
47+
11. Duplicate read marking ([`picard MarkDuplicates`](https://broadinstitute.github.io/picard/))
48+
12. Transcript assembly and quantification ([`StringTie`](https://ccb.jhu.edu/software/stringtie/))
49+
13. Create bigWig coverage files ([`BEDTools`](https://github.com/arq5x/bedtools2/), [`bedGraphToBigWig`](http://hgdownload.soe.ucsc.edu/admin/exe/))
50+
14. Extensive quality control:
5051
1. [`RSeQC`](http://rseqc.sourceforge.net/)
5152
2. [`Qualimap`](http://qualimap.bioinfo.cipf.es/)
5253
3. [`dupRadar`](https://bioconductor.org/packages/release/bioc/html/dupRadar.html)
5354
4. [`Preseq`](http://smithlabresearch.org/software/preseq/)
5455
5. [`DESeq2`](https://bioconductor.org/packages/release/bioc/html/DESeq2.html)
55-
14. Pseudo-alignment and quantification ([`Salmon`](https://combine-lab.github.io/salmon/); _optional_)
56-
15. Present QC for raw read, alignment, gene biotype, sample similarity, and strand-specificity checks ([`MultiQC`](http://multiqc.info/), [`R`](https://www.r-project.org/))
56+
15. Pseudo-alignment and quantification ([`Salmon`](https://combine-lab.github.io/salmon/); _optional_)
57+
16. Present QC for raw read, alignment, gene biotype, sample similarity, and strand-specificity checks ([`MultiQC`](http://multiqc.info/), [`R`](https://www.r-project.org/))
5758

5859
> **Warning**
5960
> Quantification isn't performed if using `--aligner hisat2` due to the lack of an appropriate option to calculate accurate expression estimates from HISAT2 derived genomic alignments. However, you can use this route if you have a preference for the alignment, QC and other types of downstream analysis compatible with the output of HISAT2.
@@ -103,6 +104,10 @@ The pipeline was re-written in Nextflow DSL2 and is primarily maintained by Hars
103104
The pipeline workflow diagram was designed by Sarah Guinchard ([@G-Sarah](https://github.com/G-Sarah)) and James Fellows Yates ([@jfy133](https://github.com/jfy133)).
104105

105106
Many thanks to other who have helped out along the way too, including (but not limited to):
107+
[@MatthiasZepper](https://github.com/MatthiasZepper),
108+
[@Emiller88](https://github.com/Emiller88),
109+
[@maxulysse](https://github.com/maxulysse),
110+
[@robsyme](https://github.com/robsyme),
106111
[@Galithil](https://github.com/Galithil),
107112
[@pditommaso](https://github.com/pditommaso),
108113
[@orzechoj](https://github.com/orzechoj),

conf/modules.config

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -613,7 +613,7 @@ if (!params.skip_alignment && params.aligner == 'star_salmon') {
613613
]
614614
}
615615

616-
withName: 'NFCORE_RNASEQ:RNASEQ:UMITOOLS_PREPAREFORRSEM' {
616+
withName: 'NFCORE_RNASEQ:RNASEQ:UMITOOLS_PREPAREFORSALMON' {
617617
ext.prefix = { "${meta.id}.umi_dedup.transcriptome.filtered" }
618618
publishDir = [
619619
path: { "${params.outdir}/${params.aligner}/umitools/log" },

nextflow.config

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -256,7 +256,7 @@ manifest {
256256
description = """RNA sequencing analysis pipeline for gene/isoform quantification and extensive quality control."""
257257
mainScript = 'main.nf'
258258
nextflowVersion = '!>=22.10.1'
259-
version = '3.10dev'
259+
version = '3.10'
260260
doi = 'https://doi.org/10.5281/zenodo.1400710'
261261
}
262262

workflows/rnaseq.nf

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -92,14 +92,14 @@ ch_biotypes_header_multiqc = file("$projectDir/assets/multiqc/biotypes_header.
9292
//
9393
// MODULE: Loaded from modules/local/
9494
//
95-
include { UMITOOLS_PREPAREFORRSEM } from '../modules/local/umitools_prepareforrsem.nf'
9695
include { BEDTOOLS_GENOMECOV } from '../modules/local/bedtools_genomecov'
9796
include { DESEQ2_QC as DESEQ2_QC_STAR_SALMON } from '../modules/local/deseq2_qc'
9897
include { DESEQ2_QC as DESEQ2_QC_RSEM } from '../modules/local/deseq2_qc'
9998
include { DESEQ2_QC as DESEQ2_QC_SALMON } from '../modules/local/deseq2_qc'
10099
include { DUPRADAR } from '../modules/local/dupradar'
101100
include { MULTIQC } from '../modules/local/multiqc'
102101
include { MULTIQC_CUSTOM_BIOTYPE } from '../modules/local/multiqc_custom_biotype'
102+
include { UMITOOLS_PREPAREFORRSEM as UMITOOLS_PREPAREFORSALMON } from '../modules/local/umitools_prepareforrsem.nf'
103103
include { MULTIQC_TSV_FROM_LIST as MULTIQC_TSV_FAIL_MAPPED } from '../modules/local/multiqc_tsv_from_list'
104104
include { MULTIQC_TSV_FROM_LIST as MULTIQC_TSV_FAIL_TRIMMED } from '../modules/local/multiqc_tsv_from_list'
105105
include { MULTIQC_TSV_FROM_LIST as MULTIQC_TSV_STRAND_CHECK } from '../modules/local/multiqc_tsv_from_list'
@@ -421,14 +421,14 @@ workflow RNASEQ {
421421

422422
// Fix paired-end reads in name sorted BAM file
423423
// See: https://github.com/nf-core/rnaseq/issues/828
424-
UMITOOLS_PREPAREFORRSEM (
424+
UMITOOLS_PREPAREFORSALMON (
425425
ch_umitools_dedup_bam.paired_end
426426
)
427-
ch_versions = ch_versions.mix(UMITOOLS_PREPAREFORRSEM.out.versions.first())
427+
ch_versions = ch_versions.mix(UMITOOLS_PREPAREFORSALMON.out.versions.first())
428428

429429
ch_umitools_dedup_bam
430430
.single_end
431-
.mix(UMITOOLS_PREPAREFORRSEM.out.bam)
431+
.mix(UMITOOLS_PREPAREFORSALMON.out.bam)
432432
.set { ch_transcriptome_bam }
433433
}
434434

0 commit comments

Comments
 (0)