Skip to content
Closed
Show file tree
Hide file tree
Changes from 250 commits
Commits
Show all changes
636 commits
Select commit Hold shift + click to select a range
ebe23fb
Merge pull request #133 from nf-core/nf-core-template-merge-3.4.1
FranBonath Nov 7, 2025
aa0ee42
Merge branch 'dev' into bwa_index
maxulysse Nov 7, 2025
1e472c8
Merge branch 'dev' into picard_tools_dev
maxulysse Nov 7, 2025
b119692
fix snapshots
maxulysse Nov 7, 2025
d45e427
fix snapshots
maxulysse Nov 7, 2025
3352f23
Merge branch 'dev' into multiqc-index
pontushojer Nov 7, 2025
17bbc74
.bai files in the same folder as bam files
agrima2010 Nov 18, 2025
ce94600
update snapshots
maxulysse Nov 19, 2025
b9c09ed
update modules
maxulysse Nov 19, 2025
8e76c46
update snapshots once again
maxulysse Nov 19, 2025
f38e893
patch fastqscreen/fastqscreen
maxulysse Nov 19, 2025
de6821f
Merge pull request #128 from agrima2010/picard_tools_dev
maxulysse Nov 19, 2025
37706c4
Merge branch 'dev' into bwa_index
maxulysse Nov 19, 2025
c28635a
Update CHANGELOG.md
maxulysse Nov 19, 2025
205c2e3
solve merge conflicts
maxulysse Nov 19, 2025
f281848
removed hasta config and corrected nextflow_schema.json
agrima2010 Nov 20, 2025
6edf19f
nextflow_schema.json
agrima2010 Nov 20, 2025
aff9444
nextflow.config
agrima2010 Nov 20, 2025
d9373dd
removed channel output of BAM from seqinspector.nf
agrima2010 Nov 20, 2025
2bc95ba
bwa_index added to nextflow.config
agrima2010 Nov 20, 2025
1fc87a8
Update nextflow_schema.json
agrima2010 Nov 20, 2025
71287cd
Merge pull request #132 from agrima2010/bwa_index
agrima2010 Nov 20, 2025
f3b0108
Merge branch 'dev' into nf-core-template-merge-3.5.1
maxulysse Nov 20, 2025
3a77ea6
fix multiqc modules install
maxulysse Nov 20, 2025
a45f606
Merge pull request #144 from nf-core/nf-core-template-merge-3.5.1
maxulysse Nov 20, 2025
de0cdb8
Merge branch 'dev' into run-dirs
matrulda Nov 24, 2025
b85608c
Update rundirparser test and add nextflow config for it
matrulda Nov 24, 2025
6e961e8
Fix #94. Remove outdated TODOs
KarNair Nov 24, 2025
a3ad873
Add #94 to Changelog
KarNair Nov 24, 2025
ab40404
Add #94 to Changelog
KarNair Nov 24, 2025
abf53dc
Update README.md
FranBonath Nov 24, 2025
31e3a83
Merge branch 'dev' into dev_docs
FranBonath Nov 24, 2025
fbba2d4
Merge pull request #145 from KarNair/validate-test-data
KarNair Nov 24, 2025
8f71810
Merge pull request #111 from Patricie34/dev_docs
FranBonath Nov 24, 2025
8c09da2
add tubemap and logo to intro page
FranBonath Nov 24, 2025
2a460f4
fix merge conflicts
FranBonath Nov 24, 2025
317cfd4
update tubemap and some updates to README
FranBonath Nov 24, 2025
7996d9e
update tubmap again
FranBonath Nov 24, 2025
930f0f2
add SeqFu Stats to README.md
FranBonath Nov 24, 2025
2aad60b
Install picard_collecthsmetrics.
Nov 24, 2025
bbd7eed
Install picard_createsequencedictionary.
Nov 24, 2025
4acac74
Add test samplesheet and test config.
Nov 24, 2025
38b2fae
Change modules.json to include installed modules.
Nov 24, 2025
71f1fe7
Set up input parameters.
Nov 24, 2025
62cc3ce
Add picard_collecthsmetrics to workflow.
Nov 24, 2025
df6ebc1
Make SAMTOOLS_INDEX required if bwamem2_mem is run.
Nov 24, 2025
72291c3
Rundirparser: Add tar support, remove dummy script, update format of …
matrulda Nov 24, 2025
f422cd8
Remove skip condition for samtools_index when calling picard_collectm…
Nov 24, 2025
f91167b
Have tools list in README in Table
FranBonath Nov 24, 2025
0894a2b
lint errors
ramprasadn Nov 24, 2025
4c4214b
add darkmode options in README, add darkmode tubemap
FranBonath Nov 24, 2025
06e66ac
lint warnings
ramprasadn Nov 24, 2025
226c89f
Add qc_bam subworkflow to run picard_collecthsmetrics.
Nov 24, 2025
069ee40
Add subworkflow qc_bam.
Nov 24, 2025
98ab4a3
Add subworkflow QC_BAM.
Nov 24, 2025
2083cfc
Add logic to skip collecthsmetrics by default through addin "run_pica…
Nov 24, 2025
66f0c38
Modified seqinspector.nf and added subworkflow
agrima2010 Nov 24, 2025
8a68c93
Add TMP_DIR argument to picard_collecthsmetrics.
Nov 25, 2025
3693ca5
update README and usage.md
FranBonath Nov 25, 2025
48eec4e
Create subworkflow directory. Change channels created from params to …
Nov 25, 2025
75b2f5e
Add curly braket.
Nov 25, 2025
f014036
Add second sample to samplesheet.
Nov 25, 2025
bc3320f
Fix ch_ref_dict.
Nov 25, 2025
5381e4e
Remove outdated qc_bam.nf.
Nov 25, 2025
7473297
Update README.md
FranBonath Nov 25, 2025
b478b36
Update README.md
FranBonath Nov 25, 2025
95582a0
[automated] Fix code linting
nf-core-bot Nov 25, 2025
34baa10
Change if order in qc_bam.
Nov 25, 2025
4637abb
Merge pull request #148 from FranBonath/dev
FranBonath Nov 25, 2025
9f1305e
update output.md
FranBonath Nov 25, 2025
0743b58
Added subworkflow prepare_genome
agrima2010 Nov 25, 2025
edf9076
Move creation of ch_hsmetrics_in to qc_bam.
Nov 25, 2025
1326cec
Merge branch 'nf-core:dev' into output_changes
FranBonath Nov 25, 2025
bf11922
Add qc_bam test!
Nov 25, 2025
f8158fd
Remove test config and samplesheet.
Nov 25, 2025
73efbf7
Minor changes
agrima2010 Nov 25, 2025
0e3f63f
Snapshots
agrima2010 Nov 25, 2025
f876645
Add defaults for collecthsmetrics in modules.config.
Nov 25, 2025
15142b8
Fix so output files are only accessed if QC_BAM is run.
Nov 25, 2025
49e1095
ss
agrima2010 Nov 25, 2025
452ca7b
fix precommit
agrima2010 Nov 25, 2025
aed9052
Merge branch 'dev' into collethsmetrics
eliottBo Nov 25, 2025
14f626c
Add rundirparser as skip option
matrulda Nov 25, 2025
c7444af
Adapt tests for rundirparser (only NovaSeq6000 test data have the req…
matrulda Nov 25, 2025
0009fb6
Add information about run_picard_collecthsmetrics flag to usage.md
eliottBo Nov 25, 2025
3e62620
Changed parameter bwa_index to bwamem2 and changed channel names
agrima2010 Nov 25, 2025
b4ef71c
Update README.md
Nov 25, 2025
dfbc994
Update CITATIONS.md.
Nov 25, 2025
40cd40a
Add PR link to CHANGELOG.md
eliottBo Nov 25, 2025
c4eca8c
Merge branch 'collethsmetrics' of https://github.com/eliottBo/seqinsp…
eliottBo Nov 25, 2025
f413842
Update output.md.
Nov 25, 2025
14ed134
WIP: Updating module test for rundirparser
matrulda Nov 25, 2025
cc77efa
Remove cg_test profile.
Nov 25, 2025
d0a960a
Add output file description for CollectHsMetrics in output.md
eliottBo Nov 25, 2025
ae00279
update tools table in README
FranBonath Nov 25, 2025
2583e5e
Fix title in Picard CollectHSmetrics in output.md
eliottBo Nov 25, 2025
c35a1d9
Add File description for collecthsmetrics in output.md
eliottBo Nov 25, 2025
35626f4
Merge branch 'collethsmetrics' of https://github.com/eliottBo/seqinsp…
eliottBo Nov 25, 2025
ddafe8b
Merge branch 'dev' into changes_usage.md
KarNair Nov 25, 2025
ee4afdb
Merge pull request #150 from ramprasadn/pipeline-lint
ramprasadn Nov 25, 2025
0b3af15
Update README.md
FranBonath Nov 25, 2025
76038b5
[automated] Fix code linting
nf-core-bot Nov 25, 2025
985ddcb
Added params in root main.nf and snapshots
agrima2010 Nov 25, 2025
646ddef
Merge branch 'dev' into subworkflows
agrima2010 Nov 25, 2025
4376756
Merge pull request #153 from FranBonath/changes_usage.md
KarNair Nov 25, 2025
505fa03
Move picard_collectmultiplemetrics to QC_BAM.
Nov 25, 2025
e40b3f0
Merge branch 'dev' into collethsmetrics
beatrizsavinhas Nov 25, 2025
bfa3cd3
Update README.md
beatrizsavinhas Nov 25, 2025
509087f
Fix output channels.
Nov 25, 2025
92ad8a7
Updated CHANGE LOG
agrima2010 Nov 25, 2025
3de0c5f
Merge branch 'dev' into change-qc-bam
beatrizsavinhas Nov 25, 2025
061ec28
Merge branch 'subworkflows' of https://github.com/agrima2010/seqinspe…
agrima2010 Nov 25, 2025
712495d
Merge pull request #151 from agrima2010/subworkflows
agrima2010 Nov 25, 2025
5746497
Fix linting issues.
Nov 25, 2025
e8e43bd
Update CHANGELOG.md
beatrizsavinhas Nov 25, 2025
e74488f
Merge branch 'dev' into collecthsmetrics
beatrizsavinhas Nov 25, 2025
19768f7
Update seqinspector.nf
Nov 25, 2025
42a8ff5
Fix linting erros.
Nov 25, 2025
bc0edf9
Fix test snap generation.
Nov 25, 2025
5444402
Fix linting issues.
Nov 26, 2025
a4f2666
Run prettier and remove trailing whitespace.
Nov 26, 2025
b0bb6ae
Fix end of files.
Nov 26, 2025
8370238
Remove cg_test config.
Nov 26, 2025
ab4ff6d
Trim trailing whitespace.
Nov 26, 2025
10f30df
Remove picard_collecthsmetrics from skip_tools as it is not run by de…
Nov 26, 2025
3b45465
Update usage.md.
Nov 26, 2025
566e4c9
Seqtk add relative sample sets and warning
nkongenelly Nov 25, 2025
8ad085f
Fixing tests
nkongenelly Nov 25, 2025
2e1b664
Fixed test error
nkongenelly Nov 26, 2025
5d3309b
[automated] Fix code linting
nf-core-bot Nov 26, 2025
432816b
Harshil alignment
eliottBo Nov 26, 2025
d724965
Update ro-crate
eliottBo Nov 26, 2025
8838130
Format seqinspector.nf.
Nov 27, 2025
0af25a1
Restore output.md.
Nov 27, 2025
739e7d8
Add input parameter notes for QC_BAM.
Nov 27, 2025
b5ef24a
Fix input notes.
Nov 27, 2025
ad3b333
Update input parameter notes.
Nov 27, 2025
8e08426
Move PICARD_CREATESEQUENCEDICTIONARY to PREPARE_GENOME. Adjust QC_BAM…
Nov 27, 2025
5ec2712
Initialise ch_ref_dict in PREPARE_GENOME.
Nov 27, 2025
0023329
Harshil alignment
eliottBo Nov 28, 2025
845a749
Merge branch 'dev' into multiqc-index
pontushojer Nov 28, 2025
52cce2f
Update snapshot for QC_BAM.
Nov 28, 2025
55f2d6f
Update CHANGELOG.md
beatrizsavinhas Nov 28, 2025
6ee64f3
Merge branch 'collecthsmetrics' into change-qc-bam
Nov 28, 2025
79356af
Correct QC_BAM main.nf.
Nov 28, 2025
6a1b397
Remove PICARD_COLLECTMULTIPLEMETRICS from seqinspector.nf.
Dec 1, 2025
fa53330
Fix linting issues.
Dec 1, 2025
2238747
Add checks for empty params bait_intervals and target_intervals.
Dec 1, 2025
4d02baf
Fix QC_BAM input parameters.
Dec 1, 2025
028d095
Update QC_BAM test.
Dec 1, 2025
4ddf032
Update usage.md
beatrizsavinhas Dec 1, 2025
399e1ce
Merge branch 'collecthsmetrics' into change-qc-bam
Dec 1, 2025
030eb2e
Edit CHANGELOG.md
Dec 1, 2025
03a9731
Run prettier.
Dec 1, 2025
f7fadab
Merge branch 'collecthsmetrics' into change-qc-bam
Dec 1, 2025
d71d935
Rename ch_reference_fai.
Dec 1, 2025
520ccdc
Update PREPARE_GENOME input parameters description
Dec 1, 2025
9a44235
Merge pull request #159 from eliottBo/collecthsmetrics
beatrizsavinhas Dec 1, 2025
40a7053
Merge branch 'nf-core:dev' into change-qc-bam
beatrizsavinhas Dec 1, 2025
95ac7b5
Fix prepare_genome/main.nf
Dec 2, 2025
5a1196c
Add tests
Dec 2, 2025
7709979
Update test
Dec 2, 2025
8c6edff
Merge branch 'dev' into test-prepare-genome
beatrizsavinhas Dec 2, 2025
f57ef72
Update prepare_genome/main.nf
Dec 2, 2025
dc9690e
Add publishDir for PICARD_CREATESEQUENCEDICTIONARY
Dec 2, 2025
63df743
Refactor tests
Dec 2, 2025
c26104c
Fix end of file main.nf.test
Dec 2, 2025
e55caaa
Update CHANGELOG.md
Dec 2, 2025
83e319a
Update CHANGELOG.md
Dec 2, 2025
446651d
Revert redundant changes in modules.config
Dec 2, 2025
8d3a773
Update some snapshots
Dec 2, 2025
a9e3b30
More snapshot updates
Dec 2, 2025
815176e
Update remaining snapshot
Dec 2, 2025
8ea7b63
Re-add changes to modules.config
Dec 2, 2025
60e8666
Small changes in comments in main.nf.test
Dec 2, 2025
d5f69b8
Small changes in comments in main.nf.test
Dec 2, 2025
d64f815
Add nextflow.config to tests
Dec 2, 2025
4a5d3a1
Update tests and snapshot
Dec 2, 2025
4e43aec
Add EOL to nextflow.config
Dec 2, 2025
fb4376d
Update QC_BAM input description
Dec 3, 2025
5614398
Update PREPARE_GENOME test configs
Dec 3, 2025
1f7cae8
Update snapshots
Dec 3, 2025
9d9a2f9
Add test nextflow.config
Dec 3, 2025
783f565
Update test configs to remove hardcoded pipelines_testdata_base_path …
Dec 3, 2025
8ac6732
Remove redundant / in test-data paths
Dec 3, 2025
2c408f9
Merge pull request #158 from eliottBo/change-qc-bam
beatrizsavinhas Dec 3, 2025
0c33826
Merge branch 'nf-core:dev' into test-prepare-genome
beatrizsavinhas Dec 3, 2025
8bf9588
run fastqscreen with subsampled data if available
pontushojer Dec 4, 2025
2345202
update changelog
pontushojer Dec 4, 2025
324851e
Fix bug preventing picard_multiple_metrics files from being passed to…
Dec 4, 2025
67db2a9
Update CHANGELOG.md
Dec 4, 2025
6f71488
Update CHANGELOG.md
beatrizsavinhas Dec 5, 2025
743ae2d
Merge pull request #163 from pontushojer/sample_fastqscreen
pontushojer Dec 8, 2025
6534925
Merge branch 'dev' into test-prepare-genome
maxulysse Dec 8, 2025
204cc02
fix snapshot
maxulysse Dec 8, 2025
82a49d9
Merge pull request #162 from beatrizsavinhas/test-prepare-genome
beatrizsavinhas Dec 9, 2025
e0ae79d
Merge pull request #135 from pontushojer/multiqc-index
pontushojer Dec 10, 2025
f103e6b
Merge branch 'dev' into output_changes
maxulysse Dec 11, 2025
d16f194
Merge pull request #152 from FranBonath/output_changes
maxulysse Dec 11, 2025
94b8bfd
Merge branch 'dev' into sample-naming
maxulysse Dec 12, 2025
8318295
re-generate snapshots
maxulysse Dec 12, 2025
3ba97e7
Merge branch 'dev' into seqtk_sample_size_warning
maxulysse Dec 12, 2025
64e0a44
Merge pull request #121 from pontushojer/sample-naming
maxulysse Dec 12, 2025
412978e
re-generate snapshots
maxulysse Dec 12, 2025
0cbe100
Update workflows/seqinspector.nf
maxulysse Dec 12, 2025
dba2d8f
Merge branch 'dev' into seqtk_sample_size_warning
maxulysse Dec 12, 2025
9f98c4b
use log.warn, and update tests
maxulysse Dec 12, 2025
afe2f84
Merge pull request #156 from nkongenelly/seqtk_sample_size_warning
maxulysse Dec 12, 2025
61f172b
Merge branch 'dev' into run-dirs
maxulysse Dec 12, 2025
c1576e2
update snapshots
maxulysse Dec 12, 2025
d4c5306
update module
maxulysse Dec 12, 2025
54609e9
code polish + topics
maxulysse Dec 12, 2025
ea4b20c
code polish
maxulysse Dec 12, 2025
16db84c
fix tests
maxulysse Dec 12, 2025
daf18b0
no need for view()
maxulysse Dec 12, 2025
70fdcf2
simplify config
maxulysse Dec 12, 2025
8766778
dir_meta -> meta
maxulysse Dec 12, 2025
4a5024c
Delete modules/local/rundirparser/meta.yml
maxulysse Dec 12, 2025
ff89c38
Update tests/NovaSeq6000.main.nf.test.config
maxulysse Dec 12, 2025
dcd7c8a
sort modules import
maxulysse Dec 12, 2025
f54f4c7
sort import
maxulysse Dec 12, 2025
5d224a4
Trigger Build
maxulysse Dec 12, 2025
5f04718
update snapshots
maxulysse Dec 12, 2025
5c68a10
Merge pull request #106 from kedhammar/run-dirs
maxulysse Dec 12, 2025
e78578d
rename test files and delete config
maxulysse Dec 12, 2025
b3e6fed
need the params in the tests for now
maxulysse Dec 12, 2025
dbe8114
early failure for test for smaller diff
maxulysse Dec 12, 2025
e09b166
simplify tests
maxulysse Dec 12, 2025
ba22c67
polish local subworkflows and tests
maxulysse Dec 12, 2025
4201300
bam are unstable
maxulysse Dec 12, 2025
8a49714
fix prepare_genome tests and snapshots
maxulysse Dec 12, 2025
b39fff7
code polish
maxulysse Dec 12, 2025
7d4b433
remove file from snapshot
maxulysse Dec 15, 2025
44889d2
fix qc_bam subworkflow tests
maxulysse Dec 15, 2025
eaf4328
update and improve CHANGELOG
maxulysse Dec 15, 2025
bb585d0
add missing entries
maxulysse Dec 15, 2025
9371567
code polish
maxulysse Dec 15, 2025
052b7ea
fix polish
maxulysse Dec 15, 2025
a4ba47c
Merge pull request #164 from nf-core/tests_cleanup
maxulysse Dec 15, 2025
cfd4a8d
Skip RUNDIRPARSER if no rundir information is available
matrulda Jan 19, 2026
6e56eab
Add 167 to changelog
matrulda Jan 19, 2026
960728a
Add test for no rundir information scenario
matrulda Jan 19, 2026
e535dc1
Add note about RUNDIRPARSER being skipped when not using rundir
matrulda Jan 20, 2026
749aee2
pre-commit fix
matrulda Jan 20, 2026
d080687
Write warning if rundir is missing for a sample
matrulda Feb 6, 2026
c4052c6
Merge pull request #167 from matrulda/issue-166_document_rundir_parser
maxulysse Feb 9, 2026
9da961c
Address review comments (#168)
maxulysse Feb 10, 2026
9df5cfc
prepare release 1.0.0 (#169)
maxulysse Feb 11, 2026
204b44d
rescure size from the pipeline tests (#171)
maxulysse Feb 11, 2026
2a59538
fix conda env for rundirparser (#172)
maxulysse Feb 11, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 6 additions & 0 deletions .github/CONTRIBUTING.md
Original file line number Diff line number Diff line change
Expand Up @@ -82,6 +82,12 @@ If you wish to contribute a new step, please use the following coding standards:
9. Update MultiQC config `assets/multiqc_config.yml` so relevant suffixes, file name clean up and module plots are in the appropriate order. If applicable, add a [MultiQC](https://https://multiqc.info/) module.
10. Add a description of the output files and if relevant any appropriate images from the MultiQC report to `docs/output.md`.

### Things to consider regarding displaying results for a new tool

- If a MultiQC module exist for the tool, use the standard settings for it to start with.
- If no Multiqc module exists, the results of the tool should be made available in the results directory.
- If a tool doesn’t produce output files, the stdout should be channeled into a output file that can be accessible from the outdir of the pipeline.

### Default values

Parameters should be initialised / defined with default values within the `params` scope in `nextflow.config`.
Expand Down
3 changes: 0 additions & 3 deletions .github/workflows/awsfulltest.yml
Original file line number Diff line number Diff line change
Expand Up @@ -24,9 +24,6 @@ jobs:

- name: Launch workflow via Seqera Platform
uses: seqeralabs/action-tower-launch@v2
# TODO nf-core: You can customise AWS full pipeline tests as required
# Add full size test data (but still relatively small datasets for few samples)
# on the `test_full.config` test runs with only one set of parameters
with:
workspace_id: ${{ vars.TOWER_WORKSPACE_ID }}
access_token: ${{ secrets.TOWER_ACCESS_TOKEN }}
Expand Down
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -7,3 +7,5 @@ testing/
testing*
*.pyc
null/
.nf-test
.nf-test.log
5 changes: 5 additions & 0 deletions .nf-core.yml
Original file line number Diff line number Diff line change
@@ -1,11 +1,16 @@
lint:
multiqc_config: false
files_exist:
- tests/default.nf.test
files_unchanged:
- .github/CONTRIBUTING.md
- docs/images/nf-core-seqinspector_logo_dark.png
- docs/images/nf-core-seqinspector_logo_light.png
- assets/nf-core-seqinspector_logo_light.png
nextflow_config:
- config_defaults:
- params.fastq_screen_references
nf_test_content: false
nf_core_version: 3.5.1
repository_type: pipeline
template:
Expand Down
56 changes: 56 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,8 +9,64 @@ Initial release of nf-core/seqinspector, created with the [nf-core](https://nf-c

### `Added`

- [#2](https://github.com/nf-core/seqinspector/pull/2) Input workflow and schema definition
- [#13](https://github.com/nf-core/seqinspector/pull/13) Generate reports per run, per project and per lane
- [#20](https://github.com/nf-core/seqinspector/pull/20) Use tags to generate group reports
- [#50](https://github.com/nf-core/seqinspector/pull/50) Add an optional subsampling step
- [#51](https://github.com/nf-core/seqinspector/pull/51) Add nf-test to CI
- [#53](https://github.com/nf-core/seqinspector/pull/53), [#64](https://github.com/nf-core/seqinspector/pull/64) Add FastQ-Screen database multiplexing and limit scope of nf-test in CI
- [#56](https://github.com/nf-core/seqinspector/pull/56) Added SeqFu stats module
- [#63](https://github.com/nf-core/seqinspector/pull/63) Contribution guidelines added about displaying results for new tools
- [#68](https://github.com/nf-core/seqinspector/pull/68) Add tool selector
- [#75](https://github.com/nf-core/seqinspector/pull/75) Set up nft-utils
- [#96](https://github.com/nf-core/seqinspector/pull/96) Added missing citations to citation tool
- [#100](https://github.com/nf-core/seqinspector/pull/100) Added official logos
- [#103](https://github.com/nf-core/seqinspector/pull/103) Configure full-tests
- [#106](https://github.com/nf-core/seqinspector/pull/106) Parse rundir metadata
- [#108](https://github.com/nf-core/seqinspector/pull/108) Update lists of default steps in the pipeline (#86)
- [#108](https://github.com/nf-core/seqinspector/pull/108) Test data validation (#94)
- [#110](https://github.com/nf-core/seqinspector/pull/110) Update input schema to accept either tar file or directory as rundir, and fastq messages and patterns
- [#111](https://github.com/nf-core/seqinspector/pull/111) Short summary of seqinspector in README.md (#84)
- [#127](https://github.com/nf-core/seqinspector/pull/127) Added alignment tools - bwamem2 - index and mem
- [#128](https://github.com/nf-core/seqinspector/pull/128) Added Picard tools - Collect Multiple Mterics to collect QC metrics
- [#132](https://github.com/nf-core/seqinspector/pull/132) Added a bwamem2 index params for faster output
- [#135](https://github.com/nf-core/seqinspector/pull/135) Added index section to MultiQC reports to facilitate report navigation (#125)
- [#148](https://github.com/nf-core/seqinspector/pull/148) Add Tubemap
- [#151](https://github.com/nf-core/seqinspector/pull/151) Added a prepare_genome subworkflow to handle bwamem2 indexing
- [#156](https://github.com/nf-core/seqinspector/pull/156) Added relative sample_size and warning when a sample has less reads than desired sample_size
- [#159](https://github.com/nf-core/seqinspector/pull/159) Added a subworkflow QC_BAM including picard_collecthsmetrics for alignment QC of hybrid-selection data
- [#162](https://github.com/nf-core/seqinspector/pull/162) Add tests for prepare_genome subworkflow

### `Fixed`

- [#71](https://github.com/nf-core/seqinspector/pull/71) FASTQSCREEN does not fail when multiple reads are provided
- [#77](https://github.com/nf-core/seqinspector/pull/77) Use a params for fastqscreen csv file, and not the hardcoded example one
- [#99](https://github.com/nf-core/seqinspector/pull/99) Fix group reports for paired reads
- [#107](https://github.com/nf-core/seqinspector/pull/107) Put SeqFU-stats section reports together
- [#108](https://github.com/nf-core/seqinspector/pull/108) Go through and validate test data (#94)
- [#112](https://github.com/nf-core/seqinspector/pull/112) Making fastq_screen_references value to use parentDir
- [#121](https://github.com/nf-core/seqinspector/pull/121) Cleanup sample naming for MultiQC report (#105)
- [#150](https://github.com/nf-core/seqinspector/pull/150) Fix pipeline linting issues
- [#162](https://github.com/nf-core/seqinspector/pull/162) Fix bugs in qc_bam and prepare_genome subworkflows and add tests
- [#163](https://github.com/nf-core/seqinspector/pull/163) Run fastqscreen with subsampled data if available

### `Changed`

- [#15](https://github.com/nf-core/seqinspector/pull/15) Template update for nf-core/tools v2.14.1
- [#26](https://github.com/nf-core/seqinspector/pull/26), [#49](https://github.com/nf-core/seqinspector/pull/49) Template update for nf-core/tools v3.0.2
- [#69](https://github.com/nf-core/seqinspector/pull/69) Template update for nf-core/tools v3.1.0
- [#72](https://github.com/nf-core/seqinspector/pull/72) Template update for nf-core/tools v3.1.2dev0
- [#74](https://github.com/nf-core/seqinspector/pull/74) Template update for nf-core/tools v3.2.0
- [#114](https://github.com/nf-core/seqinspector/pull/114) Update CI
- [#133](https://github.com/nf-core/seqinspector/pull/133) Template update for nf-core/tools v3.4.1
- [#144](https://github.com/nf-core/seqinspector/pull/144) Template update for nf-core/tools v3.5.1
- [#145](https://github.com/nf-core/seqinspector/pull/145) Remove outdated comments
- [#148](https://github.com/nf-core/seqinspector/pull/148), [#152](https://github.com/nf-core/seqinspector/pull/152), [#153](https://github.com/nf-core/seqinspector/pull/153) Update documentation
- [#158](https://github.com/nf-core/seqinspector/pull/158) Moved picard_collectmultiplemetrics to the subworkflow QC_BAM
- [#164](https://github.com/nf-core/seqinspector/pull/164) Refactor local subworkflow and pipeline tests

### `Dependencies`

- [#116](https://github.com/nf-core/seqinspector/pull/116) Update MultiQC to 1.28

### `Deprecated`
22 changes: 22 additions & 0 deletions CITATIONS.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,10 +14,32 @@

> Andrews, S. (2010). FastQC: A Quality Control Tool for High Throughput Sequence Data [Online].

- [SeqFu](https://telatin.github.io/seqfu2/)

> Telatin A, Fariselli P, Birolo G. SeqFu: A Suite of Utilities for the Robust and Reproducible Manipulation of Sequence Files. Bioengineering 2021, 8, 59. doi.org/10.3390/bioengineering8050059

- [FastQ Screen](https://www.bioinformatics.babraham.ac.uk/projects/fastq_screen/)

> Wingett SW and Andrews S. FastQ Screen: A tool for multi-genome mapping and quality control [version 2; referees: 4 approved]. F1000Research 2018, 7:1338 (https://doi.org/10.12688/f1000research.15931.2)

- [MultiQC](https://pubmed.ncbi.nlm.nih.gov/27312411/)

> Ewels P, Magnusson M, Lundin S, Käller M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics. 2016 Oct 1;32(19):3047-8. doi: 10.1093/bioinformatics/btw354. Epub 2016 Jun 16. PubMed PMID: 27312411; PubMed Central PMCID: PMC5039924.

- [Seqtk](https://github.com/lh3/seqtk)

- [BWAMEM2](https://ieeexplore.ieee.org/abstract/document/8820962)

> Vasimuddin Md, Misra S, Li H, Aluru S. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. In: 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE; 2019:314-324. doi:10.1109/IPDPS.2019.00041

- [SAMTOOLS](https://academic.oup.com/bioinformatics/article/25/16/2078/204688)

> Danecek P, Bonfield JK, Liddle J, et al. Twelve years of SAMtools and BCFtools. GigaScience. 2021;10(2):giab008. doi:10.1093/gigascience/giab008

- [Picard Tools](https://broadinstitute.github.io/picard/)

> Broad Institute, “Picard Toolkit.” 2019. GitHub Repository. https://broadinstitute.github.io/picard/

## Software packaging/containerisation tools

- [Anaconda](https://anaconda.com)
Expand Down
62 changes: 37 additions & 25 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,43 +21,58 @@

## Introduction

**nf-core/seqinspector** is a bioinformatics pipeline that ...

<!-- TODO nf-core:
Complete this sentence with a 2-3 sentence summary of what types of data the pipeline ingests, a brief overview of the
major pipeline sections and the types of output it produces. You're giving an overview to someone new
to nf-core here, in 15-20 seconds. For an example, see https://github.com/nf-core/rnaseq/blob/master/README.md#introduction
-->

<!-- TODO nf-core: Include a figure that guides the user through the major workflow steps. Many nf-core
workflows use the "tube map" design for that. See https://nf-co.re/docs/guidelines/graphic_design/workflow_diagrams#examples for examples. -->
<!-- TODO nf-core: Fill in short bullet-pointed list of the default steps in the pipeline -->1. Read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))2. Present QC for raw reads ([`MultiQC`](http://multiqc.info/))
<picture>
<source media="(prefers-color-scheme: dark)" srcset="docs/images/nf-core-seqinspector_logo_dark.png">
<source media="(prefers-color-scheme: light)" srcset="docs/images/nf-core-seqinspector_logo_light.png">
<img alt="Fallback image description" src="docs/images/nf-core-seqinspector_logo_light.png">
</picture>

<picture>
<source media="(prefers-color-scheme: dark)" srcset="docs/images/nfcore-seqinspector-logo-hex-dark.png">
<source media="(prefers-color-scheme: light)" srcset="docs/images/nfcore-seqinspector-logo-hex-light.png">
<img alt="Fallback image description" src="docs/images/nfcore-seqinspector-logo-hex-light.png">
</picture>

**nf-core/seqinspector** is a bioinformatics pipeline that that processes raw sequence data (FASTQ) to provide comprehensive quality control. It can perform subsampling, quality assessment, duplication level analysis, and complexity evaluation on a per-sample basis, while also detecting adapter content, technical artifacts, and common biological contaminants. The pipeline generates detailed MultiQC reports with flexible output options, ranging from individual sample reports to project-wide summaries, making it particularly useful for sequencing core facilities and research groups with access to sequencing instruments. If provided, nf-core/seqinspector can also parse statistics from an Illumina run folder directory into the final MultiQC reports.

<!-- TODO: add a search tool that accepts a tree for `Compatibility with Data`. -->

| Tool Type | Tool Name | Tool Description | Compatibility with Data | Dependencies | Default tool |
| ------------------- | ------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------- | ----------------------- | ------------------------------------------------------------------------------------------------------------------------- | ------------ |
| `Subsampling` | [`Seqtk`](https://github.com/lh3/seqtk) | Global subsampling of reads. Only performs subsampling if `--sample_size` parameter is given. | [RNA, DNA, synthetic] | [N/A] | no |
| `Indexing, Mapping` | [`Bwamem2`](https://github.com/bwa-mem2/bwa-mem2) | Align reads to reference | [RNA, DNA] | [N/A] | yes |
| `Indexing` | [`SAMtools`](http://github.com/samtools) | Index aligned BAM files, create FASTA index | [DNA] | [N/A] | yes |
| `QC` | [`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) | Read QC | [RNA, DNA] | [N/A] | yes |
| `QC` | [`FastqScreen`](https://www.bioinformatics.babraham.ac.uk/projects/fastq_screen/) | Basic contamination detection | [RNA, DNA] | [N/A] | yes |
| `QC` | [`SeqFu Stats`](https://github.com/telatin/seqfu2) | Sequence statistics | [RNA, DNA] | [N/A] | yes |
| `QC` | [`Picard collect multiple metrics`](https://broadinstitute.github.io/picard/picard-metric-definitions.html) | Collect multiple QC metrics | [RNA, DNA] | [Bwamem2, SAMtools, `--genome`] | yes |
| `QC` | [`Picard_collecthsmetrics`](https://gatk.broadinstitute.org/hc/en-us/articles/360036856051-CollectHsMetrics-Picard) | Collect alignment QC metrics of hybrid-selection data. | [RNA, DNA] | [Bwamem2, SAMtools, `--fasta`, `--run_picard_collecths_metrics`, `--bait_intervals`, `--target_intervals` (`--ref_dict`)] | no |
| `Reporting` | [`MultiQC`](http://multiqc.info/) | Present QC for raw reads | [RNA, DNA, synthetic] | [N/A] | yes |

<picture>
<source media="(prefers-color-scheme: dark)" srcset="docs/images/seqinspector_tubemap_V1.0_dark.png">
<source media="(prefers-color-scheme: light)" srcset="docs/images/seqinspector_tubemap_V1.0.png">
<img alt="Fallback image description" src="docs/images/seqinspector_tubemap_V1.0.png">
</picture>

## Usage

> [!NOTE]
> If you are new to Nextflow and nf-core, please refer to [this page](https://nf-co.re/docs/usage/installation) on how to set-up Nextflow. Make sure to [test your setup](https://nf-co.re/docs/usage/introduction#how-to-run-a-pipeline) with `-profile test` before running the workflow on actual data.

<!-- TODO nf-core: Describe the minimum required steps to execute the pipeline, e.g. how to prepare samplesheets.
Explain what rows and columns represent. For instance (please edit as appropriate):

First, prepare a samplesheet with your input data that looks as follows:

`samplesheet.csv`:

```csv
sample,fastq_1,fastq_2
CONTROL_REP1,AEG588A1_S1_L002_R1_001.fastq.gz,AEG588A1_S1_L002_R2_001.fastq.gz
sample,fastq_1,fastq_2,rundir,tags
CONTROL_REP1,AEG588A1_S1_L002_R1_001.fastq.gz,AEG588A1_S1_L002_R2_001.fastq.gz,200624_A00834_0183_BHMTFYDRXX,lane1:project5:group2
```

Each row represents a fastq file (single-end) or a pair of fastq files (paired end).

-->

Now, you can run the pipeline using:

<!-- TODO nf-core: update the following command to include all required parameters for a minimal example -->

```bash
nextflow run nf-core/seqinspector \
-profile <docker/singularity/.../institute> \
Expand All @@ -78,11 +93,11 @@ For more details about the output files and reports, please refer to the

## Credits

nf-core/seqinspector was originally written by Adrien Coulier.
nf-core/seqinspector was originally written by the Swedish [@NationalGenomicsInfrastructure](https://github.com/NationalGenomicsInfrastructure/) and [Clinical Genomics Stockholm](https://clinical.scilifelab.se/).

We thank the following people for their extensive assistance in the development of this pipeline:

<!-- TODO nf-core: If applicable, make list of people who have also contributed -->
- [@mahesh-panchal](https://github.com/mahesh-panchal)

## Contributions and Support

Expand All @@ -92,9 +107,6 @@ For further information or help, don't hesitate to get in touch on the [Slack `#

## Citations

<!-- TODO nf-core: Add citation for pipeline after first release. Uncomment lines below and update Zenodo doi and badge at the top of this file. -->
<!-- If you use nf-core/seqinspector for your analysis, please cite it using the following doi: [10.5281/zenodo.XXXXXX](https://doi.org/10.5281/zenodo.XXXXXX) -->

<!-- TODO nf-core: Add bibliography of tools and data used in your pipeline -->

An extensive list of references for the tools used by the pipeline can be found in the [`CITATIONS.md`](CITATIONS.md) file.
Expand Down
4 changes: 4 additions & 0 deletions assets/example_fastq_screen_references.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
name,dir,basename,aligner
Ecoli,s3://ngi-igenomes/igenomes/Escherichia_coli_K_12_MG1655/NCBI/2001-10-15/Sequence/Bowtie2Index/,genome,bowtie2
PhiX,s3://ngi-igenomes/igenomes/PhiX/Illumina/RTA/Sequence/Bowtie2Index/,genome,bowtie2
Scerevisiae,s3://ngi-igenomes/igenomes/Saccharomyces_cerevisiae/NCBI/build3.1/Sequence/Bowtie2Index/,genome,bowtie2
Loading