Allow generic definition of reference databases in config file #74

Open
timrozday-mgnify wants to merge 65 commits into main from mimicc

@timrozday-mgnify
Contributor

To support the MIMICC project, allow custom databases to be used in the pipeline.

Also add capture of the DADA2 truncation position.
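
As an illustration only, a generic database definition in a Nextflow config could look something like the sketch below. The parameter name `databases`, the labels, the keys and the paths are all hypothetical and are not taken from this PR; the real schema is whatever the pipeline's config defines.

```groovy
// nextflow.config (hypothetical sketch: parameter name, labels, keys and paths are assumptions)
params {
    databases = [
        // One entry per reference database, keyed by a label that could
        // also be used to name the results directory (cf. db_label).
        silva_ssu: [
            fasta: '/path/to/silva_ssu/db.fasta',
            tax  : '/path/to/silva_ssu/db.tax'
        ],
        // A custom database would be added as a further entry, with no
        // new module or subworkflow needed.
        my_custom_db: [
            fasta: '/path/to/custom/db.fasta',
            tax  : '/path/to/custom/db.tax'
        ]
    ]
}
```

Keying each entry by a label means that adding or removing a database becomes a config change rather than a code change.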

…e databases for SSU, PR2, LSU, ITS and UNITE analysis.
…s and databases don't get muddled. Change modules so that reads/seqs and database files are combined in a single channel. Small edits of channels throughout to align with good nextflow style. Modify resultsDir to use db_label variable from input channels.
…ess duplication of modules and SWFs for different databases, instead done by maintaining and filtering a databases channel.
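
The approach described in these commits, keeping reads and database files together in a single channel and choosing databases by filtering a databases channel, can be sketched roughly as follows. This is a minimal sketch and not the code in this PR: the channel names, tuple layouts, marker labels and paths are all assumptions made for illustration.

```groovy
workflow {
    // Hypothetical sample channel: [ meta, reads ]
    ch_reads = Channel.of(
        [ [id: 'sample1'], 'sample1.fastq.gz' ]
    )

    // Hypothetical databases channel: [ db_label, marker, db_path ]
    ch_databases = Channel.of(
        [ 'silva_ssu', 'SSU', '/path/to/silva_ssu' ],
        [ 'pr2',       'SSU', '/path/to/pr2' ],
        [ 'unite',     'ITS', '/path/to/unite' ]
    )

    // Select databases by filtering the channel rather than
    // duplicating a module or subworkflow per database.
    ch_ssu_dbs = ch_databases.filter { db -> db[1] == 'SSU' }

    // Pair every sample with every selected database in one channel,
    // so reads and database files cannot get out of step:
    // [ meta, reads, db_label, marker, db_path ]
    ch_reads.combine(ch_ssu_dbs).view()
}
```

Because `combine` emits the cross product of the two channels, each downstream process receives its reads and its database in the same tuple, and the label in that tuple can drive the output directory.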
@timrozday-mgnify
Contributor Author

I've updated the test snapshot because a useful output file has been added (the truncation point). Some file hashes have changed; however, the final output files are the same size as before.

Please check the diff of the test snapshot.

One puzzling thing is that the tests previously didn't produce an output for SILVA-LSU, but now they do. I'm not sure what the desired behavior is, or whether I've removed a filter that was previously in place. @chrisAta?

@timrozday-mgnify
Contributor Author

Ah, I still need to update the module tests.

Member

@mberacochea mberacochea left a comment

I haven't finished yet, but the bbmap reformat module caught my attention.

@@ -0,0 +1,65 @@
process BBMAP_REFORMAT_STANDARDISE {
Member

This one should be pushed into nf-core, or at least nf-modules. Also, it has several "TODO nf-core" comments to clean up.

itsonedb_mapseq_krona_tuple,
)
ch_versions = ch_versions.mix(MAPSEQ_OTU_KRONA_ITSONEDB.out.versions)
if (!params.skip_asv) {
Member

Just one quick comment before I do a more in-depth review: why are we adding this parameter? Do we have cases in production where we would want to skip ASVs completely for entire studies/samplesheets?

Contributor Author

It is to make the pipeline more flexible for non-production uses.

3 participants