Run checks on a random (reproducible) subset of reads

In order to enable more compute heavy analysis methods (or alternatively just speed up the run), I think it would be very good to have a module to generate random subsets of the input read files.

I believe https://nf-co.re/modules/seqtk_sample/ can be used.

Ideally the user should be able to specify 
1. A random seed to enable reproducibility
2. A number of reads to aim for per sample (if there are fewer reads in a sample, output a warning and use all reads)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Run checks on a random (reproducible) subset of reads #79

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Run checks on a random (reproducible) subset of reads #79

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions