Skip to content

Run checks on a random (reproducible) subset of reads #79

@FranBonath

Description

@FranBonath

In order to enable more compute heavy analysis methods (or alternatively just speed up the run), I think it would be very good to have a module to generate random subsets of the input read files.

I believe https://nf-co.re/modules/seqtk_sample/ can be used.

Ideally the user should be able to specify

  1. A random seed to enable reproducibility
  2. A number of reads to aim for per sample (if there are fewer reads in a sample, output a warning and use all reads)

Metadata

Metadata

Labels

enhancementNew feature or requesthelp wantedExtra attention is needed

Type

No type

Projects

Status

In Progress

Status

To do

Relationships

None yet

Development

No branches or pull requests

Issue actions