is there a way to process n number of files at a time? #4083
-
Hello there, I have 11 samples (11 tumors, 11 normals) that I'd like to process with a Nextflow pipeline I've put together. At the moment all 11 are processed in parallel. Is there a way to restrict how many samples are processed in parallel? For example, process 4 at a time, then the next 4, then the remaining 3. I have the following code: But it only takes 3 samples; the remaining ones are not processed. I'd like to process three at a time.
-
You can use the `collate` operator to group the samples into groups of 4 (including the remainder group): `read_pairs_ch = Channel.fromFilePairs( params.raw_files ).collate(4)`. Alternatively, you can use the `maxForks` directive to limit the number of parallel tasks.
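For the `maxForks` route, here is a minimal sketch (the process name, inputs, and script body are placeholders, not taken from the original pipeline): the directive caps how many tasks of that process run concurrently, so all 11 sample pairs are still submitted, but only 4 execute at any one time.

```nextflow
// Minimal sketch: cap per-process concurrency with maxForks.
// PROCESS_SAMPLE and its script body are hypothetical placeholders.
process PROCESS_SAMPLE {
    maxForks 4   // run at most 4 tasks of this process at a time

    input:
    tuple val(sample_id), path(reads)

    output:
    path "${sample_id}.txt"

    script:
    """
    echo "processing ${sample_id}" > ${sample_id}.txt
    """
}

workflow {
    read_pairs_ch = Channel.fromFilePairs(params.raw_files)
    PROCESS_SAMPLE(read_pairs_ch)
}
```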
-
Understood, thank you 🙏 so much.
Yes, this is because your process expects a single sample, whereas `collate` groups multiple samples into a tuple. This is why I suggested `maxForks` as the better way to limit the parallelism.
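To make the shape difference concrete, here is a small sketch of what `collate(4)` emits (assuming `params.raw_files` points at your paired FASTQ files): each channel item becomes a list of up to four `[sample_id, reads]` tuples, which no longer matches an input declaration written for a single sample.

```nextflow
// Sketch: inspect the channel shape after collate.
// Each emitted item is a list of up to 4 [sample_id, [read1, read2]] entries,
// so a process declared as `tuple val(sample_id), path(reads)` won't match it.
Channel
    .fromFilePairs(params.raw_files)
    .collate(4)
    .view()
```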