Ability to run parallel process in a specific order #1901
Replies: 1 comment
-
Hi @jmlivingstone, the scheduling of tasks is generally handled by the underlying executor (SLURM, AWS Batch, etc.). HPC schedulers in particular use all sorts of scheduling algorithms that try to "fit" jobs in the most efficient way. In your example, you might set the |
-
Inputs within a process should be able to run in the order they are received. Currently, the documentation states, 'The channel guarantees that items are delivered in the same order as they have been sent - but - since the process is executed in a parallel manner, there is no guarantee that they are processed in the same order as they are received.'
It would be beneficial to be able to process inputs in the order they are received, so that inputs that take longer can be started before shorter ones. This would reduce the overall run time of the pipeline whenever there are more inputs than parallel slots.
Usage scenario
This would be beneficial when running a process in parallel per chromosome. Larger chromosomes (e.g. 1-4) will take longer than smaller chromosomes (e.g. 20-22). You would want to be able to give the process an ordered list of which chromosomes (inputs) to run first. If there are only enough resources to run x inputs in parallel, then you would want to start the longer jobs first.
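To make the effect concrete, here is a rough Python sketch (not Nextflow code) that simulates the scenario above. It assumes a fixed number of parallel slots and that the next input in the list starts as soon as any slot frees up. The `makespan` helper, the chromosome run times, and the slot count are all made up for illustration; real run times and scheduler behaviour will differ.

```python
import heapq


def makespan(durations, workers):
    """Total wall time when jobs start in list order on `workers` slots.

    Each job is given to whichever slot becomes free first
    (simple greedy list scheduling).
    """
    if workers < 1:
        raise ValueError("workers must be at least 1")
    # Min-heap holding the time at which each slot becomes free.
    free_at = [0] * workers
    heapq.heapify(free_at)
    for duration in durations:
        start = heapq.heappop(free_at)
        heapq.heappush(free_at, start + duration)
    return max(free_at)


# Hypothetical run times in hours; not measured values.
chrom_hours = {
    "chr1": 10, "chr2": 9, "chr3": 8, "chr4": 7,
    "chr20": 3, "chr21": 2, "chr22": 2,
}

longest_first = sorted(chrom_hours.values(), reverse=True)
shortest_first = sorted(chrom_hours.values())

print("longest first: ", makespan(longest_first, workers=3), "hours")
print("shortest first:", makespan(shortest_first, workers=3), "hours")
```

With these invented numbers and three slots, starting the long chromosomes first finishes in 15 hours, while starting the short ones first takes 19, because the longest job ends up starting last.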