
Fix: Reporting the correct number of utilized CPUs#6809

Closed
JosuaCarl wants to merge 5 commits into nextflow-io:master from JosuaCarl:num-cpu-call

Conversation


@JosuaCarl JosuaCarl commented Feb 5, 2026

As stated in #6743, a problem with flexible task scheduling is that constraints on logical cores are either not enforced or only enforced with some wiggle room.

In the case of the Docker executor, `--cpu-shares 2048` is used instead of `--cpus 2`, which prioritizes resource utilization but imposes no hard limit (see https://docs.docker.com/engine/containers/resource_constraints/#configure-the-default-cfs-scheduler).

if( cpus && !legacy )
result << "--cpu-shares ${cpus * 1024} "
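To illustrate the distinction, the share value computed by the snippet above can be mirrored outside Groovy; `cpu_shares` below is a hypothetical helper, not part of the Nextflow codebase.

```shell
# Docker's default weight is 1024 shares per CPU-equivalent, so a request
# for N CPUs becomes N * 1024 shares -- a relative scheduling *weight*
# under contention, not a hard cap like --cpus.
cpu_shares() {
  echo $(( $1 * 1024 ))
}

cpu_shares 2   # prints 2048, the value passed to --cpu-shares
```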

This behavior has many upsides, but the number of CPUs in the trace is often reported incorrectly, because it takes the value the user set in `process.cpus` for granted, even when it was not enforced.

The PR adds a sampling script that uses breadth-first search to extract the last-used logical CPU of the task and each of its children and records them in a list. At the end, the list is reduced to the total number of utilized CPUs.
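A minimal sketch of that sampling pass (not the PR's actual script) might look as follows. It assumes a Linux `/proc` layout with per-thread `children` files (CONFIG_PROC_CHILDREN, kernel 3.5+), and the function name `last_used_cpus` is illustrative:

```shell
# Breadth-first walk of a task's process tree, recording the logical CPU
# each live process last ran on, then counting the distinct CPUs seen.
last_used_cpus() {
  local queue=("$1") pid f cpu
  declare -A seen=()
  while (( ${#queue[@]} )); do
    pid=${queue[0]}; queue=("${queue[@]:1}")      # dequeue -> breadth-first order
    # `ps -o psr=` prints the processor the process last executed on
    cpu=$(ps -o psr= -p "$pid" 2>/dev/null | tr -d ' ')
    [[ -n $cpu ]] && seen[$cpu]=1
    for f in /proc/"$pid"/task/*/children; do     # enqueue all child PIDs
      [[ -r $f ]] && queue+=( $(<"$f") )
    done
  done
  echo "${#seen[@]}"                              # distinct CPUs observed
}
```

A real sampler would invoke this in a loop at the chosen interval (0.1 s here) and merge the CPU sets across samples, since a single snapshot only sees where each process happens to sit at that instant.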

The current sampling rate is 0.1 s. I tried 1.0 s, but this missed several short processes in the demo pipeline, although I suspect this effect may vanish for longer processes. I saw no significant changes in task execution time or memory utilization with either option. Sampling was chosen because the alternatives either require root permissions (tracking at the kernel level with perf) or only report a minimum (ceil(%cpu / 100)) or a maximum (grep Cpus_allowed_list /proc/$pid/status) for the number of utilized CPUs. In practice I observed that the number of actually used CPUs was close to that maximum most of the time, so if an approximation without sampling were preferred, I would suggest grep Cpus_allowed_list /proc/$pid/status, which was my original solution in the first commit (have a look if you want).
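For reference, the `Cpus_allowed_list` field yields a range string such as `0-3,8`; counting its entries gives the upper bound mentioned above. A minimal sketch, assuming the standard Linux `/proc/<pid>/status` format (`count_cpus` is an illustrative name):

```shell
# Expand a Cpus_allowed_list value like "0-3,8" and count the CPUs in it.
count_cpus() {
  local total=0 part lo hi parts
  IFS=',' read -ra parts <<< "$1"
  for part in "${parts[@]}"; do
    if [[ $part == *-* ]]; then
      lo=${part%-*}; hi=${part#*-}
      total=$(( total + hi - lo + 1 ))   # inclusive range, e.g. 0-3 -> 4 CPUs
    else
      total=$(( total + 1 ))             # single CPU id
    fi
  done
  echo "$total"
}

# On a live system the input would come from /proc, e.g.:
#   count_cpus "$(awk -F':[ \t]*' '/Cpus_allowed_list/ {print $2}' /proc/$$/status)"
count_cpus "0-3,8"   # prints 5
```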

Edits:

  • Changed the parameter that is reported from cpus to a new TraceRecord field used_cpus.
  • Replaced trace writing with the existing nxf_write_trace method to align nxf_trace_linux with nxf_trace_mac and save a few lines. This could also be moved into a separate PR if it is deemed too much of a deviation from the main task.

…d on system level information at execution time

Signed-off-by: Josua Carl <josua.carl@uni-tuebingen.de>
…`command-trace.txt`

Signed-off-by: Josua Carl <josua.carl@uni-tuebingen.de>

netlify bot commented Feb 5, 2026

Deploy Preview for nextflow-docs-staging canceled.

Latest commit: f6b45de
Latest deploy log: https://app.netlify.com/projects/nextflow-docs-staging/deploys/6985abb3c1eabe0008539c11


muffato commented Feb 5, 2026

(I'm not in the Nextflow team, I'm just a Nextflow user.)

I would like to plead for leaving the current cpus as it is, because together with memory and time (and disk) it reports what the workflow defined and requested for each task. Those values can then be compared with the actual usage (%cpu, peak_rss, and realtime) to optimise resource requests within the pipeline. Making resource requests match actual usage is a very important step when deploying a pipeline in production, to reduce scheduling time and wastage.

I see the metrics you're trying to add as an "advanced" trace that gives valuable insights, alongside the existing scheduler and runtime traces like duration, %mem, vmem, write_bytes, and many others, but doesn't replace cpus.

@JosuaCarl (author)

@muffato So would you rather opt for an additional trace value, like used_cpus, so that the reporting of the user-set cpus is not lost?

…n of `cpus`

Refactor: Utilized `nxf_trace_write` in `nxf_trace_linux` to align it with `nxf_trace_mac`

Test: Added additional testing condition for parsing of `used_cpus` from memory after bash script
Signed-off-by: Josua Carl <josua.carl@uni-tuebingen.de>

muffato commented Feb 6, 2026

Personally, yes, but again I'm just a mere (moderately advanced) user. The call should come from Seqera.

@bentsherman (member)

Closing for the same reason as #6731 -- this seems like a lot of added complexity for not much added value; consider doing it through a plugin instead.



Development

Successfully merging this pull request may close these issues.

cpus trace value does not correspond to used logical cores of the task
