Worker: lost run when a task is deferred by the pool?

When the worker claims a Run, it'll hand it off to the engine to execute.

The engine maintains a pool of child processes. When the Run is handed to it, it'll assign a child process.

The engine maintains its own queue: so if all child processes are busy, it'll queue the Run and execute it when it's ready.

This in practice should never happen, because the worker should never claim more work than it has capacity for, and it reads capacity from the engine.

However we do occasionally see that the pool will defer a Run. And think this causes problems because the Worker will probably claim over capacity and maybe there's an unregistered run somewhere out there.

Here's a [GCP log](https://console.cloud.google.com/logs/query;cursorTimestamp=2026-01-05T00:15:04.329182403Z;duration=P1D;pinnedLogId=2026-01-05T00:15:04.329182403Z%2Fb5e3r9hu83advdww;query=resource.type%3D%22k8s_container%22%0Aresource.labels.location%3D%22europe-west6-a%22%0Aresource.labels.pod_name%3D%22global-worker-56566dc7df-rf9d4%22%0Aresource.labels.project_id%3D%22platform-test-267207%22%0Aresource.labels.cluster_name%3D%22swiss-standard-1%22%0Aresource.labels.namespace_name%3D%22prod%22%0Aresource.labels.container_name%3D%22global-worker%22%0Atimestamp%3D%222026-01-05T00:15:04.329182403Z%22%0AinsertId%3D%22b5e3r9hu83advdww%22?project=platform-test-267207) where run `cf14bef8-9a10-4ade-8f17-cc3879084121` seems to deferred in the engine. But I don't actually think it executes properly and the run goes on to be marked lost

Maybe we should throw an error when this happens, rather than trying to pool the work. Because really we're just undermining the worker here. But how do we ensure this run doesn't get lost? It needs to be queued up somewhere. At least until we have some kind of reject event which puts the run back on the queue

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Worker: lost run when a task is deferred by the pool? #1201

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Worker: lost run when a task is deferred by the pool? #1201

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions