Skip to content

Cascade of Disconnects from Overloaded Worker #2092

@MarcusSorealheis

Description

@MarcusSorealheis

NativeLink uploads outputs concurrently and uses update_with_whole_file, which moves the file into CAS. If two uploads race on the same path, the second one hits NotFound on rename even though the digest is already in CAS.

What happens next is RunningActionImpl did not cleanup and it is usually just a consequence of futures being dropped during a cascade of disconnects.

Logs look like this:

2025-12-10T13:28:21.427171Z ERROR nativelink_worker::running_actions_manager: RunningActionImpl did not cleanup. This is a violation of the
requirements, will attempt to do it in the background., operation_id: bac6bbc0-0ae9-4503-beb4-c714e00ee88c        at
nativelink-worker/src/running_actions_manager.rs:1393       in nativelink_worker::local_worker::run with shutdown_rx: broadcast::Receiver
                                  in nativelink::worker with name: "worker_0"
      2025-12-10T13:28:21.427181Z  WARN nativelink_worker::running_actions_manager: Sending kill to running operation, operation_id:
Uuid(c2b53c14-f948-4f13-88e4-54ccaf58829d)                      at nativelink-worker/src/running_actions_manager.rs:2069       in
nativelink_worker::local_worker::run with shutdown_rx: broadcast::Receiver                                                 in nativelink::worker with
name: "worker_0"                                                                                 2025-12-10T13:28:21.427189Z ERROR
nativelink_worker::running_actions_manager: RunningActionImpl did not cleanup. This is a violation of the requirements, will attempt to do it in the
background., operation_id: c2b53c14-f948-4f13-88e4-54ccaf58829d
    at nativelink-worker/src/running_actions_manager.rs:1393
    in nativelink_worker::local_worker::run with shutdown_rx: broadcast::Receiver
    in nativelink::worker with name: "worker_0"

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions