Cancelled workflows lead to extra runners #206

@alfred-stokespace

This case is specific to two scenarios we have in our org.

  1. The run in question is cancelled before it is scheduled
  2. Dependent runs are cancelled when parent runs fail (I think)

In both cases we see that myshoes

  1. is notified
  2. creates a resource
  3. notices that the runner is idle
    • 2024/05/23 19:20:32 7a8d1181-4452-49d4-93d4-272bada8dc76 is idle and not running 6h0m0s, so not will delete (created_at: 2024-05-23 19:14:55 +0000 UTC, now: 2024-05-23 19:20:32.330051792 +0000 UTC)
  4. waits 6 hours before killing the idle runner

We're using some pretty expensive EC2 instances, and we have several contingent runs (that sometimes fail), so having unnecessary instances running for 6 hours gets pretty expensive.

Having looked through your code base and understood the challenges, I can see why this hasn't been solved.

I modified my copy of the myshoes code base to handle this reasonably well. I'll post my solution in a follow-up comment.
