Feature: Add option to hibernate runners instead of terminating them when scaling down

Hello,

As we have a big bazel monorepo some CI jobs require a long initialization phase that takes up to 20-30minutes before actually running the test. After this, next jobs picked up by the same runner can re-use the cache and execute quite fast. The issue here is that the cache is held in memory and terminating and then starting runners again can become time-costly. 

A possible solution to this problem is to hibernate inactive runners instead of terminating them when scaling down. This can help re-use the in-memory cache for an acceptable extra storage cost. When scaling up if there's a hibernated runner then we should use it otherwise a new runner spins up.

Another possible approach would be very similar to this issue https://github.com/github-aws-runners/terraform-aws-github-runner/issues/4033. Scale up to the maximum, hibernate `runners_maximum_count - idleCount` and then for scale up we wake up some of the hibernated runners.

Do you see any blockers or risks to implement such a change?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature: Add option to hibernate runners instead of terminating them when scaling down #4737

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature: Add option to hibernate runners instead of terminating them when scaling down #4737

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions