-
Notifications
You must be signed in to change notification settings - Fork 635
✨ Cancel instance refresh on any relevant change to ASG instead of blocking until previous one is finished (which may have led to failing nodes due to outdated join token) #5173
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…king until previous one is finished (which may have led to failing nodes due to outdated join token)
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
5d04702 to
c121c71
Compare
|
PR needs rebase. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
c121c71 to
8c7c142
Compare
|
I need to reopen this one because I can't force-push the branch. Please review #5318 first since I'm stacking on that. |
|
The Kubernetes project currently lacks enough contributors to adequately respond to all PRs. This bot triages PRs according to the following rules:
You can:
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale |
|
/remove-lifecycle stale |
|
@AndiDog: The following tests failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
I can't force-push using the |
What type of PR is this?
/kind feature
What this PR does / why we need it:
Changing any relevant
spec.*for anAWSMachinePooltriggers rolling of nodes via ASG instance refresh. If another change happens shortly afterwards, it has to wait until the first rollout is done, and will then trigger another instance refresh. But it is neither necessary nor desired to roll all worker nodes twice in such a case, and it's much slower. Instead, cancel the one pending instance refresh, wait until another one can be started, and apply the latest change as soon as possible with the second instance refresh.This change has been running fine in Giant Swarm's CAPA fork for three months at the time of opening this PR.
Special notes for your reviewer:
This PR stacks on top of #5148, so let's please review and merge that other PR first. After that's done, this PR can be retargeted to
main. I didn't want to separate these independent changes because otherwise I have to deal with merge conflicts.Checklist:
Release note: