- Kubernetes controller to repair nodes which are
NotReadyby replacing them with new fresh nodes. This is done by manipulatingAutoScalingGroupsto repair the nodes. - Currently supports only AWS cloud provider.
- This component is not used by gardener anymore and no longer maintained. It was archived in the gardener-attic.
- Control loop for each Auto Scaling Group configured for a shoot cluster :
- Identify
Nodeswhich areNotReadysince configurable amount of time (~10 minutes). - Create new nodes and wait until they are
Ready - Cordon and drain all
NotReadynodes. - Delete the
NotReadynodes.
- Identify
- Apply this approach for each ASG in a shoot cluster one by one.
- For a given ASG, create excess
Nodesin parallel but cordon, drain and deleteNodesone by one. - If ASG does not have sufficient capacity for excess
Nodes, first delete theNotReadynodes then create new one.
| Command | Implication |
|---|---|
| Make compile | Build the go code locally |
| Make release | Deploy image into Gcloud |
Use the deploy/kubernetes/deployment.yaml to deploy the auto-node-repair into the cluster. Refer to this file for more details.