Commit 777b6df

more tweaks
1 parent 2dd06aa commit 777b6df

1 file changed, +6 -6 lines changed

articles/aks/node-auto-repair.md

Lines changed: 6 additions & 6 deletions
@@ -8,7 +8,7 @@ ms.date: 03/10/2020
 
 # Azure Kubernetes Service (AKS) node auto-repair
 
-AKS continuously checks the health state of worker nodes and performs automatic repair of the nodes if they become unhealthy. This documentation describes how Azure Kubernetes Service (AKS) monitors worker nodes, and repairs unhealthy worker nodes. The documentation is to inform AKS operators on the behavior of node repair functionality.
+AKS continuously checks the health state of worker nodes and performs automatic repair of the nodes if they become unhealthy. This documentation describes how Azure Kubernetes Service (AKS) monitors worker nodes, and repairs unhealthy worker nodes. The documentation is to inform AKS operators on the behavior of node repair functionality. It is also important to note that Azure platform [performs maintenance on Virtual Machines][vm-updates] that experience issues. AKS and Azure work together to minimize service disruptions for your clusters.
 
 ## How AKS checks for unhealthy nodes
 
@@ -28,11 +28,11 @@ kubectl get nodes
 
 ## How automatic repair works
 
-Auto-repair takes several steps to repair a broken node. If a node is determined to be unhealthy, AKS attempts several remediation steps. The steps are performed in this order:
+This behavior is for Virtual Machine Scale Sets. Auto-repair takes several steps to repair a broken node. If a node is determined to be unhealthy, AKS attempts several remediation steps. The steps are performed in this order:
 
-1. After the container runtime becomes unresponsive for 10 minutes, the failing runtime daemons and related services are restarted on the node.
-2. If the node does not become available within 10 minutes, the node is rebooted.
-3. If the node is not available within 30 minutes, the node is re-imaged.
+1. After the container runtime becomes unresponsive for 10 minutes, the failing runtime services are restarted on the node.
+2. If the node is not ready within 10 minutes, the node is rebooted.
+3. If the node is not ready within 30 minutes, the node is re-imaged.
 
 > [!Note]
 > If multiple nodes are unhealthy, they are repaired one by one
@@ -45,4 +45,4 @@ Use [Availability Zones][availability-zones] to increase high availability with
 
 <!-- LINKS - Internal -->
 [availability-zones]: ./availability-zones.md
-
+[vm-updates]: ./virtual-machines/maintenance-and-updates.md
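
As a reading aid for the reworded repair steps in the second hunk, the escalation order can be sketched as a small lookup. This is a minimal sketch, not AKS code: `REPAIR_STEPS` and `actions_due` are hypothetical names, and it assumes each wait is counted from the previous step, which the article text does not state explicitly.

```python
from typing import List, Tuple

# (minutes to wait after the previous step, action taken), in the order
# listed in the article. Assumption: waits are cumulative, i.e. each one
# starts when the previous step fired.
REPAIR_STEPS: List[Tuple[int, str]] = [
    (10, "restart runtime services"),
    (10, "reboot node"),
    (30, "re-image node"),
]


def actions_due(not_ready_minutes: int) -> List[str]:
    """Return the repair actions that would already have fired for a node
    that has been continuously unhealthy for `not_ready_minutes`."""
    fired: List[str] = []
    elapsed = 0
    for wait, action in REPAIR_STEPS:
        elapsed += wait
        if not_ready_minutes < elapsed:
            break  # later steps only run if earlier ones did not help
        fired.append(action)
    return fired


if __name__ == "__main__":
    for minutes in (5, 10, 20, 50):
        print(minutes, actions_due(minutes))
```

Under the cumulative-wait assumption, a node would be re-imaged only after roughly 50 minutes of being not ready; if the waits are instead measured from the first failure, the thresholds in `REPAIR_STEPS` would need to change accordingly.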
