Commit 20b9975
committed
Updates to doc
1 parent d099b6f commit 20b9975

File tree

1 file changed

+19
-5
lines changed

articles/reliability/reliability-aks.md

Lines changed: 19 additions & 5 deletions
Original file line number | Diff line number | Diff line change
@@ -6,7 +6,7 @@ ms.author: schaffererin
66
ms.topic: reliability-article
77
ms.custom: subject-reliability, references_regions #Required - use references_regions if specific regions are mentioned.
88
ms.service: azure-kubernetes-service
9-
ms.date: 02/06/2025
9+
ms.date: 02/07/2025
1010
#Customer intent: As an engineer responsible for business continuity, I want to understand who need to understand the details of how AKS works from a reliability perspective and plan disaster recovery strategies in alignment with the exact processes that Azure services follow during different kinds of situations.
1111
---
1212

@@ -57,6 +57,8 @@ You can configure AKS to be *zone redundant*, which means your resources are s
5757

5858
You can deploy zone redundant AKS resources into any [Azure region that supports availability zones](./availability-zones-region-support.md).
5959

60+
<!-- Add information about the different types of node pools customers can deploy (maybe we add another subsection for this? -->
61+
6062
### Region support
6163

6264
<!-- Add information on HA/DR docs, how to load balance across multiple clusters, Fleet -->
@@ -69,22 +71,29 @@ When using availability zones in AKS, consider the following:
6971

7072
- You can only define availability zones during creation of the cluster or node pool.
7173
- It's not possible to update an existing non-availability zone cluster to use availability zones after creating the cluster.
72-
- The chosen node size (VM SKU) selected must be available across all availability zones selected.
73-
- Clusters with availability zones enabled require using Azure Standard Load Balancers for distribution across zones. You can only define this load balancer type at cluster create time. For more information and the limitations of the standard load balancer, see [Azure load balancer standard SKU limitations](/azure/aks/load-balancer-standard#limitaitons).
74+
- Clusters with availability zones enabled require using Azure Standard Load Balancer for distribution across zones. You can only define this load balancer type at cluster create time. For more information and the limitations of the standard load balancer, see [Azure load balancer standard SKU limitations](/azure/aks/load-balancer-standard#limitaitons).
7475
- When implementing **availability zones with the [cluster autoscaler](/azure/aks/cluster-autoscaler-overview)**, we recommend using a single node pool for each zone. You can set the `--balance-similar-node-groups` parameter to `true` to maintain a balanced distribution of nodes across zones for your workloads during scale up operations. When this approach isn't implemented, scale down operations can disrupt the balance of nodes across zones. This configuration doesn't guarantee that similar node groups will have the same number of nodes:
7576
- Currently, balancing happens during scale up operations only. The cluster autoscaler scales down underutilized nodes regardless of the relative sizes of the node groups.
76-
- The cluster autoscaler only adds as many nodes as required to run all existing pods. Some groups might have more nodes than others if they have more pods scheduled.
77+
- The cluster autoscaler adds nodes based on pending pods and the requests of the pods to calculate the number of nodes to add.
7778
- The cluster autoscaler only balances between node groups that can support the same set of pending pods.
7879
- You can use Azure zone-redundant storage (ZRS) disks to replicate your storage across three availability zones in the region you select. A ZRS disk lets you recover from availability zone failure without data loss. For more information, see [ZRS for managed disks](/azure/virtual-machines/disks-redundancy#zone-redundant-storage-for-managed-disks).
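The scale-up balancing described in the bullets above can be illustrated with a small simulation. This is a minimal sketch of the idea only, not the cluster autoscaler's actual algorithm: the function name `balanced_scale_up` and the per-zone node pool names are hypothetical, and it assumes all node groups are similar and can run the same pending pods.

```python
def balanced_scale_up(group_sizes, new_nodes):
    """Distribute new nodes across similar node groups, smallest group first.

    Simplified model (an assumption for illustration): each new node goes to
    the currently smallest group, with ties broken by group name.
    """
    sizes = dict(group_sizes)  # copy so the caller's mapping is not mutated
    for _ in range(new_nodes):
        smallest = min(sizes, key=lambda name: (sizes[name], name))
        sizes[smallest] += 1
    return sizes


# Hypothetical per-zone node pools that start out unbalanced.
pools = {"pool-zone1": 3, "pool-zone2": 1, "pool-zone3": 1}
print(balanced_scale_up(pools, 4))
```

The model covers scale-up only. As the bullets note, scale-down removes underutilized nodes regardless of relative group sizes, so the distribution can drift out of balance again.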
7980

8081
### Cost
8182

83+
Availability zones are free to use. You only pay for the virtual machines (VMs) and other resources that you deploy in the availability zones.
84+
8285
### Configure availability zone support
8386

8487
[Create an Azure Kubernetes Service (AKS) cluster that uses availability zones](/azure/aks/availability-zones)
8588

8689
### Capacity planning and management
8790

91+
We recommend that you use the following best practices for capacity planning and management:
92+
93+
- [Node autoprovisioning (NAP)](/azure/aks/node-autoprovision)
94+
- [Single instance VM node pools](/azure/aks/virtual-machines-node-pools)
95+
- [Go multi-region with Azure Kubernetes Fleet Manager](/azure/kubernetes-fleet-overview)
96+
8897
### Traffic routing between zones
8998

9099
### Data replication between zones
@@ -93,7 +102,12 @@ When using availability zones in AKS, consider the following:
93102

94103
### Failback
95104

96-
### Testing for zone failures
105+
### Testing for zone failures
106+
107+
You can test for resiliency to failures using the following methods:
108+
109+
- [Cordon and drain nodes in a single availability zone](/azure/aks/aks-zone-resiliency#method-1-cordon-and-drain-nodes-in-a-single-az)
110+
- [Simulate an availability zone failure using Azure Chaos Studio](/azure/aks/aks-zone-resiliency#method-2-simulate-an-az-failure-using-azure-chaos-studio)
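As a rough illustration of the cordon-and-drain method, the sketch below builds the `kubectl` commands that target every node in one zone. It assumes nodes carry the well-known `topology.kubernetes.io/zone` label. The helper name `zone_drain_commands` and the zone value `eastus2-1` are hypothetical, and the drain flags shown are one reasonable choice, not necessarily those used in the linked article.

```python
ZONE_LABEL = "topology.kubernetes.io/zone"


def zone_drain_commands(zone):
    """Return kubectl argument lists that cordon, then drain, one zone's nodes."""
    selector = f"{ZONE_LABEL}={zone}"
    return [
        # Mark every node in the zone unschedulable.
        ["kubectl", "cordon", "--selector", selector],
        # Evict pods from those nodes; DaemonSet pods are left in place.
        ["kubectl", "drain", "--selector", selector,
         "--ignore-daemonsets", "--delete-emptydir-data"],
    ]


# Hypothetical zone value; print the commands for review before running them.
for command in zone_drain_commands("eastus2-1"):
    print(" ".join(command))
```

Printing the commands instead of executing them keeps the sketch free of side effects. After a real test, `kubectl uncordon` with the same selector makes the nodes schedulable again.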
97111

98112
## Multi-region support
99113
