Skip to content

Commit bc5c9bb

Browse files
authored
Merge pull request #277091 from fengzhou-msft/patch-1
Update Availability zone support section
2 parents 76fface + 25a94cb commit bc5c9bb

File tree

1 file changed

+37
-5
lines changed

1 file changed

+37
-5
lines changed

articles/reliability/reliability-hdinsight-on-aks.md

Lines changed: 37 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -32,13 +32,45 @@ This article describes reliability support in [Azure HDInsight on Azure Kubernet
3232

3333
[!INCLUDE [next step](includes/reliability-availability-zone-description-include.md)]
3434

35-
Currently, Azure HDInsight on AKS doesn't support availability zone in its service offerings.
35+
Azure HDInsight on AKS supports availability zone by leveraging Azure Kubernetes Service's ability to create zone redundant node pools. You can select which availability zones to deploy the cluster pool and cluster during their creation. Once the cluster pool or cluster is created, you can't change the availability zones.
36+
37+
### Prerequisites
38+
39+
- Availability zones are only supported for cluster pool version >= `1.2` and cluster version >= `1.2.1`.
40+
- Azure HDInsight on AKS only has one default SKU and it supports AZ as long as the Azure region has AZ support.
41+
42+
Below regions don't support AZ:
43+
44+
| Americas | Europe | Middle East | Africa | Asia Pacific |
45+
|------------------|----------------------|---------------|--------------------|----------------|
46+
| West US | Germany North | | | |
47+
- Some VM SKUs may not support all availability zones in a region. If you select those SKUs, HDInsight on AKS cluster pools or clusters don't support corresponding availability zones either.
48+
49+
### SLA improvements
50+
51+
There are no increased SLAs for Azure HDInsight on AKS clusters with availability zones enabled.
52+
53+
#### Create a resource with availability zone enabled
54+
55+
- Cluster Pools
56+
You can select one or more availability zones during cluster pool creation after you select the region.
57+
58+
- Clusters
59+
You can select one or more availability zones during cluster creation.
60+
61+
### Fault tolerance
62+
63+
To prepare for availability zone failure, it's recommended to over-provision capacity of service to ensure that your cluster can tolerate the loss of capacity from one availability zone down and continue to function without degraded performance during zone-wide outages. For instance, if you enable 3 availability zones, your cluster should tolerate 1/3 of the nodes down (round up to the nearest integer).
64+
65+
### Zone down experience
66+
67+
Azure HDInsight on AKS service is zone redundant. During a zone-wide outage, the customer should expect degradation of performance due to capacity drop. Customers can still create new cluster pools and clusters in the availability zones that are not impacted. Existing clusters can function with reduced capacity. Individual open source workloads recommendations and best practices are provided on the documentation.
3668

3769
## Disaster recovery and business continuity
3870

3971
[!INCLUDE [introduction to disaster recovery](includes/reliability-disaster-recovery-description-include.md)]
4072

41-
Currently, Azure HDInsight on AKS CP(Control Plane) service and databases are deployed across regions of Azure. Among these regions, the Azure HDInsight on AKS instances and database instances are isolated. When an outage at region level occurs, one region is down. All the resources in this region, including the RP(Resource Provider) of Azure HDInsight on AKS CP, database of Azure HDInsight on AKS CP and all customer clusters in this region. In this case, we can only wait for the regional outage to end. When the outage is recovered, the Azure HDInsight on AKS service is back and all customer clusters are back, too. It's possible that there may be some problems due to data inconsistency after the outage and needs a manual fix.
73+
Azure HDInsight on AKS control plane service and databases are deployed across regions of Azure. Among these regions, the Azure HDInsight on AKS instances and database instances are isolated. When an outage at region level occurs, one region is down. All the resources in this region, including the RP (Resource Provider) of Azure HDInsight on AKS control plane, database of Azure HDInsight on AKS control plane and all customer clusters in this region. In this case, we can only wait for the regional outage to end. When the zonal outage is fully recovered, Azure HDInsight on AKS service is back and all customer clusters are back to normalcy. It's possible you may encounter some problems due to data inconsistency after the outage and may need a manual fix based on your application workloads.
4274

4375
### Multi-region disaster recovery
4476

@@ -52,13 +84,13 @@ Azure HDInsight on AKS currently doesn't support cross-region failover. Improvin
5284

5385
### Single-region disaster recovery
5486

55-
Currently, Azure HDInsight on AKS only has one standard service offering and clusters are created in a single-region geography. Customers are responsible for diaster recovery.
87+
Currently, Azure HDInsight on AKS only has one standard service offering and clusters are created in a single-region geography. Customers are responsible for diaster recovery settings based on the application requirements.
5688

5789
### Capacity and proactive disaster recovery resiliency
5890

59-
Azure HDInsight on AKS and its customers operate under the Shared responsibility model, which means that the customer must address DR for the service they deploy and control. To ensure that recovery is proactive, customers should always predeploy secondaries because there's no guarantee of capacity at time of impact for those who haven't preallocated.
91+
Azure HDInsight on AKS and its customers operate under the Shared responsibility model, which means that the customer must address disaster recovery requirements for the service they deploy and control. To ensure that recovery is proactive, customers should always predeploy secondaries because there's no guarantee of capacity at time of impact for those who haven't preallocated.
6092

61-
Unlike the original version of HDInsight, the Virtual Machines used in HDInsight on AKS clusters require the same Quota as Azure VMs. For more information, see [Capacity planning](../hdinsight-aks/virtual-machine-recommendation-capacity-planning.md#capacity-planning).
93+
Unlike HDInsight, the Virtual Machines used in HDInsight on AKS clusters require the same Quota as Azure VMs. For more information, see [Capacity planning](../hdinsight-aks/virtual-machine-recommendation-capacity-planning.md#capacity-planning).
6294

6395
## Related content
6496

0 commit comments

Comments
 (0)