You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/reliability/reliability-aks.md
+18-18Lines changed: 18 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -135,62 +135,62 @@ For more information, see [Zone resiliency considerations for AKS](/azure/aks/ak
135
135
136
136
### Failback
137
137
138
-
When the availability zone recovers, failback behavior differs depending on the component:
138
+
When the availability zone recovers, failback behavior depends on the component:
139
139
140
140
-**Control plane:** AKS automatically restores control plane operations across all availability zones. No manual intervention is required.
141
141
142
-
-**Node pools and nodes:** Immediately after failback, nodes remain in the previously healthy zones and don't get restored into the recovered zone. However, the next time a nodescaling operation is performed, such as when you scale out your node pool, the node pool can create nodes in the recovered zone.
142
+
-**Node pools and nodes:** Immediately after failback, nodes remain in the previously healthy zones and aren't restored into the recovered zone. However, the next time you perform a node-scaling operation, such as when you scale out your node pool, the node pool can create nodes in the recovered zone.
143
143
144
-
-**Pods:** Immediately after failback, pods continue to run on the nodes they're already running on. When new pods are created or existing pods are recreated, they're eligible to use nodes in the recovered zone.
144
+
-**Pods:** Immediately after failback, pods continue to run on the nodes that they currently run on. When new pods are created or existing pods are re-created, they're eligible to use nodes in the recovered zone.
145
145
146
146
-**Storage:** Any storage attached to pods recovers based on [how zone-redundant storage works](/azure/storage/common/storage-redundancy).
147
147
148
148
### Testing for zone failures
149
149
150
-
You can test your resiliency to availability zone failures using the following methods:
150
+
You can test your resiliency to availability zone failures by using the following methods:
151
151
152
152
-[Cordon and drain nodes in a single availability zone](/azure/aks/aks-zone-resiliency#method-1-cordon-and-drain-nodes-in-a-single-az)
153
-
-[Simulate an availability zone failure using Azure Chaos Studio](/azure/aks/aks-zone-resiliency#method-2-simulate-an-az-failure-using-azure-chaos-studio)
153
+
-[Simulate an availability zone failure by using Azure Chaos Studio](/azure/aks/aks-zone-resiliency#method-2-simulate-an-az-failure-using-azure-chaos-studio)
154
154
155
155
## Multiple-region support
156
156
157
157
AKS clusters are single-region resources. If the region is unavailable, your AKS cluster is also unavailable.
158
158
159
159
### Alternative multiple-region approaches
160
160
161
-
If you need to deploy your Kubernetes workload to multiple Azure regions, you have two categories of options to manage the orchestration of these clusters:
161
+
If you need to deploy your Kubernetes workload to multiple Azure regions, you have two options to manage the orchestration of these clusters.
162
162
163
-
-**[Azure Kubernetes Fleet Manager (Fleet)](/azure/kubernetes-fleet/overview)** offers a simple and more managed experience. With Fleet, you can:
163
+
-Use [Azure Kubernetes Fleet Manager](/azure/kubernetes-fleet/overview) if you want a simpler, managed experience. By using Azure Kubernetes Fleet Manager, you can:
164
164
165
165
- Manage a set of AKS clusters as a single unit, and those clusters can be distributed across multiple Azure regions.
166
166
167
167
- Automate certain aspects of cluster management, such as cluster and node image upgrades.
168
168
169
-
- Use Fleet traffic distribution capabilities to spread traffic across the clusters and automatically fail over if a region is unavailable.
169
+
- Use traffic distribution capabilities to spread traffic across the clusters and automatically fail over if a region is unavailable.
170
170
171
-
-**Orchestrate failover with a manual active-active or active-passive deployment model** if your workload requires more nuanced control over the different components of inter-region failovers. For more information, see [High availability and disaster recovery overview for Azure Kubernetes Service (AKS)](/azure/aks/ha-dr-overview).
171
+
- Orchestrate failover with a manual active-active or active-passive deployment model if your workload requires more nuanced control over the different components of interregional failovers. For more information, see [High availability and disaster recovery overview for AKS](/azure/aks/ha-dr-overview).
172
172
173
173
## Backups
174
174
175
-
With Azure Backup, you can use a backup extension to back up AKS cluster resources and persistent volumes attached to the cluster. The Backup vault communicates with the AKS cluster through the extension to perform backup and restore operations.
175
+
By using Azure Backup, you can use a backup extension to back up AKS cluster resources and persistent volumes that attach to the cluster. The Backup vault communicates with the AKS cluster through the extension to perform backup and restore operations.
176
176
177
177
If your AKS cluster is in a [region with a pair](./regions-paired.md), you can configure backups to be stored in geo-redundant storage. You can restore geo-redundant backups into the paired region.
178
178
179
179
For more information, see the following articles:
180
180
181
-
-[About AKS backup using Azure Backup](/azure/backup/azure-kubernetes-service-backup-overview)
182
-
-[Back up AKS using Azure Backup](/azure/backup/azure-kubernetes-service-cluster-backup)
181
+
-[What is Azure Kubernetes Service backup?](/azure/backup/azure-kubernetes-service-backup-overview)
182
+
-[Back up AKS by using Azure Backup](/azure/backup/azure-kubernetes-service-cluster-backup)
183
183
184
-
For most solutions, you shouldn't rely exclusively on backups. Instead, use the other capabilities described in this guide to support your resiliency requirements, and strive to use stateless clusters that minimize the need for backup. Store data in external storage systems and databases instead of within your cluster.
184
+
For most solutions, you shouldn't rely exclusively on backups. Instead, use the other capabilities described in this guide to support your resiliency requirements. Strive to use stateless clusters that minimize the need for backup. Store data in external storage systems and databases instead of within your cluster.
185
185
186
-
If you maintain state in your cluster, backups protect against some risks that other approaches don't. For more information, see [What are redundancy, replication, and backup?](concept-redundancy-replication-backup.md)
186
+
If you maintain state in your cluster, backups protect against some risks that other approaches don't. For more information, see [Redundancy, replication, and backup](concept-redundancy-replication-backup.md).
187
187
188
-
## Service-level agreement
188
+
## Service-level agreement and pricing tiers
189
189
190
-
The service-level agreement (SLA) for Azure Logic Apps describes the expected availability of the service. This agreement also describes the conditions to meet for achieving this expectation. To understand these conditions, make sure that you review the [Service Level Agreements (SLA) for Online Services](https://www.microsoft.com/licensing/docs/view/Service-Level-Agreements-SLA-for-Online-Services).
190
+
The service-level agreement (SLA) for Azure Logic Apps describes the expected availability of the service. This agreement also describes the conditions to meet to achieve this expectation. To understand these conditions, review [SLAs for online services](https://www.microsoft.com/licensing/docs/view/Service-Level-Agreements-SLA-for-Online-Services).
191
191
192
-
AKS offers three pricing tiers for cluster management: **Free**, **Standard**, and **Premium**. For more information, see [Free, Standard, and Premium pricing tiers for Azure Kubernetes Service (AKS) cluster management](/azure/aks/free-standard-pricing-tiers). The Free tier enables you to use AKS to test your workloads. The Standard and Premium tiers are designed for production workloads. When you deploy an AKS cluster with availability zones enabled, the uptime percentage defined in the SLA increases. However, you're only covered by the SLA if you deploy a cluster in the Standard or Premium pricing tiers.
192
+
AKS offers three pricing tiers for cluster management: **Free**, **Standard**, and **Premium**. For more information, see [Free, Standard, and Premium pricing tiers for AKS cluster management](/azure/aks/free-standard-pricing-tiers). The Free tier enables you to use AKS to test your workloads. The Standard and Premium tiers are designed for production workloads. When you deploy an AKS cluster that has availability zones enabled, the uptime percentage defined in the SLA increases. However, the SLA applies only if you deploy a cluster in the Standard or Premium pricing tier.
0 commit comments