You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/service-fabric/how-to-managed-cluster-availability-zones.md
+13-15Lines changed: 13 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -26,10 +26,10 @@ Sample templates are available: [Service Fabric cross availability zone template
26
26
27
27
A Service Fabric cluster distributed across Availability Zones ensures high availability of the cluster state.
28
28
29
-
The recommended topology for managed cluster requires the resources outlined below:
29
+
The recommended topology for managed cluster requires the following resources:
30
30
31
31
* The cluster SKU must be Standard
32
-
* Primary node type should have at least nine nodes for best resiliency, but supports minimum number of six.
32
+
* Primary node type should have at least nine nodes (3 in each AZ) for best resiliency, but supports minimum number of six (2 in each AZ).
33
33
* Secondary node type(s) should have at least six nodes for best resiliency, but supports minimum number of three.
34
34
35
35
>[!NOTE]
@@ -49,18 +49,16 @@ Sample node list depicting FD/UD formats in a virtual machine scale set spanning
49
49
When a service is deployed on the node types that are spanning zones, the replicas are placed to ensure they land up in separate zones. This separation is ensured as the fault domain’s on the nodes present in each of these node types are configured with the zone information (i.e FD = fd:/zone1/1 etc.). For example: for five replicas or instances of a service, the distribution will be 2-2-1 and runtime will try to ensure equal distribution across AZs.
50
50
51
51
**User Service Replica Configuration**:
52
-
Stateful user services deployed on the cross-availability zone node types should be configured with this configuration: replica count with target = 9, min = 5. This configuration helps the service to be working even when one zone goes down since 6 replicas will be still up in the other two zones. An application upgrade in such a scenario will also go through.
52
+
Stateful user services deployed on the cross-availability zone node types should be configured with this configuration: replica count with target = 9, min = 5. This configuration helps the service to be working even when one zone goes down since six replicas will be still up in the other two zones. An application upgrade in such a scenario will also go through.
53
53
54
54
**Zone down scenario**:
55
-
When a zone goes down, all the nodes in that zone appear as down. Service replicas on these nodes will also be down. Since there are replicas in the other zones, the service continues to be responsive with primary replicas failing over to the zones which are functioning. The services will appear in warning state as the target replica count is not met and the VM count is still more than the defined min target replica size. As a result, Service Fabric load balancer brings up replicas in the working zones to match the configured target replica count. At this point, the services should appear healthy. When the zone that was down comes back up, the load balancer will again spread all the service replicas evenly across all the zones.
55
+
When a zone goes down, all the nodes in that zone appear as down. Service replicas on these nodes will also be down. Since there are replicas in the other zones, the service continues to be responsive with primary replicas failing over to the zones that are functioning. The services will appear in warning state as the target replica count is not met and the VM count is still more than the defined min target replica size. As a result, Service Fabric load balancer brings up replicas in the working zones to match the configured target replica count. At this point, the services should appear healthy. When the zone that was down comes back up, the load balancer will again spread all the service replicas evenly across all the zones.
56
56
57
57
## Networking Configuration
58
58
For more information, see [Configure network settings for Service Fabric managed clusters](./how-to-managed-cluster-networking.md)
59
59
60
60
## Enabling a zone resilient Azure Service Fabric managed cluster
61
-
To enable a zone resilient Azure Service Fabric managed cluster, you must include the following in the managed cluster resource definition.
62
-
63
-
* The **ZonalResiliency** property, which specifies if the cluster is zone resilient or not.
61
+
To enable a zone resilient Azure Service Fabric managed cluster, you must include the following **ZonalResiliency** property, which specifies if the cluster is zone resilient or not.
64
62
65
63
```json
66
64
{
@@ -74,8 +72,8 @@ To enable a zone resilient Azure Service Fabric managed cluster, you must includ
74
72
}
75
73
```
76
74
77
-
## Migrate an existing non-zone resilient cluster to Zone Resilient (Preview)
78
-
Existing Service Fabric managed clusters which are not spanned across availability zones can now be migrated in-place to span availability zones. Supported scenarios include clusters created in regions that have three availability zones as well as clusters in regions where three availability zones are made available post-deployment.
75
+
## Migrate an existing nonzone resilient cluster to Zone Resilient (Preview)
76
+
Existing Service Fabric managed clusters that are not spanned across availability zones can now be migrated in-place to span availability zones. Supported scenarios include clusters created in regions that have three availability zones and clusters in regions where three availability zones are made available post-deployment.
79
77
80
78
Requirements:
81
79
* Standard SKU cluster
@@ -118,16 +116,16 @@ Requirements:
118
116
}
119
117
```
120
118
121
-
If the Public IP resource is not zone resilient, migration of the cluster will cause a brief loss of external connectivity. This is due to the migration setting up new Public IP and updating the cluster FQDN to the new IP. If the Public IP resource is zone resilient, migration will not modify the Public IP resource or FQDN and there will be no external connectivity impact.
119
+
If the Public IP resource is not zone resilient, migration of the cluster will cause a brief loss of external connectivity. This connection loss is due to the migration setting up new Public IP and updating the cluster FQDN to the new IP. If the Public IP resource is zone resilient, migration will not modify the Public IP resource nor the FQDN, and there will be no external connectivity impact.
122
120
123
121
2) Initiate conversion of the underlying storage account created for managed cluster from LRS to ZRS using [customer-initiated conversion](../storage/common/redundancy-migration.md#customer-initiated-conversion). The resource group of storage account that needs to be migrated would be of the form "SFC_ClusterId"(ex SFC_9240df2f-71ab-4733-a641-53a8464d992d) under the same subscription as the managed cluster resource.
124
122
125
123
3) Add a new primary node type which spans across availability zones
126
124
127
-
This step will trigger the resource provider to perform the migration of the primary node type and Public IP along with a cluster FQDN DNS update, if needed, to become zone resilient. Use the above API to understand implication of this step.
125
+
This step triggers the resource provider to perform the migration of the primary node type and Public IP along with a cluster FQDN DNS update, if needed, to become zone resilient. Use the above API to understand implication of this step.
128
126
129
127
* Use apiVersion 2022-02-01-preview or higher.
130
-
* Add a new primary node type to the cluster with zones parameter set to ["1", "2", "3"] as show below:
128
+
* Add a new primary node type to the cluster with zones parameter set to ["1," "2," "3"] as show below:
131
129
```json
132
130
{
133
131
"apiVersion": "2022-02-01-preview",
@@ -194,7 +192,7 @@ Requirements:
194
192
```http
195
193
POST https://management.azure.com/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.ServiceFabric/managedClusters/{clusterName}/getazresiliencystatus?api-version=2022-02-01-preview
196
194
```
197
-
This should provide with response similar to:
195
+
This API call should provide a response similar to:
198
196
```json
199
197
{
200
198
"baseResourceStatus" :[
@@ -220,9 +218,9 @@ Requirements:
220
218
If you run in to any problems reach out to support for assistance.
221
219
222
220
## Enable FastZonalUpdate on Service Fabric managed clusters (preview)
223
-
Service Fabric managed clusters support faster cluster and application upgrades by reducing the max upgrade domains per availability zone. The default configuration right now can have at most 15 UDs in multiple AZ nodetype. This huge number of UDs reduced the upgrade velocity. Using the new configuration, the max UDs are reduced, which results in faster updates, keeping the safety of the upgrades intact.
221
+
Service Fabric managed clusters support faster cluster and application upgrades by reducing the max upgrade domains per availability zone. The default configuration right now can have at most 15 UDs in multiple AZ nodetype. This huge number of UDs reduced the upgrade velocity. The new configuration reduces the max UDs, which results in faster updates, keeping the safety of the upgrades intact.
224
222
225
-
The update should be done via ARM template by setting the zonalUpdateMode property to “fast” and then modifying a node type attribute, such as adding a node and then removing the node to each nodetype (see required steps 2 and 3 below). The Service Fabric managed cluster resource apiVersion should be 2022-10-01-preview or later.
223
+
The update should be done via ARM template by setting the zonalUpdateMode property to “fast” and then modifying a node type attribute, such as adding a node and then removing the node to each nodetype (see required steps 2 and 3). The Service Fabric managed cluster resource apiVersion should be 2022-10-01-preview or later.
226
224
227
225
1. Modify the ARM template with the new property mentioned above.
0 commit comments