Skip to content

Commit 6922694

Browse files
authored
Merge pull request #256319 from anaharris-ms/rel-comm-train
Reliability - Community Training
2 parents b20e5e2 + 8409e88 commit 6922694

File tree

3 files changed

+109
-4
lines changed

3 files changed

+109
-4
lines changed

articles/reliability/TOC.yml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -158,6 +158,8 @@
158158
href: ../vpn-gateway/create-zone-redundant-vnet-gateway.md?toc=/azure/reliability/toc.json&bc=/azure/reliability/breadcrumb/toc.json
159159
- name: Azure VPN Gateway
160160
href: ../vpn-gateway/about-zone-redundant-vnet-gateways.md?toc=/azure/reliability/toc.json&bc=/azure/reliability/breadcrumb/toc.json
161+
- name: Community Training
162+
href: reliability-community-training.md
161163
- name: Disaster recovery
162164
items:
163165
- name: What is disaster recovery?
@@ -420,6 +422,8 @@
420422
href: reliability-spring-apps.md
421423
- name: Azure Storage Mover
422424
href: reliability-azure-storage-mover.md
425+
- name: Community Training
426+
href: reliability-community-training.md
423427
- name: Microsoft Defender for Cloud DevOps security
424428
href: reliability-defender-devops.md
425429
- name: Microsoft Fabric
Lines changed: 100 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,100 @@
1+
---
2+
title: Reliability in Azure Community Training
3+
description: Find out about reliability in Azure Community Training.
4+
author: atulsoni87AI
5+
ms.author: anaharris
6+
ms.topic: reliability-article
7+
ms.custom: subject-reliability, references_regions
8+
ms.prod: non-product-specific
9+
ms.date: 12/06/2023
10+
---
11+
12+
13+
# Reliability in Community Training
14+
15+
Community Training is an Azure-powered cloud-based solution that can deliver large-scale, far-spread training programs with high quality and efficiency. With Community Training, organizations of all sizes and types can run large scale training programs for their internal and external communities. Communities can include frontline workers, extended workforces, a partner ecosystem, a volunteer network, and program beneficiaries.
16+
17+
This article describes reliability support in Community Training, and covers both regional resiliency with [availability zones](/azure/reliability/availability-zones-overview?tabs=azure-clit) and [disaster recovery and business continuity](/azure/reliability/disaster-recovery-overview). For a more detailed overview of a reliability principle in Azure, see [Azure reliability](/azure/reliability/overview).
18+
19+
## Availability zone support
20+
21+
[!INCLUDE [Availability zone description](includes/reliability-availability-zone-description-include.md)]
22+
23+
Community Training uses Azure availability zones to provide high availability and fault tolerance within an Azure region. Community training offers both control and data plane availability zone support:
24+
25+
- The [control plane](/azure/azure-resource-manager/management/control-plane-and-data-plane#control-plane) is zone redundant in the primary regions of availability.
26+
27+
- The [data plane](/azure/azure-resource-manager/management/control-plane-and-data-plane#data-plane) can be either zonal or zone-redundant, depending on what you choose for your needs. However, it's highly recommended that you choose a zone-redundant deployment in order to avoid data loss and maintain service availability during a zone outage.
28+
29+
30+
### Prerequisites
31+
32+
- Availability zones are supported for the following Community Training SKUs:
33+
34+
- Standard (lower scale of users)
35+
- Premium (high scale of users)
36+
37+
- Community Training is only supported in [paired regions](./cross-region-replication-azure.md#azure-paired-regions). Each secondary region is deployed with a zonal configuration. The following table shows all regions that support availability zones for Community Training, along with their paired region.
38+
39+
| Primary Region | Paired Region |
40+
|--------------------|--------------------|
41+
| UKSouth | UKWest |
42+
| AustraliaEast | AustraliaSoutheast |
43+
| EastUS | WestUS |
44+
| EastUS2 | CentralUS |
45+
| NorthEurope | WestEurope |
46+
| WestUS3 | EastUS |
47+
| SwedenCentral | SwedenSouth |
48+
49+
50+
51+
### Zonal failover support
52+
53+
Community Training uses many dependency Azure services, such as App service and Azure SQL. When you choose a zone redundant deployment, Community Training also creates zonal redundant offerings of those underlying Azure service resources. If one zone fails, all resources, including dependency resources, fail over to one of the healthy zones.
54+
55+
56+
#### Create a resource with availability zone enabled
57+
58+
Community Training provides configuration for availability zones only at the time of instance creation. If you wish to change your availability zone configuration after instance creation, you'll need to create a new instance. To learn how to create your Community Training instance, see [Create Community Training]().
59+
60+
### Zone down experience
61+
62+
- **Zonal**. During a zone-wide outage, Community Training can have either complete or partial service disruption. The extent to which it's available depends on various factors, such as whether the entire datacenter is down, or whether a specific dependency service isn't no longer available in that zone.
63+
64+
- **Zone redundant**. During a zone-wide outage, you shouldn't experience any impact on provisioned resources. However, you should be prepared for a brief interruption in communication with those resources. In a zone down situation, clients typically receive 409 error codes, as well as retry logic attempts to re-establish connections at appropriate intervals. New requests are directed to healthy nodes with zero impact on the user. During zone-wide outages, users are able to create new resources and successfully scale existing ones.
65+
66+
## Disaster recovery and business continuity
67+
68+
[!INCLUDE [introduction to disaster recovery](includes/reliability-disaster-recovery-description-include.md)]
69+
70+
The Microsoft Community Training team manages the entire disaster recovery procedure for Community Training. Disaster recovery isn't active-active or active passive, but is instead based on recovery from the most recent available backup of Azure services. The Community Training team manually creates all resources in the paired region from data backup.
71+
72+
>[!NOTE]
73+
>Community Training disaster recovery is only supported in [paired regions](./cross-region-replication-azure.md#azure-paired-regions).
74+
75+
### Disaster recovery in multi-region geography
76+
77+
- In a regional disaster, the **control plane** is manually failed over to the paired region. You should expect some service degradation in the time before the failover completes. After the failover, only read-only operations are supported until the disaster region is back online. The service is manually failed back to the original region once it's back online and all operations resume. Recovery Point Objective (RPO) is expected to be 10 minutes; Recovery Time Objective (RTO), 24 hours.
78+
79+
- For the **data plane**, Community Training offers microsoft managed disaster recovery. To use managed disaster recovery, you need to [enable disaster recovery]() during Community Training instance creation in Azure. Once you enable disaster recovery, Microsoft maintains the backup of storage and database in the paired region. Recovery Point Objective (RPO) is expected to be 12 hours; Recovery Time Objective (RTO), 48 hours.
80+
81+
82+
>[!NOTE]
83+
> RTO depends on database and storage size, latency between the paired region. Database or storage VM capacity (SKU). RPO depends on underlying Azure resources, such as [Azure SQL](/azure/azure-sql/database/recovery-using-backups?view=azuresql&tabs=azure-portal&preserve-view=true#geo-restore-considerations) and Azure storage. For more information on RTO and RPO, see [Overview of Disaster Recovery](./disaster-recovery-overview.md).
84+
85+
86+
#### Outage detection, notification, and management
87+
88+
When a Community Training health check detects an outage of any service, and in any region, Microsoft requests your consent for failover to the paired region. Microsoft informs you which features are available during the disaster recovery procedure. Once Microsoft receives your consent, the Community Training team can then start the disaster recovery procedure.
89+
90+
>[!IMPORTANT]
91+
> Learners will not be able to consume audio/video features until the primary region is operational. It's recommended that you avoid media upload operations until the primary site is operational.
92+
93+
94+
### Capacity and proactive disaster recovery resiliency
95+
96+
Microsoft and its customers operate under the shared responsibility model. Once any region is down, not only is the Community Training instance migrated to the paired region, but also all product and customer workloads are also migrated to paired region. This procedure can cause a shortage for resources in the paired region or datacenter. As a result, Disaster recovery availability depends on the available capacity of the underlying Azure resources.
97+
98+
## Next steps
99+
100+
- [Reliability in Azure](./overview.md)

articles/reliability/reliability-guidance-overview.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -89,11 +89,12 @@ Azure reliability guidance contains the following:
8989
| **Products** |
9090
|--------------|
9191
|[Azure Cosmos DB for MongoDB vCore](../cosmos-db/mongodb/vcore/failover-disaster-recovery.md?toc=/azure/reliability/toc.json&bc=/azure/reliability/breadcrumb/toc.json) |
92-
| [Azure Data Manager for Energy](./reliability-energy-data-services.md) |
93-
| [Azure Deployment Environments](reliability-deployment-environments.md)|
92+
|[Azure Data Manager for Energy](./reliability-energy-data-services.md) |
93+
|[Azure Deployment Environments](reliability-deployment-environments.md)|
9494
|[Azure Private 5G Core](../private-5g-core/reliability-private-5g-core.md?toc=/azure/reliability/toc.json&bc=/azure/reliability/breadcrumb/toc.json)|
95-
| [Azure Spring Apps](reliability-spring-apps.md) |
96-
| [Azure Storage Mover](./reliability-azure-storage-mover.md)|
95+
|[Azure Spring Apps](reliability-spring-apps.md) |
96+
|[Azure Storage Mover](./reliability-azure-storage-mover.md)|
97+
|[Community Training](reliability-community-training.md) |
9798
|[Microsoft Defender for Cloud DevOps security](./reliability-defender-devops.md)|
9899
| [Microsoft Fabric](reliability-fabric.md) |
99100

0 commit comments

Comments
 (0)