Commit 926332a

Update articles/ai-services/openai/concepts/provisioned-reservation-update.md
Co-authored-by: Michael <[email protected]>
1 parent e4948e9 commit 926332a

1 file changed, +1 -1 lines changed

articles/ai-services/openai/concepts/provisioned-reservation-update.md

Lines changed: 1 addition & 1 deletion
@@ -53,7 +53,7 @@ Existing customers of provisioned deployments can choose to migrate to global or
 Two approaches are available for customers to migrate from provisioned deployments to global or data zone provisioned deployments.

 ### Zero downtime migration
-The zero downtime migration approach allows customers to migrate their existing provisioned deployments to Global or Data zone provisioned deployments without interrupting the existing inference traffic on their deployment. This migration approach minimizes workload interruptions but does require a customer to coexist multiple deployments to shift traffic over. The process to migrate a provisioned deployment using the zero downtime migration approach is as follows:
+The zero downtime migration approach allows customers to migrate their existing provisioned deployments to global or data zone provisioned deployments without interrupting the existing inference traffic on their deployment. This migration approach minimizes workload interruptions, but does require a customer to have multiple coexisting deployments while shifting traffic over. The process to migrate a provisioned deployment using the zero downtime migration approach is as follows:
 - Create a new deployment using the global or data zone provisioned deployment types in the target Azure OpenAI resource.
 - Transition traffic from the existing regional provisioned deployment type to the newly created global or data zone provisioned deployment until all traffic is offloaded from the existing regional provisioned deployment.
 - Once traffic is migrated over to the new deployment, validate that there are no inference requests being processed on the previous provisioned deployment by ensuring the Azure OpenAI Requests metric does not show any API calls made within 5-10 minutes of the inference traffic being migrated over to the new deployment. For more information on this metric, [see the Monitor Azure OpenAI documentation](https://aka.ms/aoai/docs/monitor-azure-openai).
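
The traffic-shifting and validation steps above could be sketched client-side roughly as follows. This is an illustrative sketch only, not part of the commit or of any Azure SDK: `pick_deployment` and `old_deployment_is_idle` are hypothetical helper names, and the `(timestamp, request_count)` samples are assumed to have been read separately from the Azure OpenAI Requests metric for the old deployment.

```python
import random
from datetime import datetime, timedelta, timezone


def pick_deployment(new_share, old_name, new_name, rng=random):
    """Route one request: pick the new deployment with probability new_share.

    Hypothetical helper; raise new_share from 0.0 to 1.0 over time to
    shift traffic gradually off the old regional deployment.
    """
    if not 0.0 <= new_share <= 1.0:
        raise ValueError("new_share must be between 0.0 and 1.0")
    return new_name if rng.random() < new_share else old_name


def old_deployment_is_idle(samples, cutover_time, now,
                           quiet_period=timedelta(minutes=10)):
    """Check that the old deployment saw no requests after cutover.

    samples: iterable of (timestamp, request_count) pairs, assumed to come
    from the Azure OpenAI Requests metric for the old deployment.
    Returns False until the full quiet period has elapsed.
    """
    window_end = cutover_time + quiet_period
    if now < window_end:
        return False  # too early to conclude anything
    return all(count == 0
               for ts, count in samples
               if cutover_time < ts <= window_end)
```

The 10-minute default for `quiet_period` reflects the upper end of the 5-10 minute window mentioned in the last step; a shorter window could be passed in if that is acceptable for the workload.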
