Skip to content

Commit 94343be

Browse files
authored
Update provisioned-reservation-update.md
1 parent be1ff55 commit 94343be

File tree

1 file changed

+14
-5
lines changed

1 file changed

+14
-5
lines changed

articles/ai-services/openai/concepts/provisioned-reservation-update.md

Lines changed: 14 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -14,9 +14,9 @@ recommendations: false
1414
# Azure OpenAI provisioned December 2024 update
1515

1616
In early-December, 2024, Microsoft launched several changes to the Provisioned offering. These changes include:
17-
- A new deployment type, Data zone provisioned
18-
- Updated hourly pricing for Global and Data zone provisioned deployment types
19-
- New Azure Reservations for Global and Data zone provisioned deployment types
17+
- A new deployment type, Data zone provisioned
18+
- Updated hourly pricing for Global and Data zone provisioned deployment types
19+
- New Azure Reservations for Global and Data zone provisioned deployment types
2020

2121
This article is intended for existing users of the provisioned throughput offering. New customers should refer to the [Azure OpenAI provisioned onboarding guide](../how-to/provisioned-throughput-onboarding.md).
2222

@@ -54,9 +54,18 @@ Two approaches are available for customers to migrate from provisioned deploymen
5454

5555
### Zero downtime migration
5656
The zero downtime migration approach allows customers to migrate their existing provisioned deployments to Global or Data zone provisioned deployments without interrupting the existing inference traffic on their deployment. This migration approach minimizes workload interruptions but does require a customer to coexist multiple deployments to shift traffic over. The process to migrate a provisioned deployment using the zero downtime migration approach is as follows:
57-
- Create a new
57+
- Create a new deployment using the `GlobalProvisionedManaged` or `DataZoneProvisionedManaged` deployment types in the target Azure OpenAI resource.
58+
- Transition traffic from the existing `ProvisionedManaged` deployment type to the newly created `GlobalProvisionedManaged` or `DataZoneProvisionedManaged` deployment until all traffic is offloaded from the existing `ProvisionedManaged` deployment.
59+
- Once traffic is migrated over to the new deployment, validate that there are no inference requests being processed on the previous `ProvisionedManaged` deployment by ensuring the Azure OpenAI Requests metric ([see Monitor Azure OpenAI](https://aka.ms/aoai/docs/monitor-azure-openai) for more details on this metric) does not show any API calls made within 5-10 minutes of the inference traffic being migrated over to the new deployment.
60+
- Once you confirm that no inference calls have been made on the previous `ProvisionedManaged` deployment, delete the `ProvisionedManaged` deployment.
5861

5962
### Migration with downtime
6063
The migration with downtime approach involves migrating existing provisioned deployments to Global or Data zone provisioned deployments while stopping any exisitng inference traffic on the original provisioned deployment. This migration approach does not require coexistence of multiple deployments to support but does require workload interruption to complete. The process to migration a provisioned deployment using the migration with downtime approach is as follows:
61-
-
64+
- Validate that there are no inference requests being processed on the previous `ProvisionedManaged` deployment by ensuring the Azure OpenAI Requests metric ([see Monitor Azure OpenAI](https://aka.ms/aoai/docs/monitor-azure-openai) for more details on this metric) does not show any API calls made within the last 5-10 minutes.
65+
- Once you confirm that no inference calls have been made on your `ProvisionedManaged` deployment, delete the `ProvisionedManaged` deployment.
66+
- Create a new deployment using the `GlobalProvisionedManaged` or `DataZoneProvisionedManaged` deployment types in the target Azure OpenAI resource.
67+
- Once your new deployment has succeeded, you may resume inference traffic on the new `GlobalProvisionedManaged` or `DataZoneProvisionedManaged` deployment.
68+
69+
## How do I migrate my existing Azure Reservation to the new Azure Reservation products?
70+
As noted above, Azure Reservation products for Azure OpenAI Service provisioned will be specific to the provisioned deployment type. If the Azure Reservation purhcased does not match the provisioned deployment type, the deployment will default to the hourly payment model. If you choose to migrate to Global or Data zone provisioned deployments, you may need to purchase a new Azure Reservation for these deployments to support additional discounts. For more information on how to purchase a new Azure Reservation or make changes to an existing Azure Reservation, see the [Azure Reservations for Azure OpenAI Service Provisioned guidance](https://aka.ms/oai/docs/ptum-reservations).
6271

0 commit comments

Comments
 (0)