You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/provisioned-migration.md
+13-13Lines changed: 13 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -51,7 +51,7 @@ Provisioned quota granularity is changing from model-specific to model-independe
51
51
52
52
## Model-independent quota
53
53
54
-
Starting on August 12, 2024, existing customers' current, model-specific quota has been converted to model-independent. This happens automatically. No quota is lost in the transition. Existing quota limits are summed and assigned to a new model-independent quota item.
54
+
As of August 12, 2024, existing customers' current, model-specific quota has been converted to model-independent. This happens automatically. No quota is lost in the transition. Existing quota limits are summed and assigned to a new model-independent quota item.
@@ -67,15 +67,15 @@ For existing customers, if the region already contains a quota assignment, the q
67
67
68
68
Customers no longer obtain quota by contacting their sales teams. Instead, they use the self-service quota request form and specify the PTU-Managed quota type. The form is accessible from a link to the right of the quota item. The target is to respond to all quota requests within two business days.
69
69
70
-
The following quota screenshot shows model-independent quota being used by deployments of different types, as well as the link for requesting additional quota.
70
+
The following quota screenshot shows model-independent quota being used by deployments of different types, and the link for requesting additional quota.
71
71
72
72
:::image type="content" source="../media/provisioned/quota-request-type.png" alt-text="Screenshot of new request type UI for Azure OpenAI provisioned for requesting more quota." lightbox="../media/provisioned/quota-request-type.png":::
73
73
74
74
## Quota as a limit
75
75
76
76
Prior to the August update, Azure OpenAI Provisioned was only available to a few customers, and quota was allocated to maximize the ability for them to deploy and use it. With these changes, the process of acquiring quota is simplified for all users, and there is a greater likelihood of running into service capacity limitations when deployments are attempted. A new API and portal experience are available to help users find regions where the subscription has quota and the service has capacity to support deployments of a desired model.
77
77
78
-
We also recommend that customers using commitments now create their deployments before creating or expanding commitments to cover them. This guarantees that capacity is available before creating a commitment and prevents over-purchase of the commitment. To support this, the restriction that prevented deployments from being created larger than their commitments has been removed. This new approach to quota, capacity availability and commitments matches what is provided under the hourly/reservation model, and the guidance to deploy before purchasing a commitment (or reservation, for the hourly model) is the same for both.
78
+
We also recommend that customers using commitments now create their deployments before creating or expanding commitments to cover them. This guarantees that capacity is available before creating a commitment and prevents over-purchase of the commitment. To support this, the restriction that prevented deployments from being created larger than their commitments has been removed. This new approach to quota, capacity availability, and commitments matches what is provided under the hourly/reservation model, and the guidance to deploy before purchasing a commitment (or reservation, for the hourly model) is the same for both.
79
79
80
80
See the following links for more information. The guidance for reservations and commitments is the same:
81
81
@@ -85,15 +85,15 @@ See the following links for more information. The guidance for reservations and
85
85
## New hourly reservation payment model
86
86
87
87
> [!NOTE]
88
-
> The following description of payment models does not apply to the older "Provisioned Classic (PTU-C)" offering. They only affect the Provisioned (aka Provisioned Managed) offering. Provisioned Classic continues to be governed by the unchanged monthly commitment payment model.
88
+
> The following description of payment models doesn't apply to the older "Provisioned Classic (PTU-C)" offering. They only affect the Provisioned (also known as Provisioned Managed) offering. Provisioned Classic continues to be governed by the unchanged monthly commitment payment model.
89
89
90
90
Microsoft has introduced a new "Hourly/reservation" payment model for provisioned deployments. This is in addition to the current **Commitment** payment model, which will continue to be supported at least through the end of 2024.
91
91
92
92
### Commitment payment model
93
93
94
94
- A regional, monthly commitment is required to use provisioned (longer terms available contractually).
95
95
96
-
- Commitments are bound to Azure OpenAI resources, which makes moving deployments across resources difficult.
96
+
- Commitments are bound to Azure OpenAI resources, which will make moving deployments across resources difficult.
97
97
98
98
- Commitments can't be canceled or altered during their term, except to add new PTUs.
99
99
@@ -149,9 +149,9 @@ Steps 1 and 2 are the same in all cases. The difference is whether a commitment
149
149
* The best practice is to create deployments first, and then to apply discounts. This is to guarantee that service. capacity is available to support your deployments prior to creating a term commitment for PTUs you cannot use.
150
150
151
151
> [!NOTE]
152
-
> When you follow best practices, you may receive hourly charges between the time you create the deployment and increase your discount (commitment or reservation).
152
+
> When you follow best practices, you might receive hourly charges between the time you create the deployment and increase your discount (commitment or reservation).
153
153
>
154
-
> For this reason, we recommend that you be prepared to increase your discount immediately following the deployment. The prerequisites for purchasing an Azure reservations are different than for commitments, and we recommend you validate them prior to deployment if you intend to use them to discount your deployment. For more information, see [Permissions to view and manage Azure reservations](/azure/cost-management-billing/reservations/view-reservations)
154
+
> For this reason, we recommend that you be prepared to increase your discount immediately following the deployment. The prerequisites for purchasing an Azure reservations are different than for commitments, and we recommend you validate them prior to deployment if you intend to use them to discount your deployment. For more information, see [Permissions to view and manage Azure reservations](/azure/cost-management-billing/reservations/view-reservations)
155
155
156
156
## Mapping deployments to discounting method
157
157
@@ -206,7 +206,7 @@ An alternative approach to self-service migration is to switch the reservation p
206
206
* There will be a short period of double-billing or hourly charges during the switchover from committed to hourly/reservation billing.
207
207
208
208
> [!IMPORTANT]
209
-
> Both self-service approaches generate some additional charges as the payment mode is switched from Committed to Hourly/Reservation. These are characteristics of the migration approaches and customers aren't credited for these charges. Customers may choose to use the managed migration approach described below to avoid them.
209
+
> Both self-service approaches generate some additional charges as the payment mode is switched from Committed to Hourly/Reservation. These are characteristics of the migration approaches and customers aren't credited for these charges. Customers can choose to use the managed migration approach described below to avoid them.
210
210
211
211
### Managed migration
212
212
@@ -273,11 +273,11 @@ For each new commitment you need to create, follow these steps:
273
273
:::image type="content" source="../media/how-to/provisioned-onboarding/commitment-tier.png" alt-text="Screenshot of commitment purchase UI." lightbox="../media/how-to/provisioned-onboarding/commitment-tier.png":::
274
274
275
275
> [!IMPORTANT]
276
-
> A new commitment is billed up-front for the entire term. If the renewal settings are set to auto-renew, then you will be billed again on each renewal date based on the renewal settings.
276
+
> A new commitment is billed up-front for the entire term. If the renewal settings are set to auto-renew, then you will be billed again on each renewal date based on the renewal settings.
277
277
278
278
### Edit an existing Provisioned Throughput commitment
279
279
280
-
From the Manage Commitments view, you can also edit an existing commitment. There are two types of changes you can make to an existing commitment:
280
+
From the **Manage Commitments** view, you can also edit an existing commitment. There are two types of changes you can make to an existing commitment:
281
281
282
282
- You can add PTUs to the commitment.
283
283
- You can change the renewal settings.
@@ -291,14 +291,14 @@ Adding PTUs to an existing commitment will allow you to create larger or more nu
291
291
:::image type="content" source="../media/how-to/provisioned-onboarding/increase-commitment.png" alt-text="Screenshot of commitment purchase UI with an increase in the amount to commit value." lightbox="../media/how-to/provisioned-onboarding/increase-commitment.png":::
292
292
293
293
> [!IMPORTANT]
294
-
> When you add PTUs to a commitment, they will be billed immediately, at a pro-rated amount from the current date to the end of the existing commitment term. Adding PTUs does not reset the commitment term.
294
+
> When you add PTUs to a commitment, they will be billed immediately, at a pro-rated amount from the current date to the end of the existing commitment term. Adding PTUs doesn't reset the commitment term.
295
295
296
296
### Changing renewal settings
297
297
298
-
Commitment renewal settings can be changed at any time before the expiration date of your commitment. Reasons you might want to change the renewal settings include ending your use of provisioned throughput by setting the commitment to not autorenew, or to decrease usage of provisioned throughput by lowering the number of PTUs that will be committed in the next period.
298
+
Commitment renewal settings can be changed at any time before the expiration date of your commitment. Reasons you might want to change the renewal settings include ending your use of provisioned throughput by setting the commitment to not autorenew, or to decrease usage of provisioned throughput by lowering the number of PTUs that will be committed in the next period.
299
299
300
300
> [!IMPORTANT]
301
-
> If you allow a commitment to expire or decrease in size such that the deployments under the resource require more PTUs than you have in your resource commitment, you will receive hourly overage charges for any excess PTUs. For example, a resource that has deployments that total 500 PTUs and a commitment for 300 PTUs will generate hourly overage charges for 200 PTUs.
301
+
> If you allow a commitment to expire or decrease in size such that the deployments under the resource require more PTUs than you have in your resource commitment, you will receive hourly overage charges for any excess PTUs. For example, a resource that has deployments that total 500 PTUs and a commitment for 300 PTUs will generate hourly overage charges for 200 PTUs.
302
302
303
303
## Monitor commitments and prevent unexpected billings
0 commit comments