Skip to content

Commit 6dbb0a5

Browse files
Merge pull request #2206 from aahill/ptu-update
updating PTU article
2 parents 891224d + 9aacfad commit 6dbb0a5

File tree

2 files changed

+24
-26
lines changed

2 files changed

+24
-26
lines changed

articles/ai-services/openai/concepts/provisioned-migration.md

Lines changed: 22 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -7,9 +7,9 @@ ms.service: azure-ai-openai
77
ms.custom:
88
- ignite-2024
99
ms.topic: how-to
10-
ms.date: 11/11/2024
11-
author: mrbullwinkle
12-
ms.author: mbullwin
10+
ms.date: 01/08/2025
11+
author: aahill
12+
ms.author: aahi
1313
recommendations: false
1414
---
1515

@@ -51,11 +51,11 @@ Provisioned quota granularity is changing from model-specific to model-independe
5151

5252
## Model-independent quota
5353

54-
Starting on August 12, 2024, existing customers' current, model-specific quota has been converted to model-independent. This happens automatically. No quota is lost in the transition. Existing quota limits are summed and assigned to a new model-independent quota item.
54+
As of August 12, 2024, existing customers' current, model-specific quota has been converted to model-independent. This happens automatically. No quota is lost in the transition. Existing quota limits are summed and assigned to a new model-independent quota item.
5555

5656
:::image type="content" source="../media/provisioned/consolidation.png" alt-text="Diagram showing quota consolidation." lightbox="../media/provisioned/consolidation.png":::
5757

58-
The new model-independent quota shows up as a quota item named **Provisioned Managed Throughput Unit**, with model and version no longer included in the name. In the Studio Quota pane, expanding the quota item still shows all of the deployments that contribute to the quota item.
58+
The new model-independent quota shows up as a quota item named **Provisioned Managed Throughput Unit**, with model and version no longer included in the name. In the Azure AI Foundry's quota pane, expanding the quota item still shows all of the deployments that contribute to the quota item.
5959

6060
### Default quota
6161

@@ -67,15 +67,15 @@ For existing customers, if the region already contains a quota assignment, the q
6767

6868
Customers no longer obtain quota by contacting their sales teams. Instead, they use the self-service quota request form and specify the PTU-Managed quota type. The form is accessible from a link to the right of the quota item. The target is to respond to all quota requests within two business days.
6969

70-
The following quota screenshot shows model-independent quota being used by deployments of different types, as well as the link for requesting additional quota.
70+
The following quota screenshot shows model-independent quota being used by deployments of different types, and the link for requesting additional quota.
7171

7272
:::image type="content" source="../media/provisioned/quota-request-type.png" alt-text="Screenshot of new request type UI for Azure OpenAI provisioned for requesting more quota." lightbox="../media/provisioned/quota-request-type.png":::
7373

7474
## Quota as a limit
7575

76-
Prior to the August update, Azure OpenAI Provisioned was only available to a few customers, and quota was allocated to maximize the ability for them to deploy and use it. With these changes, the process of acquiring quota is simplified for all users, and there is a greater likelihood of running into service capacity limitations when deployments are attempted. A new API and Studio experience are available to help users find regions where the subscription has quota and the service has capacity to support deployments of a desired model.
76+
Prior to the August update, Azure OpenAI Provisioned was only available to a few customers, and quota was allocated to maximize the ability for them to deploy and use it. With these changes, the process of acquiring quota is simplified for all users, and there is a greater likelihood of running into service capacity limitations when deployments are attempted. A new API and portal experience are available to help users find regions where the subscription has quota and the service has capacity to support deployments of a desired model.
7777

78-
We also recommend that customers using commitments now create their deployments before creating or expanding commitments to cover them. This guarantees that capacity is available before creating a commitment and prevents over-purchase of the commitment. To support this, the restriction that prevented deployments from being created larger than their commitments has been removed. This new approach to quota, capacity availability and commitments matches what is provided under the hourly/reservation model, and the guidance to deploy before purchasing a commitment (or reservation, for the hourly model) is the same for both.
78+
We also recommend that customers using commitments now create their deployments before creating or expanding commitments to cover them. This guarantees that capacity is available before creating a commitment and prevents over-purchase of the commitment. To support this, the restriction that prevented deployments from being created larger than their commitments has been removed. This new approach to quota, capacity availability, and commitments matches what is provided under the hourly/reservation model, and the guidance to deploy before purchasing a commitment (or reservation, for the hourly model) is the same for both.
7979

8080
See the following links for more information. The guidance for reservations and commitments is the same:
8181

@@ -85,15 +85,15 @@ See the following links for more information. The guidance for reservations and
8585
## New hourly reservation payment model
8686

8787
> [!NOTE]
88-
> The following description of payment models does not apply to the older "Provisioned Classic (PTU-C)" offering. They only affect the Provisioned (aka Provisioned Managed) offering. Provisioned Classic continues to be governed by the unchanged monthly commitment payment model.
88+
> The following description of payment models doesn't apply to the older "Provisioned Classic (PTU-C)" offering. They only affect the Provisioned (also known as Provisioned Managed) offering. Provisioned Classic continues to be governed by the unchanged monthly commitment payment model.
8989
9090
Microsoft has introduced a new "Hourly/reservation" payment model for provisioned deployments. This is in addition to the current **Commitment** payment model, which will continue to be supported at least through the end of 2024.
9191

9292
### Commitment payment model
9393

9494
- A regional, monthly commitment is required to use provisioned (longer terms available contractually).
9595

96-
- Commitments are bound to Azure OpenAI resources, which makes moving deployments across resources difficult.
96+
- Commitments are bound to Azure OpenAI resources, which will make moving deployments across resources difficult.
9797

9898
- Commitments can't be canceled or altered during their term, except to add new PTUs.
9999

@@ -149,9 +149,9 @@ Steps 1 and 2 are the same in all cases. The difference is whether a commitment
149149
* The best practice is to create deployments first, and then to apply discounts. This is to guarantee that service. capacity is available to support your deployments prior to creating a term commitment for PTUs you cannot use.
150150

151151
> [!NOTE]
152-
> When you follow best practices, you may receive hourly charges between the time you create the deployment and increase your discount (commitment or reservation).
152+
> When you follow best practices, you might receive hourly charges between the time you create the deployment and increase your discount (commitment or reservation).
153153
>
154-
> For this reason, we recommend that you be prepared to increase your discount immediately following the deployment. The prerequisites for purchasing an Azure reservations are different than for commitments, and we recommend you validate them prior to deployment if you intend to use them to discount your deployment. For more information, see [Permissions to view and manage Azure reservations](/azure/cost-management-billing/reservations/view-reservations)
154+
> For this reason, we recommend that you be prepared to increase your discount immediately following the deployment. The prerequisites for purchasing an Azure reservations are different than for commitments, and we recommend you validate them prior to deployment if you intend to use them to discount your deployment. For more information, see [Permissions to view and manage Azure reservations](/azure/cost-management-billing/reservations/view-reservations)
155155
156156
## Mapping deployments to discounting method
157157

@@ -206,7 +206,7 @@ An alternative approach to self-service migration is to switch the reservation p
206206
* There will be a short period of double-billing or hourly charges during the switchover from committed to hourly/reservation billing.
207207

208208
> [!IMPORTANT]
209-
> Both self-service approaches generate some additional charges as the payment mode is switched from Committed to Hourly/Reservation. These are characteristics of the migration approaches and customers aren't credited for these charges. Customers may choose to use the managed migration approach described below to avoid them.
209+
> Both self-service approaches generate some additional charges as the payment mode is switched from Committed to Hourly/Reservation. These are characteristics of the migration approaches and customers aren't credited for these charges. Customers can choose to use the managed migration approach described below to avoid them.
210210
211211
### Managed migration
212212

@@ -233,7 +233,7 @@ Customers must reach out to their account teams to schedule a managed migration.
233233

234234
## Managing Provisioned Throughput Commitments
235235

236-
Provisioned throughput commitments are created and managed from the **Manage Commitments** menu in Azure OpenAI Studio. You can navigate to this view by selecting **Manage Commitments** from the Quota menu:
236+
Provisioned throughput commitments are created and managed by selecting **Management center** in the Azure AI Foundry portal's navigation menu > **Quota** > **Manage Commitments**.
237237

238238
:::image type="content" source="../media/how-to/provisioned-onboarding/notifications.png" alt-text="Screenshot of commitment purchase UI with notifications." lightbox="../media/how-to/provisioned-onboarding/notifications.png":::
239239

@@ -247,7 +247,7 @@ The following sections will take you through these tasks.
247247

248248
## Purchase a Provisioned Throughput Commitment
249249

250-
With your commitment plan ready, the next step is to create the commitments. Commitments are created manually via Azure OpenAI Studio and require the user creating the commitment to have either the [Contributor or Cognitive Services Contributor role](../how-to/role-based-access-control.md) at the subscription level.
250+
With your commitment plan ready, the next step is to create the commitments. Commitments are created manually via the Azure AI Foundry and require the user creating the commitment to have either the [Contributor or Cognitive Services Contributor role](../how-to/role-based-access-control.md) at the subscription level.
251251

252252
For each new commitment you need to create, follow these steps:
253253

@@ -273,11 +273,11 @@ For each new commitment you need to create, follow these steps:
273273
:::image type="content" source="../media/how-to/provisioned-onboarding/commitment-tier.png" alt-text="Screenshot of commitment purchase UI." lightbox="../media/how-to/provisioned-onboarding/commitment-tier.png":::
274274

275275
> [!IMPORTANT]
276-
> A new commitment is billed up-front for the entire term. If the renewal settings are set to auto-renew, then you will be billed again on each renewal date based on the renewal settings.
276+
> A new commitment is billed up-front for the entire term. If the renewal settings are set to auto-renew, then you will be billed again on each renewal date based on the renewal settings.
277277
278278
### Edit an existing Provisioned Throughput commitment
279279

280-
From the Manage Commitments view, you can also edit an existing commitment. There are two types of changes you can make to an existing commitment:
280+
From the **Manage Commitments** view, you can also edit an existing commitment. There are two types of changes you can make to an existing commitment:
281281

282282
- You can add PTUs to the commitment.
283283
- You can change the renewal settings.
@@ -291,14 +291,14 @@ Adding PTUs to an existing commitment will allow you to create larger or more nu
291291
:::image type="content" source="../media/how-to/provisioned-onboarding/increase-commitment.png" alt-text="Screenshot of commitment purchase UI with an increase in the amount to commit value." lightbox="../media/how-to/provisioned-onboarding/increase-commitment.png":::
292292

293293
> [!IMPORTANT]
294-
> When you add PTUs to a commitment, they will be billed immediately, at a pro-rated amount from the current date to the end of the existing commitment term. Adding PTUs does not reset the commitment term.
294+
> When you add PTUs to a commitment, they will be billed immediately, at a pro-rated amount from the current date to the end of the existing commitment term. Adding PTUs doesn't reset the commitment term.
295295
296296
### Changing renewal settings
297297

298-
Commitment renewal settings can be changed at any time before the expiration date of your commitment. Reasons you might want to change the renewal settings include ending your use of provisioned throughput by setting the commitment to not autorenew, or to decrease usage of provisioned throughput by lowering the number of PTUs that will be committed in the next period.
298+
Commitment renewal settings can be changed at any time before the expiration date of your commitment. Reasons you might want to change the renewal settings include ending your use of provisioned throughput by setting the commitment to not autorenew, or to decrease usage of provisioned throughput by lowering the number of PTUs that will be committed in the next period.
299299

300300
> [!IMPORTANT]
301-
> If you allow a commitment to expire or decrease in size such that the deployments under the resource require more PTUs than you have in your resource commitment, you will receive hourly overage charges for any excess PTUs. For example, a resource that has deployments that total 500 PTUs and a commitment for 300 PTUs will generate hourly overage charges for 200 PTUs.
301+
> If you allow a commitment to expire or decrease in size such that the deployments under the resource require more PTUs than you have in your resource commitment, you will receive hourly overage charges for any excess PTUs. For example, a resource that has deployments that total 500 PTUs and a commitment for 300 PTUs will generate hourly overage charges for 200 PTUs.
302302
303303
## Monitor commitments and prevent unexpected billings
304304

@@ -319,7 +319,7 @@ To end use of provisioned throughput, and prevent hourly overage charges after c
319319

320320
**Move a commitment/deployment to a new resource in the same subscription/region**
321321

322-
It isn't possible in Azure OpenAI Studio to directly *move* a deployment or a commitment to a new resource. Instead, a new deployment needs to be created on the target resource and traffic moved to it. There will need to be a commitment purchased established on the new resource to accomplish this. Because commitments are charged up-front for a 30-day period, it's necessary to time this move with the expiration of the original commitment to minimize overlap with the new commitment and “double-billing” during the overlap.
322+
It isn't possible in Azure AI Foundry to directly *move* a deployment or a commitment to a new resource. Instead, a new deployment needs to be created on the target resource and traffic moved to it. There will need to be a commitment purchased established on the new resource to accomplish this. Because commitments are charged up-front for a 30-day period, it's necessary to time this move with the expiration of the original commitment to minimize overlap with the new commitment and “double-billing” during the overlap.
323323

324324
There are two approaches that can be taken to implement this transition.
325325

@@ -357,4 +357,4 @@ The same approaches apply in moving the commitment and deployment within the reg
357357

358358
### View and edit an existing resource
359359

360-
In Azure OpenAI Studio, select **Quota** > **Provisioned** > **Manage commitments** and select a resource with an existing commitment to view/change it.
360+
In Azure AI Foundry, select **Management center** > **Quota** > **Provisioned** > **Manage commitments** and select a resource with an existing commitment to view/change it.

articles/ai-services/openai/how-to/use-web-app.md

Lines changed: 2 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.service: azure-ai-openai
77
ms.topic: how-to
88
author: aahill
99
ms.author: aahi
10-
ms.date: 08/09/2024
10+
ms.date: 01/08/2025
1111
recommendations: false
1212
---
1313

@@ -19,7 +19,7 @@ Along with Azure AI Foundry portal, APIs, and SDKs, you can use the customizable
1919
* Conversation history and user feedback collection through Cosmos DB.
2020
* Authentication with role-based access control via Microsoft Entra ID.
2121
* Customization of the user interface, data sources, and features using environment variables (no-code via Azure portal).
22-
* Support for modifying the underlying web application source code as an open-source repository.
22+
* Sample source code for the web app is available on [GitHub](https://github.com/microsoft/sample-app-aoai-chatGPT). Source code is provided "as is" and as a sample only. Customers are responsible for all customization and implementation of their web apps.
2323

2424
You can deploy the app via the [Azure AI Foundry portal](/azure/ai-studio/tutorials/deploy-chat-web-app), the [Azure portal](https://portal.azure.com), or the Azure Developer CLI via your local machine [(instructions available at the repository here)](https://github.com/microsoft/sample-app-aoai-chatGPT). Depending on your deployment channel, you can preload a data source to chat with via the web application, but this can be changed after deployment.
2525

@@ -59,8 +59,6 @@ When you're customizing the app, we recommend:
5959

6060
- Updating the app settings for each of your deployed apps to use new API keys after you rotate keys for your Azure OpenAI or Azure AI Search resource.
6161

62-
Sample source code for the web app is available on [GitHub](https://github.com/microsoft/sample-app-aoai-chatGPT). Source code is provided "as is" and as a sample only. Customers are responsible for all customization and implementation of their web apps.
63-
6462
## Modifying the application user interface
6563

6664
The environment variables relevant to user interface customization are:

0 commit comments

Comments
 (0)