You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/provisioned-migration.md
+22-22Lines changed: 22 additions & 22 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,9 +7,9 @@ ms.service: azure-ai-openai
7
7
ms.custom:
8
8
- ignite-2024
9
9
ms.topic: how-to
10
-
ms.date: 11/11/2024
11
-
author: mrbullwinkle
12
-
ms.author: mbullwin
10
+
ms.date: 01/08/2025
11
+
author: aahill
12
+
ms.author: aahi
13
13
recommendations: false
14
14
---
15
15
@@ -51,11 +51,11 @@ Provisioned quota granularity is changing from model-specific to model-independe
51
51
52
52
## Model-independent quota
53
53
54
-
Starting on August 12, 2024, existing customers' current, model-specific quota has been converted to model-independent. This happens automatically. No quota is lost in the transition. Existing quota limits are summed and assigned to a new model-independent quota item.
54
+
As of August 12, 2024, existing customers' current, model-specific quota has been converted to model-independent. This happens automatically. No quota is lost in the transition. Existing quota limits are summed and assigned to a new model-independent quota item.
The new model-independent quota shows up as a quota item named **Provisioned Managed Throughput Unit**, with model and version no longer included in the name. In the Studio Quota pane, expanding the quota item still shows all of the deployments that contribute to the quota item.
58
+
The new model-independent quota shows up as a quota item named **Provisioned Managed Throughput Unit**, with model and version no longer included in the name. In the Azure AI Foundry's quota pane, expanding the quota item still shows all of the deployments that contribute to the quota item.
59
59
60
60
### Default quota
61
61
@@ -67,15 +67,15 @@ For existing customers, if the region already contains a quota assignment, the q
67
67
68
68
Customers no longer obtain quota by contacting their sales teams. Instead, they use the self-service quota request form and specify the PTU-Managed quota type. The form is accessible from a link to the right of the quota item. The target is to respond to all quota requests within two business days.
69
69
70
-
The following quota screenshot shows model-independent quota being used by deployments of different types, as well as the link for requesting additional quota.
70
+
The following quota screenshot shows model-independent quota being used by deployments of different types, and the link for requesting additional quota.
71
71
72
72
:::image type="content" source="../media/provisioned/quota-request-type.png" alt-text="Screenshot of new request type UI for Azure OpenAI provisioned for requesting more quota." lightbox="../media/provisioned/quota-request-type.png":::
73
73
74
74
## Quota as a limit
75
75
76
-
Prior to the August update, Azure OpenAI Provisioned was only available to a few customers, and quota was allocated to maximize the ability for them to deploy and use it. With these changes, the process of acquiring quota is simplified for all users, and there is a greater likelihood of running into service capacity limitations when deployments are attempted. A new API and Studio experience are available to help users find regions where the subscription has quota and the service has capacity to support deployments of a desired model.
76
+
Prior to the August update, Azure OpenAI Provisioned was only available to a few customers, and quota was allocated to maximize the ability for them to deploy and use it. With these changes, the process of acquiring quota is simplified for all users, and there is a greater likelihood of running into service capacity limitations when deployments are attempted. A new API and portal experience are available to help users find regions where the subscription has quota and the service has capacity to support deployments of a desired model.
77
77
78
-
We also recommend that customers using commitments now create their deployments before creating or expanding commitments to cover them. This guarantees that capacity is available before creating a commitment and prevents over-purchase of the commitment. To support this, the restriction that prevented deployments from being created larger than their commitments has been removed. This new approach to quota, capacity availability and commitments matches what is provided under the hourly/reservation model, and the guidance to deploy before purchasing a commitment (or reservation, for the hourly model) is the same for both.
78
+
We also recommend that customers using commitments now create their deployments before creating or expanding commitments to cover them. This guarantees that capacity is available before creating a commitment and prevents over-purchase of the commitment. To support this, the restriction that prevented deployments from being created larger than their commitments has been removed. This new approach to quota, capacity availability, and commitments matches what is provided under the hourly/reservation model, and the guidance to deploy before purchasing a commitment (or reservation, for the hourly model) is the same for both.
79
79
80
80
See the following links for more information. The guidance for reservations and commitments is the same:
81
81
@@ -85,15 +85,15 @@ See the following links for more information. The guidance for reservations and
85
85
## New hourly reservation payment model
86
86
87
87
> [!NOTE]
88
-
> The following description of payment models does not apply to the older "Provisioned Classic (PTU-C)" offering. They only affect the Provisioned (aka Provisioned Managed) offering. Provisioned Classic continues to be governed by the unchanged monthly commitment payment model.
88
+
> The following description of payment models doesn't apply to the older "Provisioned Classic (PTU-C)" offering. They only affect the Provisioned (also known as Provisioned Managed) offering. Provisioned Classic continues to be governed by the unchanged monthly commitment payment model.
89
89
90
90
Microsoft has introduced a new "Hourly/reservation" payment model for provisioned deployments. This is in addition to the current **Commitment** payment model, which will continue to be supported at least through the end of 2024.
91
91
92
92
### Commitment payment model
93
93
94
94
- A regional, monthly commitment is required to use provisioned (longer terms available contractually).
95
95
96
-
- Commitments are bound to Azure OpenAI resources, which makes moving deployments across resources difficult.
96
+
- Commitments are bound to Azure OpenAI resources, which will make moving deployments across resources difficult.
97
97
98
98
- Commitments can't be canceled or altered during their term, except to add new PTUs.
99
99
@@ -149,9 +149,9 @@ Steps 1 and 2 are the same in all cases. The difference is whether a commitment
149
149
* The best practice is to create deployments first, and then to apply discounts. This is to guarantee that service. capacity is available to support your deployments prior to creating a term commitment for PTUs you cannot use.
150
150
151
151
> [!NOTE]
152
-
> When you follow best practices, you may receive hourly charges between the time you create the deployment and increase your discount (commitment or reservation).
152
+
> When you follow best practices, you might receive hourly charges between the time you create the deployment and increase your discount (commitment or reservation).
153
153
>
154
-
> For this reason, we recommend that you be prepared to increase your discount immediately following the deployment. The prerequisites for purchasing an Azure reservations are different than for commitments, and we recommend you validate them prior to deployment if you intend to use them to discount your deployment. For more information, see [Permissions to view and manage Azure reservations](/azure/cost-management-billing/reservations/view-reservations)
154
+
> For this reason, we recommend that you be prepared to increase your discount immediately following the deployment. The prerequisites for purchasing an Azure reservations are different than for commitments, and we recommend you validate them prior to deployment if you intend to use them to discount your deployment. For more information, see [Permissions to view and manage Azure reservations](/azure/cost-management-billing/reservations/view-reservations)
155
155
156
156
## Mapping deployments to discounting method
157
157
@@ -206,7 +206,7 @@ An alternative approach to self-service migration is to switch the reservation p
206
206
* There will be a short period of double-billing or hourly charges during the switchover from committed to hourly/reservation billing.
207
207
208
208
> [!IMPORTANT]
209
-
> Both self-service approaches generate some additional charges as the payment mode is switched from Committed to Hourly/Reservation. These are characteristics of the migration approaches and customers aren't credited for these charges. Customers may choose to use the managed migration approach described below to avoid them.
209
+
> Both self-service approaches generate some additional charges as the payment mode is switched from Committed to Hourly/Reservation. These are characteristics of the migration approaches and customers aren't credited for these charges. Customers can choose to use the managed migration approach described below to avoid them.
210
210
211
211
### Managed migration
212
212
@@ -233,7 +233,7 @@ Customers must reach out to their account teams to schedule a managed migration.
233
233
234
234
## Managing Provisioned Throughput Commitments
235
235
236
-
Provisioned throughput commitments are created and managed from the**Manage Commitments**menu in Azure OpenAI Studio. You can navigate to this view by selecting **Manage Commitments** from the Quota menu:
236
+
Provisioned throughput commitments are created and managed by selecting**Management center** in the Azure AI Foundry portal's navigation menu > **Quota** > **Manage Commitments**.
237
237
238
238
:::image type="content" source="../media/how-to/provisioned-onboarding/notifications.png" alt-text="Screenshot of commitment purchase UI with notifications." lightbox="../media/how-to/provisioned-onboarding/notifications.png":::
239
239
@@ -247,7 +247,7 @@ The following sections will take you through these tasks.
247
247
248
248
## Purchase a Provisioned Throughput Commitment
249
249
250
-
With your commitment plan ready, the next step is to create the commitments. Commitments are created manually via Azure OpenAI Studio and require the user creating the commitment to have either the [Contributor or Cognitive Services Contributor role](../how-to/role-based-access-control.md) at the subscription level.
250
+
With your commitment plan ready, the next step is to create the commitments. Commitments are created manually via the Azure AI Foundry and require the user creating the commitment to have either the [Contributor or Cognitive Services Contributor role](../how-to/role-based-access-control.md) at the subscription level.
251
251
252
252
For each new commitment you need to create, follow these steps:
253
253
@@ -273,11 +273,11 @@ For each new commitment you need to create, follow these steps:
273
273
:::image type="content" source="../media/how-to/provisioned-onboarding/commitment-tier.png" alt-text="Screenshot of commitment purchase UI." lightbox="../media/how-to/provisioned-onboarding/commitment-tier.png":::
274
274
275
275
> [!IMPORTANT]
276
-
> A new commitment is billed up-front for the entire term. If the renewal settings are set to auto-renew, then you will be billed again on each renewal date based on the renewal settings.
276
+
> A new commitment is billed up-front for the entire term. If the renewal settings are set to auto-renew, then you will be billed again on each renewal date based on the renewal settings.
277
277
278
278
### Edit an existing Provisioned Throughput commitment
279
279
280
-
From the Manage Commitments view, you can also edit an existing commitment. There are two types of changes you can make to an existing commitment:
280
+
From the **Manage Commitments** view, you can also edit an existing commitment. There are two types of changes you can make to an existing commitment:
281
281
282
282
- You can add PTUs to the commitment.
283
283
- You can change the renewal settings.
@@ -291,14 +291,14 @@ Adding PTUs to an existing commitment will allow you to create larger or more nu
291
291
:::image type="content" source="../media/how-to/provisioned-onboarding/increase-commitment.png" alt-text="Screenshot of commitment purchase UI with an increase in the amount to commit value." lightbox="../media/how-to/provisioned-onboarding/increase-commitment.png":::
292
292
293
293
> [!IMPORTANT]
294
-
> When you add PTUs to a commitment, they will be billed immediately, at a pro-rated amount from the current date to the end of the existing commitment term. Adding PTUs does not reset the commitment term.
294
+
> When you add PTUs to a commitment, they will be billed immediately, at a pro-rated amount from the current date to the end of the existing commitment term. Adding PTUs doesn't reset the commitment term.
295
295
296
296
### Changing renewal settings
297
297
298
-
Commitment renewal settings can be changed at any time before the expiration date of your commitment. Reasons you might want to change the renewal settings include ending your use of provisioned throughput by setting the commitment to not autorenew, or to decrease usage of provisioned throughput by lowering the number of PTUs that will be committed in the next period.
298
+
Commitment renewal settings can be changed at any time before the expiration date of your commitment. Reasons you might want to change the renewal settings include ending your use of provisioned throughput by setting the commitment to not autorenew, or to decrease usage of provisioned throughput by lowering the number of PTUs that will be committed in the next period.
299
299
300
300
> [!IMPORTANT]
301
-
> If you allow a commitment to expire or decrease in size such that the deployments under the resource require more PTUs than you have in your resource commitment, you will receive hourly overage charges for any excess PTUs. For example, a resource that has deployments that total 500 PTUs and a commitment for 300 PTUs will generate hourly overage charges for 200 PTUs.
301
+
> If you allow a commitment to expire or decrease in size such that the deployments under the resource require more PTUs than you have in your resource commitment, you will receive hourly overage charges for any excess PTUs. For example, a resource that has deployments that total 500 PTUs and a commitment for 300 PTUs will generate hourly overage charges for 200 PTUs.
302
302
303
303
## Monitor commitments and prevent unexpected billings
304
304
@@ -319,7 +319,7 @@ To end use of provisioned throughput, and prevent hourly overage charges after c
319
319
320
320
**Move a commitment/deployment to a new resource in the same subscription/region**
321
321
322
-
It isn't possible in Azure OpenAI Studio to directly *move* a deployment or a commitment to a new resource. Instead, a new deployment needs to be created on the target resource and traffic moved to it. There will need to be a commitment purchased established on the new resource to accomplish this. Because commitments are charged up-front for a 30-day period, it's necessary to time this move with the expiration of the original commitment to minimize overlap with the new commitment and “double-billing” during the overlap.
322
+
It isn't possible in Azure AI Foundry to directly *move* a deployment or a commitment to a new resource. Instead, a new deployment needs to be created on the target resource and traffic moved to it. There will need to be a commitment purchased established on the new resource to accomplish this. Because commitments are charged up-front for a 30-day period, it's necessary to time this move with the expiration of the original commitment to minimize overlap with the new commitment and “double-billing” during the overlap.
323
323
324
324
There are two approaches that can be taken to implement this transition.
325
325
@@ -357,4 +357,4 @@ The same approaches apply in moving the commitment and deployment within the reg
357
357
358
358
### View and edit an existing resource
359
359
360
-
In Azure OpenAI Studio, select **Quota** > **Provisioned** > **Manage commitments** and select a resource with an existing commitment to view/change it.
360
+
In Azure AI Foundry, select**Management center** >**Quota** > **Provisioned** > **Manage commitments** and select a resource with an existing commitment to view/change it.
Copy file name to clipboardExpand all lines: articles/ai-services/openai/how-to/use-web-app.md
+2-4Lines changed: 2 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ ms.service: azure-ai-openai
7
7
ms.topic: how-to
8
8
author: aahill
9
9
ms.author: aahi
10
-
ms.date: 08/09/2024
10
+
ms.date: 01/08/2025
11
11
recommendations: false
12
12
---
13
13
@@ -19,7 +19,7 @@ Along with Azure AI Foundry portal, APIs, and SDKs, you can use the customizable
19
19
* Conversation history and user feedback collection through Cosmos DB.
20
20
* Authentication with role-based access control via Microsoft Entra ID.
21
21
* Customization of the user interface, data sources, and features using environment variables (no-code via Azure portal).
22
-
*Support for modifying the underlying web application source code as an open-source repository.
22
+
*Sample source code for the web app is available on [GitHub](https://github.com/microsoft/sample-app-aoai-chatGPT). Source code is provided "as is" and as a sample only. Customers are responsible for all customization and implementation of their web apps.
23
23
24
24
You can deploy the app via the [Azure AI Foundry portal](/azure/ai-studio/tutorials/deploy-chat-web-app), the [Azure portal](https://portal.azure.com), or the Azure Developer CLI via your local machine [(instructions available at the repository here)](https://github.com/microsoft/sample-app-aoai-chatGPT). Depending on your deployment channel, you can preload a data source to chat with via the web application, but this can be changed after deployment.
25
25
@@ -59,8 +59,6 @@ When you're customizing the app, we recommend:
59
59
60
60
- Updating the app settings for each of your deployed apps to use new API keys after you rotate keys for your Azure OpenAI or Azure AI Search resource.
61
61
62
-
Sample source code for the web app is available on [GitHub](https://github.com/microsoft/sample-app-aoai-chatGPT). Source code is provided "as is" and as a sample only. Customers are responsible for all customization and implementation of their web apps.
63
-
64
62
## Modifying the application user interface
65
63
66
64
The environment variables relevant to user interface customization are:
0 commit comments