Skip to content

Commit 66e49c5

Browse files
authored
Merge pull request #5340 from MicrosoftDocs/release-preview-ptu
[Out of Band Publish] release-preview-ptu -> main -- 06/04 - 01:00 PM PT
2 parents 924fdb8 + 5dcfc1d commit 66e49c5

16 files changed

+269
-169
lines changed

articles/ai-foundry/toc.yml

Lines changed: 15 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -595,12 +595,25 @@ items:
595595
- name: Plan and manage costs for AI Foundry resources
596596
href: how-to/costs-plan-manage.md
597597
displayName: pricing, budget, estimate
598-
- name: Manage quotas and limits for Foundry Models
599-
href: ../ai-foundry/model-inference/quotas-limits.md?context=/azure/ai-foundry/context/context
600598
- name: Manage quotas
601599
href: how-to/quota.md
602600
- name: Increase rate limit
603601
href: how-to/autoscale.md
602+
- name: Cost management for Foundry Models
603+
items:
604+
- name: Manage quotas and limits for Foundry Models
605+
href: ../ai-foundry/model-inference/quotas-limits.md?context=/azure/ai-foundry/context/context
606+
- name: What is the Provisioned Throughput offering (PTU)?
607+
href: ../ai-services/openai/concepts/provisioned-throughput.md?context=/azure/ai-foundry/context/context
608+
displayName: PTU, provisioned, provisioned throughput units
609+
- name: Understanding and calculating PTU costs
610+
href: ../ai-services/openai/how-to/provisioned-throughput-onboarding.md?context=/azure/ai-foundry/context/context
611+
displayName: PTU, provisioned, provisioned throughput units
612+
- name: Get started with Provisioned Deployments
613+
href: ../ai-services/openai/how-to/provisioned-get-started.md?context=/azure/ai-foundry/context/context
614+
displayName: PTU, provisioned, provisioned throughput units
615+
- name: Provisioned spillover
616+
href: ../ai-services/openai/how-to/spillover-traffic-management.md?context=/azure/ai-foundry/context/context
604617
- name: Security & Governance
605618
items:
606619
- name: Identity & access management

articles/ai-services/openai/concepts/gov-provisioned.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: challenp
66
ms.service: azure-ai-openai
77
ms.topic: how-to
88
ms.custom: references_regions, azuregovernment
9-
ms.date: 5/1/2025
9+
ms.date: 05/30/2025
1010
recommendations: false
1111
---
1212

@@ -84,7 +84,7 @@ In addition to the updates for the hourly payment model, new [Azure Reservations
8484

8585
#### Supported models on commitment payment model:
8686

87-
Only the following list of Azure OpenAI models are supported in Commitments. For onboarding any other models that aren't in the list below, or any newer models on provisioned throughput offering, refer to the [Azure OpenAI provisioned onboarding guide](../how-to/provisioned-throughput-onboarding.md) and [Azure Reservations for Azure OpenAI provisioned deployments](../how-to/provisioned-throughput-onboarding.md#azure-reservations-for-azure-openai-provisioned-deployments)
87+
Only the following list of Azure OpenAI models are supported in Commitments. For onboarding any other models that aren't in the list below, or any newer models on provisioned throughput offering, refer to the [Azure OpenAI provisioned onboarding guide](../how-to/provisioned-throughput-onboarding.md) and [Azure Reservations for Azure OpenAI provisioned deployments](../how-to/provisioned-throughput-onboarding.md#azure-reservations-for-azure-ai-foundry-provisioned-throughput)
8888

8989
|Supported models on Commitment plan |Versions|
9090
|-|-|

articles/ai-services/openai/concepts/provisioned-migration.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.service: azure-ai-openai
77
ms.custom:
88
- ignite-2024
99
ms.topic: how-to
10-
ms.date: 03/26/2025
10+
ms.date: 05/30/2025
1111
author: aahill
1212
ms.author: aahi
1313
recommendations: false
@@ -44,7 +44,7 @@ This article is intended for existing users of the provisioned throughput offeri
4444
| Default provisioned-managed quota in many regions | Get started quickly in new regions without having to first request quota. |
4545
| Flexible choice of payment model for existing provisioned customers | Customers with commitments can stay on the commitment model until the end of life of the currently supported models, and can choose to migrate existing commitments to hourly/reservations via managed process. We recommend migrating to hourly/ reservations to take advantage of term discounts and to work with the latest models. |
4646
| Supports latest model generations | The latest models are available only on hourly/ reservations in provisioned offering. |
47-
| Differentiated pricing | Greater flexibility and control of pricing and performance. In December 2024, we introduced differentiated hourly pricing across [global provisioned](../how-to/deployment-types.md#global-provisioned), [data zone provisioned](../how-to/deployment-types.md#data-zone-provisioned), and [provisioned](../how-to/deployment-types.md#provisioned) deployment types with the option to purchase [Azure Reservations](#new-azure-reservations-for-global-and-data-zone-provisioned-deployments) to support additional discounts. For more information on the hourly price for each provisioned deployment type, see the [Pricing details](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) page. |
47+
| Differentiated pricing | Greater flexibility and control of pricing and performance. In December 2024, we introduced differentiated hourly pricing across [global provisioned](../how-to/deployment-types.md#global-provisioned), [data zone provisioned](../how-to/deployment-types.md#data-zone-provisioned), and [regional provisioned](../how-to/deployment-types.md#regional-provisioned) deployment types with the option to purchase [Azure Reservations](#new-azure-reservations-for-global-and-data-zone-provisioned-deployments) to support additional discounts. For more information on the hourly price for each provisioned deployment type, see the [Pricing details](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) page. |
4848

4949
## Usability improvement details
5050

@@ -81,7 +81,7 @@ We also recommend that customers using commitments now create their deployments
8181
See the following links for more information. The guidance for reservations and commitments is the same:
8282

8383
* [Capacity Transparency](#self-service-migration)
84-
* [Sizing reservations](../how-to/provisioned-throughput-onboarding.md#important-sizing-azure-openai-provisioned-reservations)
84+
* [Sizing reservations](../how-to/provisioned-throughput-onboarding.md#important-sizing-azure-ai-foundry-provisioned-throughput-reservation)
8585

8686
## New hourly reservation payment model
8787

@@ -112,7 +112,7 @@ In addition to the updates for the hourly payment model, in December 2024 new [A
112112
- Commitments can't be canceled or altered during their term, except to add new PTUs.
113113

114114
#### Supported models on commitment payment model:
115-
Only the following list of Azure OpenAI models are supported in Commitments. For onboarding any other models that aren't in the list below, or any newer models on provisioned throughput offering, refer to the [Azure OpenAI provisioned onboarding guide](../how-to/provisioned-throughput-onboarding.md) and [Azure Reservations for Azure OpenAI provisioned deployments](../how-to/provisioned-throughput-onboarding.md#azure-reservations-for-azure-openai-provisioned-deployments)
115+
Only the following list of Azure OpenAI models are supported in Commitments. For onboarding any other models that aren't in the list below, or any newer models on provisioned throughput offering, refer to the [Azure OpenAI provisioned onboarding guide](../how-to/provisioned-throughput-onboarding.md) and [Azure Reservations for Azure OpenAI provisioned deployments](../how-to/provisioned-throughput-onboarding.md#azure-reservations-for-azure-ai-foundry-provisioned-throughput)
116116

117117
|Supported models on Commitment plan |Versions|
118118
|-|-|

0 commit comments

Comments
 (0)