Commit 557f19d

Merge pull request #3645 from aahill/ptu-update-2
Ptu update 2
2 parents 0751955 + 46369a5 commit 557f19d

8 files changed: 145 additions and 150 deletions
articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 24 additions & 81 deletions
Large diffs are not rendered by default.

articles/ai-services/openai/how-to/fine-tuning-deploy.md

Lines changed: 3 additions & 3 deletions
@@ -6,7 +6,7 @@ manager: nitinme
 ms.service: azure-ai-openai
 ms.custom: build-2023, build-2023-dataai, devx-track-python, references_regions
 ms.topic: how-to
-ms.date: 02/24/2025
+ms.date: 03/31/2025
 author: mrbullwinkle
 ms.author: mbullwin
 ---
@@ -389,7 +389,7 @@ Global Standard fine-tuned deployments currently support structured outputs only
 - `gpt-4o-mini-2024-07-18`
 - `gpt-4o-2024-08-06`

-[Provisioned managed](./deployment-types.md#provisioned) fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md#what-do-the-provisioned-deployment-types-provide) for fine-tuned deployments. As part of public preview, provisioned managed deployments may be created regionally via the data-plane [REST API](../reference.md#data-plane-inference) version `2024-10-01` or newer. See below for examples.
+[Provisioned managed](./deployment-types.md#provisioned) fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md) for fine-tuned deployments. As part of public preview, provisioned managed deployments may be created regionally via the data-plane [REST API](../reference.md#data-plane-inference) version `2024-10-01` or newer. See below for examples.

 Provisioned Managed fine-tuned deployments currently support structured outputs only on GPT-4o.

@@ -423,7 +423,7 @@ curl -X PUT "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceG

 #### Scaling a fine-tuned model on Provisioned Managed

-To scale a fine-tuned provision managed deployment to increase or decrease PTU capacity, perform the same `PUT` REST API call as you did when [creating the deployment](#creating-a-provisioned-managed-deployment) and provide an updated `capacity` value for the `sku`. Keep in mind, provisioned deployments must scale in [minimum increments](../concepts/provisioned-throughput.md#how-much-throughput-per-ptu-you-get-for-each-model).
+To scale a fine-tuned provision managed deployment to increase or decrease PTU capacity, perform the same `PUT` REST API call as you did when [creating the deployment](#creating-a-provisioned-managed-deployment) and provide an updated `capacity` value for the `sku`. Keep in mind, provisioned deployments must scale in [minimum increments](../how-to/provisioned-throughput-onboarding.md#how-much-throughput-per-ptu-you-get-for-each-model).

 For example, to scale the model deployed in the previous section from 25 to 40 PTU, make another `PUT` call and increase the capacity:
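
The scaling step in the paragraph above can be sketched roughly as follows. This is a minimal illustration rather than the documented request: it assumes the creation request body is a JSON object with a `sku` object that holds `capacity`, as the text implies. The helper name `scaled_deployment_body`, the placeholder SKU and model names, and the `increment` value of 5 are all hypothetical; the real minimum increment for a given model comes from the linked onboarding article. The sketch only builds the body and does not send the `PUT` request.

```python
import copy
import json


def scaled_deployment_body(original_body: dict, new_capacity: int, increment: int = 1) -> dict:
    """Return a copy of the creation request body with sku.capacity updated.

    `increment` is a hypothetical stand-in for the model's minimum scaling
    increment; look up the real value before relying on this check.
    """
    current = original_body["sku"]["capacity"]
    if new_capacity <= 0:
        raise ValueError("capacity must be a positive number of PTUs")
    if (new_capacity - current) % increment != 0:
        raise ValueError(
            f"capacity change of {new_capacity - current} PTU is not a multiple of {increment}"
        )
    # Copy so the original creation body is left untouched.
    body = copy.deepcopy(original_body)
    body["sku"]["capacity"] = new_capacity
    return body


# Hypothetical creation body; the placeholder values are not taken from the article.
create_body = {
    "sku": {"name": "<SKU_NAME>", "capacity": 25},
    "properties": {"model": {"name": "<FINE_TUNED_MODEL>"}},
}

# Scale from 25 to 40 PTU, assuming an increment of 5 for illustration.
scale_body = scaled_deployment_body(create_body, 40, increment=5)
print(json.dumps(scale_body, indent=2))
```

The resulting body would be sent with the same `PUT` call used to create the deployment; everything except `sku.capacity` stays as it was.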

articles/ai-services/openai/how-to/provisioned-get-started.md

Lines changed: 6 additions & 6 deletions
@@ -6,13 +6,13 @@ manager: nitinme
 ms.service: azure-ai-openai
 ms.custom: openai
 ms.topic: how-to
-author: mrbullwinkle
-ms.author: mbullwin
-ms.date: 03/26/2025
+author: aahill
+ms.author: aahi
+ms.date: 03/31/2025
 recommendations: false
 ---

-# Get started using Provisioned Deployments on the Azure OpenAI Service
+# Get started using provisioned deployments on the Azure OpenAI Service

 The following guide walks you through key steps in creating a provisioned deployment with your Azure OpenAI Service resource. For more details on the concepts discussed here, see:
 * [Azure OpenAI Provisioned Onboarding Guide](./provisioned-throughput-onboarding.md)
@@ -51,7 +51,7 @@ Provisioned deployments are created via Azure OpenAI resource objects within Azu

 ## Create your provisioned deployment - capacity is available

-once you have verified your quota, you can create a deployment. To create a provisioned deployment, you can follow these steps; the choices described reflect the entries shown in the screenshot.
+Once you have verified your quota, you can create a deployment. To create a provisioned deployment, you can follow these steps; the choices described reflect the entries shown in the screenshot.

 :::image type="content" source="../media/provisioned/deployment-screen.png" alt-text="Screenshot of the Azure AI Foundry portal deployment page for a provisioned deployment." lightbox="../media/provisioned/deployment-screen.png":::

@@ -80,7 +80,7 @@ Once you have entered the deployment settings, click **Confirm Pricing** to cont
 If you are unsure of the costs, cancel the deployment and proceed once you understand the payment model and underlying costs for provisioned deployment. This step may prevent unexpected, high charges on your payment invoice. Resources to educate yourself include:

 * [Azure Pricing Portal](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/)
-* [Understanding the provisioned throughput purchase model](provisioned-throughput-onboarding.md#understanding-the-provisioned-throughput-purchase-model)
+* [Understanding the provisioned throughput costs](provisioned-throughput-onboarding.md)

 The image below shows the pricing confirmation you will see. The price shown is an example only.
