Skip to content

Commit 48c4e08

Browse files
committed
Learn Editor: Update provisioned-throughput-onboarding.md
1 parent 50a0c2a commit 48c4e08

File tree

1 file changed

+6
-6
lines changed

1 file changed

+6
-6
lines changed

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -33,9 +33,9 @@ Determining the right amount of provisioned throughput, or PTUs, you require for
3333

3434
### Estimate provisioned throughput units and cost
3535

36-
To get a quick estimate for your workload using input and output TPM, leverage the built-in capacity planner in the deployment details section of the deployment dialogue screen. The built-in capacity planner is under **Shared resources** > **Deployments** where you can select to deploy a base model, choose your target model, and select any of the Provisioned deployment type options. When configuring your Provisioned deployment, leverage the deployment details section to estimate your PTU capacity using input and output TPM data. For more information on how to identify and estimate TPM data, review the recommendations in our [performance and latency documentation](./latency.md).
36+
To get a quick estimate for your workload using input and output TPM, leverage the built-in capacity planner in the deployment details section of the deployment dialogue screen. The built-in capacity planner is part of the deployment workflow to help streamline the sizing and allocation of quota to a PTU deployment for a given workload. For more information on how to identify and estimate TPM data, review the recommendations in our [performance and latency documentation](./latency.md).
3737

38-
After filling out the input and output TPM data in the capacity planner, select the **Calculate** button to view your recommended PTU capacity information.
38+
After filling out the input and output TPM data in the built-in capacity calculator, select the **Calculate** button to view your PTU allocation recommendation.
3939

4040
![User's image](media/provisioned-throughput-onboarding/image.png)
4141

@@ -60,15 +60,15 @@ The values in the output column are the estimated value of PTU units required fo
6060
> [!NOTE]
6161
> The capacity calculators provide an estimate based on simple input criteria. The most accurate way to determine your capacity is to benchmark a deployment with a representational workload for your use case.
6262
63-
## Understanding the Provisioned Throughput Purchase Model
63+
## Understanding the provisioned throughput purchase model
6464

6565
Azure OpenAI Provisioned and Global Provisioned are purchased on-demand at an hourly basis based on the number of deployed PTUs, with substantial term discount available via the purchase of Azure Reservations.
6666

6767
The hourly model is useful for short-term deployment needs, such as validating new models or acquiring capacity for a hackathon.  However, the discounts provided by the Azure Reservation for Azure OpenAI Provisioned and Global Provisioned are considerable and most customers with consistent long-term usage will find a reserved model to be a better value proposition.
6868

6969
> [!NOTE]
7070
> Azure OpenAI Provisioned customers onboarded prior to the August self-service update use a purchase model called the Commitment model. These customers can continue to use this older purchase model alongside the Hourly/reservation purchase model. The Commitment model is not available for new customers. For details on the Commitment purchase model and options for coexistence and migration, please see the [Azure OpenAI Provisioned August Update](../concepts/provisioned-migration.md).
71-
## Hourly Usage 
71+
## Hourly usage
7272

7373
Provisioned and Global Provisioned deployments are charged an hourly rate ($/PTU/hr) on the number of PTUs that have been deployed.  For example, a 300 PTU deployment will be charged the hourly rate times 300.  All Azure OpenAI pricing is available in the Azure Pricing Calculator.
7474

@@ -88,7 +88,7 @@ Customers that require long-term usage of provisioned and global provisioned dep
8888
> * Having unused provisioned quota (PTUs) does not guarentee that capacity will be available to support increasing the size of the deployment when required. Quota limits the maximum number of PTUs that can be deployed, but it is not a capacity guarantee. Provisioned capacity for each region and modal dynamically changes throughout the day and might not be available when required. As a result, it is recommended to maintain a permanant deployment to cover your traffic needs (paid for via a reservation).
8989
> * Charges for deployments on a deleted resource will continue until the resource is purged. To prevent this, delete a resource’s deployment before deleting the resource. For more information, see [Recover or purge deleted Azure AI services resources](../../recover-purge-resources.md).
9090
91-
## Azure Reservations for Azure OpenAI Provisioned Deployments
91+
## Azure Reservations for Azure OpenAI provisioned deployments
9292

9393
Discounts on top of the hourly usage price can be obtained by purchasing an Azure Reservation for Azure OpenAI Provisioned and Global Provisioned. An Azure Reservation is a term-discounting mechanism shared by many Azure products. For example, Compute and Cosmos DB. For Azure OpenAI Provisioned and Global Provisioned, the reservation provides a discount for committing to payment for fixed number of PTUs for a one-month or one-year period. 
9494

@@ -115,7 +115,7 @@ Discounts on top of the hourly usage price can be obtained by purchasing an Azur
115115
>
116116
> * The Azure role and tenant policy requirements to purchase a reservation are different than those required to create a deployment or Azure OpenAI resource. Verify authorization to purchase reservations in advance of needing to do so. See Azure OpenAI [Provisioned reservation documentation](https://aka.ms/oai/docs/ptum-reservations) for more details.
117117
118-
## Important: Sizing Azure OpenAI Provisioned Reservations
118+
## Important: sizing Azure OpenAI provisioned reservations
119119

120120
The PTU amounts in reservation purchases are independent of PTUs allocated in quota or used in deployments. It is possible to purchase a reservation for more PTUs than you have in quota, or can deploy for the desired region, model, or version. Credits for over-purchasing a reservation are limited, and customers must take steps to ensure they maintain their reservation sizes in line with their deployed PTUs.
121121

0 commit comments

Comments
 (0)