Skip to content

Commit 3101b1d

Browse files
committed
Learn Editor: Update provisioned-throughput-onboarding.md
1 parent d42bd7f commit 3101b1d

File tree

1 file changed

+5
-3
lines changed

1 file changed

+5
-3
lines changed

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ This article walks you through the process of onboarding to [Provisioned Through
1616

1717
## When to use provisioned throughput units (PTU)
1818

19-
You should consider switching from pay-as-you-go to provisioned throughput when you have well-defined, predictable throughput requirements. Typically, this occurs when the application is ready for production or has already been deployed in production and there's an understanding of the expected traffic. This allows users to accurately forecast the required capacity and avoid unexpected billing.
19+
You should consider switching from standard deployments to provisioned deployments when you have well-defined, predictable throughput requirements and a need for latency predictability . Typically, this occurs when the application is ready for production or has already been deployed in production and there's an understanding of the expected traffic. This allows users to accurately forecast the required capacity and avoid unexpected billing.
2020

2121
### Typical PTU scenarios
2222

@@ -29,9 +29,11 @@ You should consider switching from pay-as-you-go to provisioned throughput when
2929
3030
## Sizing and estimation: provisioned and global provisioned
3131

32-
Determining the right amount of provisioned throughput, or PTUs, you require for your workload is an essential step to optimizing performance and cost. This section describes how to use the Azure OpenAI capacity planning tool. The tool provides you with an estimate of the required PTU to meet the needs of your workload.
32+
Determining the right amount of provisioned throughput, or PTUs, you require for your workload is an essential step to optimizing performance and cost. If you are not familiar with the different approaches available to estimate system level throughput, review the system level throughput estimation recommendations in our [performance and latency documentation](./latency.md). This section describes how to use the Azure OpenAI capacity planning tooling available. The tool provides you with an estimate of the required PTU to meet the needs of your workload.
3333

34-
### Estimate provisioned throughput and cost
34+
### Estimate provisioned throughput units and cost
35+
36+
There are two capacity planning options available to estimate
3537

3638
To get a quick estimate for your workload, open the capacity planner in the [Azure AI Studio](https://ai.azure.com). The capacity calculator is under **Shared resources** > **Model Quota** > **Azure OpenAI Provisioned**.
3739

0 commit comments

Comments
 (0)