Commit 8a3e858

committed
include file
1 parent b4cb667 commit 8a3e858

4 files changed: 31 additions and 5 deletions

articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 6 additions & 3 deletions
@@ -14,9 +14,6 @@ recommendations: false

 The provisioned throughput capability allows you to specify the amount of throughput you require in a deployment. The service then allocates the necessary model processing capacity and ensures it's ready for you. Throughput is defined in terms of provisioned throughput units (PTU) which is a normalized way of representing the throughput for your deployment. Each model-version pair requires different amounts of PTU to deploy and provide different amounts of throughput per PTU.

-> [!NOTE]
-> On July 29th 2024, Microsoft switched to an hourly/reservation PTU offering that offers usability improvements. For more details, see the [PTU migration article](../provisioned-migration.md#whats-changing).
-
 ## What does the provisioned deployment type provide?

 - **Predictable performance:** stable max latency and throughput for uniform workloads.
@@ -39,6 +36,12 @@ An Azure OpenAI Deployment is a unit of management for a specific OpenAI Model.
 | Utilization | Provisioned-managed Utilization measure provided in Azure Monitor. |
 | Estimating size | Provided calculator in the studio & benchmarking script. |

+### Hourly/reservation commercial model
+
+On July 29th 2024, Microsoft switched to an hourly/reservation PTU offering that offers usability improvements. For more details, see the [PTU migration article](../provisioned-migration.md#whats-changing).
+
+[!INCLUDE [hourly-ptu-description](../includes/hourly-ptu-description.md)]
+
 ## What models and regions are available for provisioned throughput?

 [!INCLUDE [Provisioned](../includes/model-matrix/provisioned-models.md)]

articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md

Lines changed: 6 additions & 2 deletions
@@ -21,8 +21,12 @@ This article walks you through the process of onboarding to [Provisioned Through

 You should consider switching from pay-as-you-go to provisioned throughput when you have well-defined, predictable throughput requirements. Typically, this occurs when the application is ready for production or has already been deployed in production and there's an understanding of the expected traffic. This will allow users to accurately forecast the required capacity and avoid unexpected billing.

-> [!NOTE]
-> On July 29th 2024, Microsoft switched to an hourly/reservation PTU offering that offers usability improvements. For more details, see the [PTU migration article](../provisioned-migration.md#whats-changing).
+### Hourly/reservation commercial model
+
+On July 29th 2024, Microsoft switched to an hourly/reservation PTU offering that offers usability improvements. For more details, see the [PTU migration article](../provisioned-migration.md#whats-changing).
+
+[!INCLUDE [hourly-ptu-description](../includes/hourly-ptu-description.md)]
+

 ### Typical PTU scenarios

articles/ai-services/openai/includes/hourly-ptu-description.md

Lines changed: 17 additions & 0 deletions
@@ -0,0 +1,17 @@
+---
+manager: nitinme
+ms.service: azure-ai-openai
+ms.topic: include
+author: mrbullwinkle
+ms.author: aahi
+ms.date: 08/01/2024
+author: aahill
+---
+
+|Feature | Benefit|
+|---|---|
+|Hourly, uncommitted usage | Hourly payment option without a required commitment enables short-term deployment scenarios |
+|Term discounts via Azure Reservations | Azure reservations provide substantial discounts over the hourly rate for one month and one year terms, and provide flexible scopes that minimize administration and associated with today’s resource-bound commitments.|
+| Default provisioned-managed quota in many regions | Get started quickly in new regions without having to first request quota |
+| Flexible choice of payment model for existing provisioned customers | Customers with commitments can stay on the commitment model at least through the end of 2024, and can choose to migrate existing commitments to hourly/reservations via a self-service or managed process. |
+| Supports latest model generations | The hourly/reservation model will be required to deploy models released after June 28, 2024. |

articles/ai-services/openai/provisioned-migration.md

Lines changed: 2 additions & 0 deletions
@@ -36,6 +36,8 @@ The capabilities below are rolling out for the Provisioned Managed offering.

 ### New hourly/reservation commercial model

+[!INCLUDE [hourly-ptu-description](includes/hourly-ptu-description.md)]
+
 |Feature | Benefit|
 |---|---|
 |Hourly, uncommitted usage | Hourly payment option without a required commitment enables short-term deployment scenarios |
