Skip to content

Commit a811a46

Browse files
committed
make heading more readable
1 parent 2f19883 commit a811a46

File tree

1 file changed

+18
-18
lines changed

1 file changed

+18
-18
lines changed

articles/ai-services/openai/concepts/provisioned-throughput.md

Lines changed: 18 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -163,7 +163,7 @@ For provisioned deployments, we use a variation of the leaky bucket algorithm to
163163
The number of concurrent calls you can achieve depends on each call's shape (prompt size, `max_tokens` parameter, etc.). The service continues to accept calls until the utilization reaches 100%. To determine the approximate number of concurrent calls, you can model out the maximum requests per minute for a particular call shape in the [capacity calculator](https://ai.azure.com/resource/calculator). If the system generates less than the number of output tokens set for the `max_tokens` parameter, then the provisioned deployment will accept more requests.
164164

165165

166-
## Foundry Models with provisioned throughput capability
166+
## Provisioned throughput capability for Models Sold Directly by Azure
167167

168168
This section lists Foundry Models that support the provisioned throughput capability. You can use your PTU quota and PTU reservation across the models shown in the table.
169169

@@ -179,23 +179,23 @@ The following points are some important takeaways from the table:
179179

180180
- Spillover is an optional capability that manages traffic fluctuations on provisioned deployments. For more information on spillover, see [Manage traffic with spillover for provisioned deployments (Preview)](../how-to/spillover-traffic-management.md).
181181

182-
| Model Family | Model name | Global provisioned | Data zone provisioned | Regional provisioned | Spillover feature |
183-
|----------------|-----------------|--------------------|-----------------------|----------------------|-------------------|
184-
| **Azure OpenAI** | Gpt4.1 |||||
185-
| | Gpt 4.1 mini |||||
186-
| | Gpt 4.1 nano |||||
187-
| | Gpt 4o |||||
188-
| | Gpt 4o mini |||||
189-
| | Gpt 3.5 Turbo |||||
190-
| | o1 |||||
191-
| | O3 mini |||||
192-
| | O4 mini |||||
193-
| **Azure DeepSeek** | DeepSeek-R1 || | | |
194-
| | DeepSeek-V3-0324|| | | |
195-
| | MAI-DeepSeek-R1 || | | |
196-
197-
198-
### Available regions for models sold directly by Azure with provisioned throughput capability
182+
| Model Family | Model name | Global provisioned | Data zone provisioned | Regional provisioned | Spillover feature |
183+
|--------------------|------------------|--------------------|-----------------------|----------------------|-------------------|
184+
| **Azure OpenAI** | Gpt4.1 || || |
185+
| | Gpt 4.1 mini || || |
186+
| | Gpt 4.1 nano || || |
187+
| | Gpt 4o || || |
188+
| | Gpt 4o mini || || |
189+
| | Gpt 3.5 Turbo || || |
190+
| | o1 || || |
191+
| | O3 mini || || |
192+
| | O4 mini || || |
193+
| **Azure DeepSeek** | DeepSeek-R1 || | | |
194+
| | DeepSeek-V3-0324 || | | |
195+
196+
197+
198+
### Region availablilty for provisioned throughput capability
199199

200200
# [Global Provisioned Throughput](#tab/global-ptum)
201201

0 commit comments

Comments
 (0)