Commit 803bc08

Merge pull request #4027 from MicrosoftDocs/main
Merge main to live, 4 AM
2 parents 0f2c249 + 36d0d8a commit 803bc08

1 file changed: +20 -11 lines changed

articles/ai-services/openai/quotas-limits.md

Lines changed: 20 additions & 11 deletions
@@ -9,7 +9,7 @@ ms.custom:
   - ignite-2023
   - references_regions
 ms.topic: conceptual
-ms.date: 4/08/2025
+ms.date: 4/09/2025
 ms.author: mbullwin
 ---
 
@@ -86,7 +86,7 @@ The following sections provide you with a quick guide to the default quotas and
 >
 > This is particularly important for programmatic model deployment as this change in RPM/TPM ratio can result in accidental under allocation of quota if one is still assuming the 1:1000 ratio followed by older chat completion models.
 >
-> There is a known issue with the [quota/usages API](/rest/api/aiservices/accountmanagement/usages/list?view=rest-aiservices-accountmanagement-2024-06-01-preview&tabs=HTTP&preserve-view=true) where it assumes the old ratio applies to the new o1-series models. The API returns the correct base capacity number, but doesn't apply the correct ratio for the accurate calculation of TPM.
+> There's a known issue with the [quota/usages API](/rest/api/aiservices/accountmanagement/usages/list?view=rest-aiservices-accountmanagement-2024-06-01-preview&tabs=HTTP&preserve-view=true) where it assumes the old ratio applies to the new o1-series models. The API returns the correct base capacity number, but doesn't apply the correct ratio for the accurate calculation of TPM.
 
 ### `o-series` global standard
 
@@ -199,14 +199,18 @@ If your Azure subscription is linked to certain [offer types](https://azure.micr
 
 |Tier| Quota Limit in tokens per minute (TPM) |
 |---|:---|
-|Azure for Students, Free Trials | 1 K (all models) <br>Exception o-series & GPT 4.5 Preview: 0|
-| MSDN | GPT 3.5 Turbo Series: 30 K <br> GPT-4 series: 8 K <br>computer-use-preview: 30 K <br> gpt-4o-realtime-preview: 1 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 |
-|Pay-as-you-go | GPT 3.5 Turbo Series: 30 K <br> GPT-4 series: 8 K <br>computer-use-preview: 30 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 |
-| CSP Dev Test<sup>*</sup> | All models: 0 |
+|`Azure for Students` | 1 K (all models) <br>Exception o-series & GPT 4.5 Preview: 0|
+| `MSDN` | GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K <br>computer-use-preview: 8 K <br> gpt-4o-realtime-preview: 1 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 |
+|`Pay-as-you-go` | GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K <br>computer-use-preview: 30 K <br> o-series: 0 <br> GPT 4.5 Preview: 0 |
+| `Azure_MS-AZR-0111P` <br> `Azure_MS-AZR-0035P` <br> `Azure_MS-AZR-0025P` <br> `Azure_MS-AZR-0052P` <br>| GPT-4o-mini: 200 K <br> GPT 3.5 Turbo Series: 200 K <br> GPT-4 series: 50 K |
+| `CSP Integration Sandbox` <sup>*</sup> | All models: 0 |
+| `Lightweight trial`<br>`Free Trials`<br>`Azure Pass` | All models: 0 |
 
-<sup>*</sup>This only applies to a small number of dev/test CSP subscriptions. Use the query below to determine what `quotaId` is associated with your subscription.
 
-To determine the offer type that is associated with your subscription you can check your `quotaId`. If your `quotaId` is not listed in this table your subscription qualifies for default quota.
+
+<sup>*</sup>This only applies to a small number of legacy CSP sandbox subscriptions. Use the query below to determine what `quotaId` is associated with your subscription.
+
+To determine the offer type that is associated with your subscription you can check your `quotaId`. If your `quotaId` isn't listed in this table your subscription qualifies for default quota.
 
 # [REST](#tab/REST)
 
@@ -247,14 +251,19 @@ az rest --method GET --uri "https://management.azure.com/subscriptions/{sub-id}?
 }
 ```
 
-| Quota allocation | Subscription quota ID |
+| Quota allocation/Offer type | Subscription quota ID |
 |:---|:----|
 | Enterprise | `EnterpriseAgreement_2014-09-01` |
 | Pay-as-you-go | `PayAsYouGo_2014-09-01`|
 | MSDN | `MSDN_2014-09-01` |
-| CSP Dev/Test | `CSPDEVTEST_2018-05-01` |
+| CSP Integration Sandbox | `CSPDEVTEST_2018-05-01` |
 | Azure for Students | `AzureForStudents_2018-01-01` |
-| Free Trial | `FreeTrial_2014-09-01` |
+| Free Trial | `FreeTrial_2014-09-01` |
+| Azure Pass | `AzurePass_2014-09-01` |
+| Azure_MS-AZR-0111P | `AzureInOpen_2014-09-01` |
+| Azure_MS-AZR-0150P | `LightweightTrial_2016-09-01` |
+| Azure_MS-AZR-0035P <br> Azure_MS-AZR-0025P <br> Azure_MS-AZR-0052P <br>| `MPN_2014-09-01` |
+| Azure_MS-AZR-0023P <br> Azure_MS-AZR-0060P <br> Azure_MS-AZR-0148P <br> Azure_MS-AZR-0148G | `MSDNDevTest_2014-09-01`|
 | Default | Any quota ID not listed in this table |
 
 ### General best practices to remain within rate limits
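
The quota ID table as it stands after this commit (last hunk above) is effectively a lookup from `quotaId` to offer type, with a fallback to default quota for anything not listed. A minimal sketch of that lookup follows. The dictionary entries are copied from the added and unchanged rows of the table; the names `QUOTA_ID_TO_OFFER_TYPE` and `offer_type_for_quota_id` are hypothetical and not part of the documentation or any Azure SDK. Exact, case-sensitive matching is an assumption.

```python
# Hypothetical helper: maps a subscription quotaId to the offer type named in
# the post-commit table. Entries are copied from the diff; nothing is inferred.
QUOTA_ID_TO_OFFER_TYPE = {
    "EnterpriseAgreement_2014-09-01": "Enterprise",
    "PayAsYouGo_2014-09-01": "Pay-as-you-go",
    "MSDN_2014-09-01": "MSDN",
    "CSPDEVTEST_2018-05-01": "CSP Integration Sandbox",
    "AzureForStudents_2018-01-01": "Azure for Students",
    "FreeTrial_2014-09-01": "Free Trial",
    "AzurePass_2014-09-01": "Azure Pass",
    "AzureInOpen_2014-09-01": "Azure_MS-AZR-0111P",
    "LightweightTrial_2016-09-01": "Azure_MS-AZR-0150P",
    "MPN_2014-09-01": (
        "Azure_MS-AZR-0035P, Azure_MS-AZR-0025P, Azure_MS-AZR-0052P"
    ),
    "MSDNDevTest_2014-09-01": (
        "Azure_MS-AZR-0023P, Azure_MS-AZR-0060P, "
        "Azure_MS-AZR-0148P, Azure_MS-AZR-0148G"
    ),
}


def offer_type_for_quota_id(quota_id: str) -> str:
    """Return the offer type for a quotaId, or "Default" if it is not listed.

    Assumption: matching is exact and case-sensitive after trimming
    surrounding whitespace.
    """
    return QUOTA_ID_TO_OFFER_TYPE.get(quota_id.strip(), "Default")


if __name__ == "__main__":
    for qid in ("MSDN_2014-09-01", "CSPDEVTEST_2018-05-01", "SomethingElse"):
        print(f"{qid} -> {offer_type_for_quota_id(qid)}")
```

Keeping the mapping as plain data means a later change to the table, such as the rename from CSP Dev/Test to CSP Integration Sandbox in this commit, only touches one dictionary entry.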
