Commit 1a3801b

update
1 parent 6ecd040 commit 1a3801b

1 file changed: +29 -17 lines changed

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 29 additions & 17 deletions
@@ -4,7 +4,7 @@ description: Quick reference, detailed description, and best practices on the qu
 author: mrbullwinkle
 ms.author: mbullwin
 manager: nitinme
-ms.date: 05/29/2025
+ms.date: 07/02/2025
 ms.service: azure-ai-openai
 ms.topic: conceptual
 ms.custom:
@@ -72,16 +72,27 @@ The following sections provide you with a quick guide to the default quotas and
 | `gpt-4.5` | Enterprise Tier | 200 K | 200 |
 | `gpt-4.5` | Default | 150 K | 150 |

-### GPT-4.1 series
+### GPT-4.1 series global standard

 | Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
 |---|---|:---:|:---:|
 | `gpt-4.1` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
 | `gpt-4.1` (2025-04-14) | Default | 1 M | 1 K |
-| `gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
-| `gpt-4.1-nano` (2025-04-14) | Default | 1 M | 1 K |
-| `gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 5 M | 5 K |
-| `gpt-4.1-mini` (2025-04-14) | Default | 1 M | 1 K |
+| `gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 150 M | 150 K |
+| `gpt-4.1-nano` (2025-04-14) | Default | 5 M | 5 K |
+| `gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 150 M | 150 K |
+| `gpt-4.1-mini` (2025-04-14) | Default | 5 M | 5 K |
+
+### GPT-4.1 series data zone standard
+
+| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
+|---|---|:---:|:---:|
+| `gpt-4.1` (2025-04-14) | Enterprise Tier | 2 M | 2 K |
+| `gpt-4.1` (2025-04-14) | Default | 300 K | 300 |
+| `gpt-4.1-nano` (2025-04-14) | Enterprise Tier | 50 M | 50 K |
+| `gpt-4.1-nano` (2025-04-14) | Default | 2 M | 2 K |
+| `gpt-4.1-mini` (2025-04-14) | Enterprise Tier | 50 M | 50 K |
+| `gpt-4.1-mini` (2025-04-14) | Default | 2 M | 2 K |

 ### GPT-4 Turbo

@@ -230,19 +241,20 @@ The Usage Limit determines the level of usage above which customers might see la
 > [!NOTE]
 > Usage tiers only apply to standard, data zone standard, and global standard deployment types. Usage tiers don't apply to global batch and provisioned throughput deployments.

-### GPT-4o global standard, data zone standard, & standard
+### Global standard, data zone standard, & standard

 |Model| Usage Tiers per month |
-|----|----|
-|`gpt-4o` | 12 Billion tokens |
-|`gpt-4o-mini` | 85 Billion tokens |
-
-### GPT-4 standard
-
-|Model| Usage Tiers per month|
-|---|---|
-| `gpt-4` + `gpt-4-32k` (all versions) | 6 Billion |
-
+|----|:----|
+| `gpt-4` + `gpt-4-32k` (all versions) | 6 Billion tokens |
+| `gpt-4o` | 12 Billion tokens |
+| `gpt-4o-mini` | 85 Billion tokens |
+| `o3-mini` | 50 Billion tokens |
+| `o1` | 4 Billion tokens |
+| `o4-mini` | 50 Billion tokens |
+| `o3` | 5 Billion tokens |
+| `gpt-4.1` | 30 Billion tokens |
+| `gpt-4.1-mini` | 150 Billion tokens |
+| `gpt-4.1-nano` | 550 Billion tokens |

 ## Other offer types

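As a reading aid for the consolidated usage-tier table added in the last hunk, here is a minimal Python sketch that checks a monthly token count against the listed tier. The dictionary values are copied from the added rows. The name `exceeds_usage_tier` and the `"gpt-4"` key (standing in for the combined `gpt-4` + `gpt-4-32k` row) are assumptions made for illustration and are not part of the commit or of any Azure API.

```python
# Monthly usage tiers in tokens, copied from the added rows of the last hunk.
USAGE_TIER_TOKENS = {
    "gpt-4": 6_000_000_000,  # table row covers `gpt-4` + `gpt-4-32k` (all versions)
    "gpt-4o": 12_000_000_000,
    "gpt-4o-mini": 85_000_000_000,
    "o3-mini": 50_000_000_000,
    "o1": 4_000_000_000,
    "o4-mini": 50_000_000_000,
    "o3": 5_000_000_000,
    "gpt-4.1": 30_000_000_000,
    "gpt-4.1-mini": 150_000_000_000,
    "gpt-4.1-nano": 550_000_000_000,
}


def exceeds_usage_tier(model: str, tokens_this_month: int) -> bool:
    """Hypothetical helper: True if monthly usage is above the listed tier."""
    return tokens_this_month > USAGE_TIER_TOKENS[model]
```

Per the note in the same hunk, a check like this would only be meaningful for standard, data zone standard, and global standard deployments.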