Skip to content

Commit d46c5f9

Browse files
Merge pull request #284043 from mrbullwinkle/mrb_08_08_2024_usage_tier
[Azure OpenAI] Usage Tier Updates
2 parents f084352 + 58de85e commit d46c5f9

File tree

1 file changed

+12
-2
lines changed

1 file changed

+12
-2
lines changed

articles/ai-services/openai/quotas-limits.md

Lines changed: 12 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 08/06/2024
13+
ms.date: 08/08/2024
1414
ms.author: mbullwin
1515
---
1616

@@ -91,13 +91,23 @@ Global Standard deployments use Azure's global infrastructure, dynamically routi
9191

9292
The Usage Limit determines the level of usage above which customers might see larger variability in response latency. A customer’s usage is defined per model and is the total tokens consumed across all deployments in all subscriptions in all regions for a given tenant.
9393

94+
> [!NOTE]
95+
> Usage tiers only apply to standard and global standard deployment types. Usage tiers do not apply to global batch deployments.
96+
9497
#### GPT-4o global standard & standard
9598

9699
|Model| Usage Tiers per month |
97100
|----|----|
98-
|`gpt-4o` |1.5 Billion tokens |
101+
|`gpt-4o` | 8 Billion tokens |
99102
|`gpt-4o-mini` | 45 Billion tokens |
100103

104+
#### GPT-4 standard
105+
106+
|Model| Usage Tiers per month|
107+
|---|---|
108+
| `gpt-4` + `gpt-4-32k` (all versions) | 4 Billion |
109+
110+
101111
## Other offer types
102112

103113
If your Azure subscription is linked to certain [offer types](https://azure.microsoft.com/support/legal/offer-details/) your max quota values are lower than the values indicated in the above tables.

0 commit comments

Comments
 (0)