Skip to content

Commit 856b9e4

Browse files
Merge pull request #235863 from mrbullwinkle/mrb_04_25_2023_quota_update
[Azure OpenAI] update TPM for gpt-4 and gpt-4 32k
2 parents cdc55f8 + 28d48fc commit 856b9e4

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

articles/cognitive-services/openai/quotas-limits.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: openai
1010
ms.topic: conceptual
11-
ms.date: 04/24/2023
11+
ms.date: 04/25/2023
1212
ms.author: chrhoder
1313
---
1414

@@ -24,7 +24,7 @@ The following sections provide you with a quick guide to the quotas and limits t
2424
|--|--|
2525
| OpenAI resources per region per Azure subscription | 3 |
2626
| Requests per minute per model* | Davinci-models (002 and later): 120 <br> ChatGPT model (preview): 300 <br> GPT-4 models (preview): 18 <br> All other models: 300 |
27-
| Tokens per minute per model* | Davinci-models (002 and later): 40,000 <br> ChatGPT model: 120,000 <br> All other models: 120,000 |
27+
| Tokens per minute per model* | Davinci-models (002 and later): 40,000 <br> ChatGPT model: 120,000 <br> GPT-4 8k model: 10,000 <br> GPT-4 32k model: 32,000 <br> All other models: 120,000 |
2828
| Max fine-tuned model deployments* | 2 |
2929
| Ability to deploy same model to multiple deployments | Not allowed |
3030
| Total number of training jobs per resource | 100 |

0 commit comments

Comments
 (0)