Skip to content

Commit be0dd76

Browse files
Merge pull request #234904 from PatrickFarley/openai-updates
quota update
2 parents 2df9b16 + 41b6190 commit be0dd76

File tree

1 file changed

+9
-1
lines changed

1 file changed

+9
-1
lines changed

articles/cognitive-services/openai/quotas-limits.md

Lines changed: 9 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ The following sections provide you with a quick guide to the quotas and limits t
2323
| Limit Name | Limit Value |
2424
|--|--|
2525
| OpenAI resources per region | 2 |
26-
| Requests per minute per model* | Davinci-models (002 and later): 120 <br> ChatGPT model (preview): 300 <br> GPT-4 models (preview): 12 <br> All other models: 300 |
26+
| Requests per minute per model* | Davinci-models (002 and later): 120 <br> ChatGPT model (preview): 300 <br> GPT-4 models (preview): 18 <br> All other models: 300 |
2727
| Tokens per minute per model* | Davinci-models (002 and later): 40,000 <br> ChatGPT model: 120,000 <br> All other models: 120,000 |
2828
| Max fine-tuned model deployments* | 2 |
2929
| Ability to deploy same model to multiple deployments | Not allowed |
@@ -37,6 +37,7 @@ The following sections provide you with a quick guide to the quotas and limits t
3737

3838
*The limits are subject to change. We anticipate that you will need higher limits as you move toward production and your solution scales. When you know your solution requirements, please reach out to us by applying for a quota increase here: <https://aka.ms/oai/quotaincrease>
3939

40+
4041
For information on max tokens for different models, consult the [models article](./concepts/models.md#model-summary-table-and-region-availability)
4142

4243
### General best practices to mitigate throttling during autoscaling
@@ -54,6 +55,13 @@ The next sections describe specific cases of adjusting quotas.
5455

5556
If you need to increase the limit, you can apply for a quota increase here: <https://aka.ms/oai/quotaincrease>
5657

58+
### How to request an increase to the number of resources per region
59+
60+
If you need to increase the number of resources, you can apply for a resource increase here: <https://aka.ms/oai/resourceincrease>
61+
62+
> [!NOTE]
63+
> Ensure that you thoroughly assess your current resource utilization, approaching its full capacity. Be aware that we will not grant additional resources if efficient usage of existing resources is not observed.
64+
5765
## Next steps
5866

5967
Learn more about the [underlying models that power Azure OpenAI](./concepts/models.md).

0 commit comments

Comments
 (0)