Commit a4882d9

committed
quota update
1 parent 2da2186 commit a4882d9

1 file changed: +10 -2 lines changed

articles/cognitive-services/openai/quotas-limits.md

Lines changed: 10 additions & 2 deletions
@@ -23,7 +23,7 @@ The following sections provide you with a quick guide to the quotas and limits t
 | Limit Name | Limit Value |
 |--|--|
 | OpenAI resources per region | 2 |
-| Requests per minute per model* | Davinci-models (002 and later): 120 <br> ChatGPT model (preview): 300 <br> GPT-4 models (preview): 12 <br> All other models: 300 |
+| Requests per minute per model* | Davinci-models (002 and later): 120 <br> ChatGPT model (preview): 300 <br> GPT-4 models (preview): 18 <br> All other models: 300 |
 | Tokens per minute per model* | Davinci-models (002 and later): 40,000 <br> ChatGPT model: 120,000 <br> All other models: 120,000 |
 | Max fine-tuned model deployments* | 2 |
 | Ability to deploy same model to multiple deployments | Not allowed |
@@ -35,7 +35,8 @@ The following sections provide you with a quick guide to the quotas and limits t
 | Max training job time (job will fail if exceeded) | 720 hours |
 | Max training job size (tokens in training file) x (# of epochs) | 2 Billion |
 
-*The limits are subject to change. We anticipate that you will need higher limits as you move toward production and your solution scales. When you know your solution requirements, please reach out to us by applying for a quota increase here: <https://aka.ms/oai/quotaincrease>
+*The limits are subject to change. We anticipate that you will need higher limits as you move toward production and your solution scales. When you know your solution requirements, please reach out to us by applying for a quota increase here: https://aka.ms/oai/quotaincrease
+
 
 For information on max tokens for different models, consult the [models article](./concepts/models.md#model-summary-table-and-region-availability)
 
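Reviewer note: the "Max training job size" row in this hunk defines the limit as (tokens in training file) x (# of epochs), capped at 2 billion. The following is a minimal sketch of that arithmetic as a pre-submission check. The function and constant names are hypothetical, and treating the cap as inclusive is an assumption, since the table does not say whether exactly 2 billion is allowed.

```python
# Hypothetical helper names; only the formula and the 2 billion cap come from the table.
MAX_TRAINING_JOB_SIZE = 2_000_000_000  # "2 Billion" from the limits table


def training_job_size(tokens_in_file: int, epochs: int) -> int:
    """Return (tokens in training file) x (# of epochs)."""
    if tokens_in_file < 0 or epochs < 1:
        raise ValueError("tokens_in_file must be >= 0 and epochs must be >= 1")
    return tokens_in_file * epochs


def fits_training_limit(tokens_in_file: int, epochs: int,
                        limit: int = MAX_TRAINING_JOB_SIZE) -> bool:
    """Check the job size against the cap (assumed inclusive)."""
    return training_job_size(tokens_in_file, epochs) <= limit
```

For instance, a 500-million-token file trained for 4 epochs sits exactly at the cap, so under the inclusive assumption it passes, while a fifth epoch would exceed it.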
@@ -54,6 +55,13 @@ The next sections describe specific cases of adjusting quotas.
 
 If you need to increase the limit, you can apply for a quota increase here: <https://aka.ms/oai/quotaincrease>
 
+### How to request an increase to the number of resources per region
+
+If you need to increase the number of resources, you can apply for a resource increase here: <https://aka.ms/oai/resourceincrease>
+
+> [!NOTE]
+> Ensure that you thoroughly assess your current resource utilization, approaching its full capacity. Be aware that we will not grant additional resources if efficient usage of existing resources is not observed.
+
 ## Next steps
 
 Learn more about the [underlying models that power Azure OpenAI](./concepts/models.md).
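
Reviewer note: since this commit raises the GPT-4 (preview) requests-per-minute limit from 12 to 18, callers that throttle on the client side may want to update their budget. Below is a minimal sketch of a sliding-window limiter, assuming the service counts requests over a rolling 60-second window (the document does not say how the limit is enforced). The class name and the dictionary keys are hypothetical; only the numeric limits come from the table in this diff.

```python
import time
from collections import deque

# Limits taken from the updated table; the keys are hypothetical labels.
REQUESTS_PER_MINUTE = {
    "davinci-002-and-later": 120,
    "chatgpt-preview": 300,
    "gpt-4-preview": 18,
    "other": 300,
}


class SlidingWindowLimiter:
    """Allow at most max_calls acquisitions in any window of `period` seconds."""

    def __init__(self, max_calls, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        if max_calls < 1:
            raise ValueError("max_calls must be at least 1")
        self.max_calls = max_calls
        self.period = period
        self._clock = clock
        self._sleep = sleep
        self._calls = deque()  # timestamps of calls still inside the window

    def wait_time(self):
        """Seconds to wait before the next call is allowed (0.0 if none)."""
        now = self._clock()
        # Drop timestamps that have aged out of the window.
        while self._calls and now - self._calls[0] >= self.period:
            self._calls.popleft()
        if len(self._calls) < self.max_calls:
            return 0.0
        return self.period - (now - self._calls[0])

    def acquire(self):
        """Block until a call is allowed, then record it."""
        while True:
            delay = self.wait_time()
            if delay <= 0:
                break
            self._sleep(delay)
        self._calls.append(self._clock())
```

A caller would create `SlidingWindowLimiter(REQUESTS_PER_MINUTE["gpt-4-preview"])` once and call `acquire()` before each request. The injectable `clock` and `sleep` parameters exist only to make the limiter testable without real delays; a client-side limiter like this reduces, but does not rule out, throttling responses from the service.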
