Skip to content

Commit d7bafa3

Browse files
authored
Merge pull request #1689 from MicrosoftDocs/main
11/22 11:00 AM IST Publish
2 parents 567d1b1 + 641238b commit d7bafa3

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

articles/ai-services/openai/quotas-limits.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ The following sections provide you with a quick guide to the default quotas and
2929
| Default DALL-E 3 quota limits| 2 capacity units (6 requests per minute)|
3030
| Default Whisper quota limits | 3 requests per minute |
3131
| Maximum prompt tokens per request | Varies per model. For more information, see [Azure OpenAI Service models](./concepts/models.md)|
32-
| Max Standard deployments per resource | 32 |
32+
| Max Standard deployments per resource | 32 |
3333
| Max fine-tuned model deployments | 5 |
3434
| Total number of training jobs per resource | 100 |
3535
| Max simultaneous running training jobs per resource | 1 |
@@ -46,9 +46,9 @@ The following sections provide you with a quick guide to the default quotas and
4646
| Maximum number of Provisioned throughput units per deployment | 100,000 |
4747
| Max files per Assistant/thread | 10,000 when using the API or AI Studio. 20 when using Azure OpenAI Studio.|
4848
| Max file size for Assistants & fine-tuning | 512 MB |
49-
| Max size for all uploaded files for Assistants |100 GB |
49+
| Max size for all uploaded files for Assistants |100 GB |
5050
| Assistants token limit | 2,000,000 token limit |
51-
| GPT-4o max images per request (# of images in the messages array/conversation history) | 10 |
51+
| GPT-4o max images per request (# of images in the messages array/conversation history) | 50 |
5252
| GPT-4 `vision-preview` & GPT-4 `turbo-2024-04-09` default max tokens | 16 <br><br> Increase the `max_tokens` parameter value to avoid truncated responses. GPT-4o max tokens defaults to 4096. |
5353
| Max number of custom headers in API requests<sup>1</sup> | 10 |
5454
| Max number requests per minute<br/><br/>Current rate limits for real time audio (`gpt-4o-realtime-preview`) are defined as the number of new websocket connections per minute. For example, 6 request per minute (RPM) means 6 new connections per minute. At this time, the usage limits for `gpt-4o-realtime-preview` are suitable for test and development. | 6 new connections per minute |

0 commit comments

Comments
 (0)