Skip to content

Commit b0d907a

Browse files
Merge pull request #4076 from santiagxf/santiagxf-patch-1
Update quotas-limits.md
2 parents 64bb6f8 + 43fb49f commit b0d907a

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

articles/ai-foundry/model-inference/quotas-limits.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -32,9 +32,9 @@ Azure uses quotas and limits to prevent budget overruns due to fraud, and to hon
3232
| -------------------- | ------------------- | ----------- |
3333
| Tokens per minute | Azure OpenAI models | Varies per model and SKU. See [limits for Azure OpenAI](../../ai-services/openai/quotas-limits.md). |
3434
| Requests per minute | Azure OpenAI models | Varies per model and SKU. See [limits for Azure OpenAI](../../ai-services/openai/quotas-limits.md). |
35-
| Tokens per minute | DeepSeek-R1 | 5,000,000 |
36-
| Requests per minute | DeepSeek-R1 | 5,000 |
37-
| Concurrent requests | DeepSeek-R1 | 300 |
35+
| Tokens per minute | DeepSeek-R1<br />DeepSeek-V3-0324 | 5,000,000 |
36+
| Requests per minute | DeepSeek-R1<br />DeepSeek-V3-0324 | 5,000 |
37+
| Concurrent requests | DeepSeek-R1<br />DeepSeek-V3-0324 | 300 |
3838
| Tokens per minute | Rest of models | 400,000 |
3939
| Requests per minute | Rest of models | 1,000 |
4040
| Concurrent requests | Rest of models | 300 |

0 commit comments

Comments
 (0)