Skip to content

Commit 83f1d3d

Browse files
Merge pull request #4079 from MicrosoftDocs/main
Merged by Learn.Build PR Management system
2 parents f593d78 + b0d907a commit 83f1d3d

File tree

2 files changed

+9
-3
lines changed

2 files changed

+9
-3
lines changed

articles/ai-foundry/concepts/model-lifecycle-retirement.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -80,6 +80,12 @@ The following tables list the timelines for models that are on track for retirem
8080
| [Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere) | February 28, 2025 | March 31, 2025 | June 30, 2025 | [Cohere-rerank-v3.5-english](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere) |
8181
| [Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere) | February 28, 2025 | March 31, 2025 | June 30, 2025 | [Cohere-rerank-v3.5-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere) |
8282

83+
#### DeepSeek
84+
85+
| Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
86+
|-------|-------------------|------------------------|-----------------------|-----------------------------|
87+
| [DeepSeek-V3](https://aka.ms/azureai/landing/DeepSeek-V3) | April 10, 2025 | May 31, 2025 | August 31, 2025 | [DeepSeek-V3-0324](https://aka.ms/azureai/landing/DeepSeek-V3-0324) |
88+
8389
#### Meta
8490

8591
| Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |

articles/ai-foundry/model-inference/quotas-limits.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -32,9 +32,9 @@ Azure uses quotas and limits to prevent budget overruns due to fraud, and to hon
3232
| -------------------- | ------------------- | ----------- |
3333
| Tokens per minute | Azure OpenAI models | Varies per model and SKU. See [limits for Azure OpenAI](../../ai-services/openai/quotas-limits.md). |
3434
| Requests per minute | Azure OpenAI models | Varies per model and SKU. See [limits for Azure OpenAI](../../ai-services/openai/quotas-limits.md). |
35-
| Tokens per minute | DeepSeek-R1 | 5,000,000 |
36-
| Requests per minute | DeepSeek-R1 | 5,000 |
37-
| Concurrent requests | DeepSeek-R1 | 300 |
35+
| Tokens per minute | DeepSeek-R1<br />DeepSeek-V3-0324 | 5,000,000 |
36+
| Requests per minute | DeepSeek-R1<br />DeepSeek-V3-0324 | 5,000 |
37+
| Concurrent requests | DeepSeek-R1<br />DeepSeek-V3-0324 | 300 |
3838
| Tokens per minute | Rest of models | 400,000 |
3939
| Requests per minute | Rest of models | 1,000 |
4040
| Concurrent requests | Rest of models | 300 |

0 commit comments

Comments
 (0)