You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/concepts/model-lifecycle-retirement.md
+37-20Lines changed: 37 additions & 20 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -63,26 +63,43 @@ Models labeled _Retired_ are no longer available for use. You can't create new d
63
63
64
64
- For each subscription that has a model deployed as a severless API or deployed to the Azure AI model inference, members of the _owner_, _contributor_, _reader_, monitoring contributor_, and _monitoring reader_ roles receive a notification when a model deprecation is announced. The notification contains the dates when the model enters legacy, deprecated, and retired states. The notification might provide information about possible replacement model options, if applicable.
65
65
66
-
The following table lists the timelines for models that are on track for retirement. The specified dates are in UTC time.
67
-
68
-
| Model provider | Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
| AI21 Labs | Jamba Instruct | February 1, 2025 | February 1, 2025 | March 1, 2025 |[AI21-Jamba-1.5-Large](https://ai.azure.com/explore/models/AI21-Jamba-1.5-Large/version/1/registry/azureml-ai21) <br> [AI21-Jamba-1.5-Mini](https://ai.azure.com/explore/models/AI21-Jamba-1.5-Mini/version/1/registry/azureml-staging)|
71
-
| Cohere |[Command R](https://aka.ms/azureai/landing/Cohere-command-r)| February 24, 2025 | March 25, 2025 | June 30, 2025 |[Cohere Command R 08-2024](https://aka.ms/azureai/landing/Cohere-command-r-08-2024)|
72
-
| Cohere |[Command R+](https://aka.ms/azureai/landing/Cohere-command-r-plus)| February 24, 2025 | March 25, 2025 | June 30, 2025 |[Cohere Command R+ 08-2024](https://aka.ms/azureai/landing/Cohere-command-r-plus-08-2024)|
73
-
| Cohere |[Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Cohere-rerank-v3.5-english](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)|
74
-
| Cohere |[Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Cohere-rerank-v3.5-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)|
75
-
| Meta |[Llama-2-13b](https://ai.azure.com/explore/models/Llama-2-13b/version/24/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
76
-
| Meta |[Llama-2-13b-chat](https://ai.azure.com/explore/models/Llama-2-13b-chat/version/22/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
77
-
| Meta |[Llama-2-70b](https://ai.azure.com/explore/models/Llama-2-70b/version/25/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
78
-
| Meta |[Llama-2-70b-chat](https://ai.azure.com/explore/models/Llama-2-70b-chat/version/22/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
79
-
| Meta |[Llama-2-7b](https://ai.azure.com/explore/models/Llama-2-7b/version/23/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
80
-
| Meta |[Llama-2-7b-chat](https://ai.azure.com/explore/models/Llama-2-7b-chat/version/27/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
81
-
| Meta |[Meta-Llama-3-70B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3-70B-Instruct/version/9/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
82
-
| Meta |[Meta-Llama-3-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3-8B-Instruct/version/9/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
83
-
| Meta |[Meta-Llama-3.1-70B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-70B-Instruct/version/4/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
84
-
| Mistral AI |[Mistral-large-2407](https://aka.ms/azureai/landing/Mistral-Large-2407)| January 13, 2025 | February 13, 2025 | May 13, 2025 |[Mistral-large-2411](https://aka.ms/aistudio/landing/Mistral-Large-2411)|
85
-
| Mistral AI |[Mistral-large](https://aka.ms/azureai/landing/Mistral-Large)| December 15, 2024 | January 15, 2025 | April 15, 2025 |[Mistral-large-2411](https://aka.ms/aistudio/landing/Mistral-Large-2411)|
66
+
The following tables list the timelines for models that are on track for retirement. The specified dates are in UTC time.
67
+
68
+
#### AI21 Labs
69
+
70
+
| Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
|[Command R](https://aka.ms/azureai/landing/Cohere-command-r)| February 24, 2025 | March 25, 2025 | June 30, 2025 |[Cohere Command R 08-2024](https://aka.ms/azureai/landing/Cohere-command-r-08-2024)|
79
+
|[Command R+](https://aka.ms/azureai/landing/Cohere-command-r-plus)| February 24, 2025 | March 25, 2025 | June 30, 2025 |[Cohere Command R+ 08-2024](https://aka.ms/azureai/landing/Cohere-command-r-plus-08-2024)|
80
+
|[Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Cohere-rerank-v3.5-english](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)|
81
+
|[Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Cohere-rerank-v3.5-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)|
82
+
83
+
#### Meta
84
+
85
+
| Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
|[Llama-2-13b](https://ai.azure.com/explore/models/Llama-2-13b/version/24/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
88
+
|[Llama-2-13b-chat](https://ai.azure.com/explore/models/Llama-2-13b-chat/version/22/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
89
+
|[Llama-2-70b](https://ai.azure.com/explore/models/Llama-2-70b/version/25/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
90
+
|[Llama-2-70b-chat](https://ai.azure.com/explore/models/Llama-2-70b-chat/version/22/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
91
+
|[Llama-2-7b](https://ai.azure.com/explore/models/Llama-2-7b/version/23/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
92
+
|[Llama-2-7b-chat](https://ai.azure.com/explore/models/Llama-2-7b-chat/version/27/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
93
+
|[Meta-Llama-3-70B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3-70B-Instruct/version/9/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
94
+
|[Meta-Llama-3-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3-8B-Instruct/version/9/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Meta-Llama-3.1-8B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-8B-Instruct/version/4/registry/azureml-meta)|
95
+
|[Meta-Llama-3.1-70B-Instruct](https://ai.azure.com/explore/models/Meta-Llama-3.1-70B-Instruct/version/4/registry/azureml-meta)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)|
96
+
97
+
#### Mistral AI
98
+
99
+
| Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
|[Mistral-large-2407](https://aka.ms/azureai/landing/Mistral-Large-2407)| January 13, 2025 | February 13, 2025 | May 13, 2025 |[Mistral-large-2411](https://aka.ms/aistudio/landing/Mistral-Large-2411)|
102
+
|[Mistral-large](https://aka.ms/azureai/landing/Mistral-Large)| December 15, 2024 | January 15, 2025 | April 15, 2025 |[Mistral-large-2411](https://aka.ms/aistudio/landing/Mistral-Large-2411)|
|[DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek)|[chat-completion with reasoning content](../model-inference/how-to/use-chat-reasoning.md?context=/azure/ai-foundry/context/context)| - **Input:** text (16,384 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text. |
158
158
159
+
For a tutorial on DeepSeek-R1, see [Tutorial: Get started with DeepSeek-R1 reasoning model in Azure AI model inference](../model-inference/tutorials/get-started-deepseek-r1.md?context=/azure/ai-foundry/context/context).
159
160
160
161
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
161
162
@@ -321,13 +322,26 @@ There are four pricing meters that determine the price you pay. These meters are
321
322
322
323
See the [Nixtla model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=nixtla).
323
324
324
-
## NTT Data
325
+
## NTT DATA
325
326
326
-
**Tsuzumi** is an autoregressive language optimized transformer. The tuned versions use supervised fine-tuning (SFT). Tsuzumi is handles both Japanese and English language with high efficiency.
327
+
**tsuzumi** is an autoregressive language optimized transformer. The tuned versions use supervised fine-tuning (SFT). tsuzumi handles both Japanese and English language with high efficiency.
327
328
328
329
| Model | Type | Capabilities |
329
330
| ------ | ---- | ------------ |
330
-
|[Tsuzumi-7b](https://ai.azure.com/explore/models/Tsuzumi-7b/version/1/registry/azureml-nttdata)|[chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| - **Input:** text (8,192 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text |
331
+
|[tsuzumi-7b](https://ai.azure.com/explore/models/Tsuzumi-7b/version/1/registry/azureml-nttdata)|[chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| - **Input:** text (8,192 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text |
332
+
333
+
## Stability AI
334
+
335
+
The Stability AI collection of image generation models include Stable Image Core, Stable Image Ultra and Stable Diffusion 3.5 Large. Stable Diffusion 3.5 Large allows for an image and text input.
336
+
337
+
| Model | Type | Capabilities |
338
+
| ------ | ---- | ------------ |
339
+
|[Stable Diffusion 3.5 Large](https://ai.azure.com/explore/models/Stable-Diffusion-3.5-Large/versions/1)|[Image Generation](../how-to/deploy-stability-models.md?context=/azure/ai-foundry/context/context)| - Input: text and image (1000 tokens and 1 image) <br /> - Output: 1 Image <br /> - **Tool calling:** No <br /> - Response formats: Image (PNG and JPG) |
Copy file name to clipboardExpand all lines: articles/ai-foundry/how-to/deploy-models-serverless.md
+4Lines changed: 4 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -31,6 +31,10 @@ This article uses a Meta Llama model deployment for illustration. However, you c
31
31
32
32
- An [Azure AI Foundry project](create-projects.md).
33
33
34
+
- You have to disable the feature **Deploy models to Azure AI model inference service**. When this feature is on, serverless API endpoints are not available for deployment when using the Azure AI Foundry portal.
35
+
36
+
:::image type="content" source="../model-inference/media/quickstart-ai-project/ai-project-inference-endpoint.gif" alt-text="An animation showing how to turn on the Deploy models to Azure AI model inference service feature in Azure AI Foundry portal." lightbox="../model-inference/media/quickstart-ai-project/ai-project-inference-endpoint.gif":::
37
+
34
38
- Azure role-based access controls (Azure RBAC) are used to grant access to operations in Azure AI Foundry portal. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the resource group. For more information on permissions, see [Role-based access control in Azure AI Foundry portal](../concepts/rbac-ai-foundry.md).
35
39
36
40
- You need to install the following software to work with Azure AI Foundry:
0 commit comments