articles/ai-foundry/concepts/model-lifecycle-retirement.md
Lines changed: 6 additions & 0 deletions
@@ -80,6 +80,12 @@ The following tables list the timelines for models that are on track for retirem
 |[Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Cohere-rerank-v3.5-english](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)|
 |[Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere)| February 28, 2025 | March 31, 2025 | June 30, 2025 |[Cohere-rerank-v3.5-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)|
 
+#### DeepSeek
+
+| Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
+|[DeepSeek-V3](https://aka.ms/azureai/landing/DeepSeek-V3)| April 10, 2025 | May 31, 2025 | August 31, 2025 |[DeepSeek-V3-0324](https://aka.ms/azureai/landing/DeepSeek-V3-0324)|
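The three dates in the DeepSeek-V3 row can be read as boundaries between lifecycle stages. The following is a minimal sketch of that reading, not part of the documented change. The stage labels, the function names, and the choice to treat each listed date as the first day of its stage are assumptions made for illustration.

```python
from datetime import date

# Dates copied from the DeepSeek-V3 row above (all UTC).
LEGACY = date(2025, 4, 10)
DEPRECATION = date(2025, 5, 31)
RETIREMENT = date(2025, 8, 31)


def lifecycle_stage(today: date) -> str:
    """Return a stage label for DeepSeek-V3 on the given day.

    Assumption: each stage begins on its listed date (inclusive).
    The labels themselves are hypothetical.
    """
    if today >= RETIREMENT:
        return "retired"
    if today >= DEPRECATION:
        return "deprecated"
    if today >= LEGACY:
        return "legacy"
    return "active"


def days_until_retirement(today: date) -> int:
    """Return the number of days left before retirement, never negative."""
    return max((RETIREMENT - today).days, 0)
```

A check like this might run in a scheduled job to flag deployments that should move to the suggested replacement, DeepSeek-V3-0324, before the retirement date.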
+
 #### Meta
 
 | Model | Legacy date (UTC) | Deprecation date (UTC) | Retirement date (UTC) | Suggested replacement model |
articles/ai-foundry/concepts/models-featured.md
Lines changed: 3 additions & 2 deletions
@@ -141,11 +141,12 @@ For more examples of how to use Jais models, see the following examples:
 
 ## DeepSeek
 
-DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks using a step-by-step training process, such as language, scientific reasoning, and coding tasks, and DeepSeek-V3, a Mixture-of-Experts (MoE) language model.
+The DeepSeek family of models includes DeepSeek-R1, which uses a step-by-step training process to excel at reasoning tasks such as language, scientific reasoning, and coding; DeepSeek-V3-0324, a Mixture-of-Experts (MoE) language model; and more.
 
 | Model | Type | Capabilities |
 | ------ | ---- | --- |
-|[DeepSeek-V3](https://ai.azure.com/explore/models/deepseek-v3/version/1/registry/azureml-deepseek)|[chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON |
+|[DeepSeek-V3](https://ai.azure.com/explore/models/deepseek-v3/version/1/registry/azureml-deepseek) <br />(Legacy) |[chat-completion](../model-inference/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON |
 |[DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek)|[chat-completion with reasoning content](../model-inference/how-to/use-chat-reasoning.md?context=/azure/ai-foundry/context/context)| - **Input:** text (163,840 tokens) <br /> - **Output:** text (163,840 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text. |
 
 For a tutorial on DeepSeek-R1, see [Tutorial: Get started with DeepSeek-R1 reasoning model in Azure AI model inference](../model-inference/tutorials/get-started-deepseek-r1.md?context=/azure/ai-foundry/context/context).
articles/ai-foundry/model-inference/concepts/models.md
Lines changed: 2 additions & 1 deletion
@@ -110,7 +110,8 @@ DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks
 | Model | Type | Tier | Capabilities |
 | ------ | ---- | --- | ------------ |
 |[DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek)| chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md)| Global standard | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. |
-|[DeepSeek-V3](https://ai.azure.com/explore/models/deepseek-v3/version/1/registry/azureml-deepseek)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON |
+|[DeepSeek-V3](https://ai.azure.com/explore/models/deepseek-v3/version/1/registry/azureml-deepseek) <br />(Legacy) | chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON |
+|[DeepSeek-V3-0324](https://ai.azure.com/explore/models/deepseek-v3-0324/version/1/registry/azureml-deepseek)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON |
 
 For a tutorial on DeepSeek-R1, see [Tutorial: Get started with DeepSeek-R1 reasoning model in Azure AI model inference](../tutorials/get-started-deepseek-r1.md).
-4. Let's see first which models are available to you and under which SKU. The following command list all the model definitions available:
+4. Let's see first which models are available to you and under which SKU. SKUs, also known as [deployment types](../../concepts/deployment-types.md), define how Azure infrastructure is used to process requests. Models may offer different deployment types. The following command lists all the model definitions available:
 
 ```azurecli
 az cognitiveservices account list-models \
@@ -77,10 +78,7 @@ To add a model, you first need to identify the model that you want to deploy. Yo
 }
 ```
 
-6. Identify the model you want to deploy. You need the properties `name`, `format`, `version`, and `sku`. Capacity might also be needed depending on the type of deployment.
-
-> [!TIP]
-> Notice that not all the models are available in all the SKUs.
+6. Identify the model you want to deploy. You need the properties `name`, `format`, `version`, and `sku`. The property `format` indicates the provider offering the model. Capacity might also be needed depending on the type of deployment.
 
 7. Add the model deployment to the resource. The following example adds `Phi-3.5-vision-instruct`:
articles/ai-services/agents/how-to/tools/fabric.md
Lines changed: 2 additions & 0 deletions
@@ -34,6 +34,8 @@ You need to first build and publish a Fabric data agent and then connect your Fa
 
 * Developers and end users have at least `READ` access to the Fabric data agent and the underlying data sources it connects with.
 
+* Your Fabric Data Agent and Azure AI Agent need to be in the same tenant.
+
 ## Setup
 > [!NOTE]
 > * The model you selected in Azure AI Agent setup is only used for agent orchestration and response generation. It doesn't impact which model Fabric data agent uses for NL2SQL operation.
articles/ai-services/language-service/whats-new.md
Lines changed: 8 additions & 8 deletions
@@ -17,14 +17,14 @@ Azure AI Language is updated on an ongoing basis. To stay up-to-date with recent
 ## March 2025
 
 * Azure AI Language resource now can be deployed to 3 new regions, Jio India Central, UK West and Canada East, for the following capabilities:
-* Language detection
-* Sentiment analysis
-* Key phrase extraction
-* Named entity recognition (NER)
-* Personally identifiable information (PII) entity recognition
-* Entity linking
-* Text analytics for health
-* Extractive text summarization
+* Language detection
+* Sentiment analysis
+* Key phrase extraction
+* Named entity recognition (NER)
+* Personally identifiable information (PII) entity recognition
+* Entity linking
+* Text analytics for health
+* Extractive text summarization
 
 * Back-end infrastructure for the Named entity recognition (NER) and Text Personally identifiable information (PII) entity recognition models is now updated with extended context window limits.
@@ -18,6 +18,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
 
 | Models | Description |
 |--|--|
+|[GPT-4.1 series](#gpt-41-series)| Latest model release from Azure OpenAI |
 |[computer-use-preview](#computer-use-preview)| An experimental model trained for use with the Responses API computer use tool. |
 |[GPT-4.5 Preview](#gpt-45-preview)|The latest GPT model that excels at diverse text and image tasks. |
 |[o-series models](#o-series-models)|[Reasoning models](../how-to/reasoning.md) with advanced problem-solving and increased focus and capability. |
@@ -30,6 +31,20 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
 |[Whisper](#whisper-models)| A series of models in preview that can transcribe and translate speech to text. |
 |[Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
 
+## GPT 4.1 series
+
+### Region Availability
+
+| Model | Region |
+|---|---|
+|`gpt-4.1` (2025-04-14) | East US2 (Global Standard), Sweden Central (Global Standard) |
+
+### Capabilities
+
+| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
+| --- | :--- |:--- |:---|:---: |
+|`gpt-4.1` (2025-04-14) <br> <br> **Latest model from Azure OpenAI**| - Text & image input <br> - Text output <br> - Chat completions API <br>- Responses API <br> - Streaming <br> - Function calling <br> - Structured outputs (chat completions) | 1,047,576 | 32,768 | May 31, 2024 |
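The two limits in the `gpt-4.1` row can be combined into a simple budget check before a request is sent. The following is a minimal sketch rather than anything the service provides. The function name is hypothetical, and the idea that prompt and output tokens share the context window is an assumption that the table does not state.

```python
# Limits copied from the gpt-4.1 (2025-04-14) row above.
CONTEXT_WINDOW = 1_047_576
MAX_OUTPUT_TOKENS = 32_768


def clamp_output_tokens(prompt_tokens: int, requested_output: int) -> int:
    """Return an output-token budget that respects both limits.

    Assumption (not stated in the table): prompt and output tokens
    share the context window.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    # Tokens left in the window once the prompt is accounted for.
    remaining = max(CONTEXT_WINDOW - prompt_tokens, 0)
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)
```

For example, a caller that asks for 50,000 output tokens on a short prompt would be clamped to 32,768, the listed maximum.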
+
 ## computer-use-preview
 
 An experimental model trained for use with the [Responses API](../how-to/responses.md) computer use tool. It can be used in conjunction with 3rd-party libraries to allow the model to control mouse & keyboard input while getting context from screenshots of the current environment.
@@ -70,7 +85,7 @@ Once access has been granted, you will need to create a deployment for the model
 
 | Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
 | --- | :--- |:--- |:---|:---: |
-|`gpt-4.5-preview` (2025-02-27) <br> **GPT-4.5 Preview**|The **latest GPT model** that excels at diverse text and image tasks. <br>-Structured outputs <br>-Prompt caching <br>-Tools <br>-Streaming<br>-Text(input/output)<br>- Image(input) | 128,000 | 16,384 | Oct 2023 |
+|`gpt-4.5-preview` (2025-02-27) <br> **GPT-4.5 Preview**|[GPT 4.1](#gpt-41-series) is the recommended replacement for this model. Excels at diverse text and image tasks. <br>-Structured outputs <br>-Prompt caching <br>-Tools <br>-Streaming<br>-Text(input/output)<br>- Image(input) | 128,000 | 16,384 | Oct 2023 |
 
 > [!NOTE]
 > It is expected behavior that the model cannot answer questions about itself. If you want to know when the knowledge cutoff for the model's training data is, or other details about the model you should refer to the model documentation above.
articles/ai-services/openai/how-to/batch.md
Lines changed: 6 additions & 1 deletion
@@ -6,7 +6,7 @@ manager: nitinme
 ms.service: azure-ai-openai
 ms.custom: references_regions
 ms.topic: how-to
-ms.date: 01/14/2025
+ms.date: 04/14/2025
 author: mrbullwinkle
 ms.author: mbullwin
 recommendations: false
@@ -33,6 +33,11 @@ Key use cases include:
 
 * **Marketing and Personalization:** Generate personalized content and recommendations at scale.
 
+> [!TIP]
+> If your batch jobs are so large that you are hitting the enqueued token limit even after maxing out the quota for your deployment, certain regions now support a new feature that allows you to queue multiple batch jobs with exponential backoff.
+>
+> Once your enqueued token quota is available, the next batch job can be created and kicked off automatically. To learn more, see [**automating retries of large batch jobs with exponential backoff**](#queueing-batch-jobs).
+
 > [!IMPORTANT]
 > We aim to process batch requests within 24 hours; we don't expire the jobs that take longer. You can [cancel](#cancel-batch) the job anytime. When you cancel the job, any remaining work is cancelled and any already completed work is returned. You'll be charged for any completed work.
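The tip above mentions retrying batch submissions with exponential backoff. The following is a generic sketch of that pattern only, not the service feature or its documented implementation. `QuotaExceeded`, `submit_with_backoff`, and the `submit` callable are hypothetical names, and a real client would need to catch whatever error its SDK raises when the enqueued token limit is hit.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class QuotaExceeded(Exception):
    """Hypothetical error raised when the enqueued token limit is hit."""


def submit_with_backoff(
    submit: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 60.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call submit() until it succeeds, doubling the wait after each failure."""
    for attempt in range(max_attempts):
        try:
            return submit()
        except QuotaExceeded:
            if attempt == max_attempts - 1:
                raise  # out of attempts; surface the error to the caller
            # Delay grows as base_delay * 2^attempt, capped at max_delay.
            delay = min(base_delay * (2 ** attempt), max_delay)
            sleep(delay)
    raise ValueError("max_attempts must be at least 1")
```

The `sleep` parameter is injectable so that the wait schedule can be checked without actually pausing. In practice the delays would more likely be minutes than seconds, since quota frees up only as earlier batch jobs complete.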