You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/foundry-models/concepts/models.md
+29-25Lines changed: 29 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,7 +19,7 @@ ms.custom:
19
19
# Foundry Models and capabilities
20
20
21
21
Azure AI Foundry Models gives you access to flagship models in Azure AI Foundry to consume them as APIs with flexible deployment options.
22
-
This article lists a selection of model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated).
22
+
This article lists a selection of model offerings along with their capabilities, [deployment types and regions of availability](deployment-types.md), excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated).
23
23
24
24
Depending on what [kind of project](../../what-is-azure-ai-foundry.md#work-in-an-azure-ai-foundry-project) you're using in Azure AI Foundry, you might see a different selection of these models. Specifically, if you're using a Foundry project, built on an Azure AI Foundry resource, you see the models that are available for standard deployment to a Foundry resource. Alternatively, if you're using a hub-based project, hosted by an Azure AI Foundry hub, you see models that are available for deployment to managed compute and serverless APIs. These model selections do overlap in many cases, since many models support the multiple [deployment options](../../concepts/deployments-overview.md).
25
25
@@ -43,22 +43,22 @@ To learn more about the available Azure OpenAI models and their regions of avail
43
43
44
44
### Black Forest Labs models sold directly by Azure
45
45
46
-
The Black Forest Labs collection of image generation models include FLUX.1 Kontext [pro] for in-context generation and editing and FLUX1.1 [pro] for text-to-image generation.
46
+
The Black Forest Labs collection of image generation models includes FLUX.1 Kontext [pro] for in-context generation and editing and FLUX1.1 [pro] for text-to-image generation.
47
47
48
48
| Model | Type | Capabilities | Region availability | Project type |
|[FLUX.1-Kontext-pro](https://ai.azure.com/explore/models/FLUX.1-Kontext-pro/version/1/registry/azureml-blackforestlabs)| Image generation | - **Input:** text and image (5,000 tokens and 1 image) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats**: Image (PNG and JPG) | - Global standard (all regions) | Foundry, Hub-based |
51
+
|[FLUX-1.1-pro](https://ai.azure.com/explore/models/FLUX-1.1-pro/version/1/registry/azureml-blackforestlabs)| Image generation | - **Input:** text (5,000 tokens) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats:** Image (PNG and JPG) | - Global standard (all regions) | Hub-based |
52
52
53
53
54
54
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=black+forest+labs).
55
55
56
56
### DeepSeek models sold directly by Azure
57
57
58
-
DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks using a step-by-step training process, such as language, scientific reasoning, and coding tasks.
58
+
The DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks by using a step-by-step training process, such as language, scientific reasoning, and coding tasks.
59
59
60
60
| Model | Type | Capabilities | Region availability | Project type |
|[DeepSeek-R1-0528](https://ai.azure.com/explore/models/deepseek-r1-0528/version/1/registry/azureml-deepseek)| chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md)| - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | - Global standard (all regions) <br> - Global provisioned (all regions)| Foundry, Hub-based |
63
63
|[DeepSeek-V3-0324](https://ai.azure.com/explore/models/deepseek-v3-0324/version/1/registry/azureml-deepseek)| chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON | - Global standard (all regions) <br> - Global provisioned (all regions) | Foundry, Hub-based |
64
64
|[DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek)| chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md)| - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | - Global standard (all regions) <br> - Global provisioned (all regions) | Foundry, Hub-based |
@@ -67,18 +67,18 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
67
67
68
68
### Meta models sold directly by Azure
69
69
70
-
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range is scale to include:
70
+
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range in scale to include:
71
71
72
72
- Small language models (SLMs) like 1B and 3B Base and Instruct models for on-device and edge inferencing
73
73
- Mid-size large language models (LLMs) like 7B, 8B, and 70B Base and Instruct models
74
-
- High-performant models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
74
+
- High-performance models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
75
75
76
76
| Model | Type | Capabilities | Region availability | Project type |
|[Llama-4-Maverick-17B-128E-Instruct-FP8](https://ai.azure.com/explore/models/Llama-4-Maverick-17B-128E-Instruct-FP8/version/1/registry/azureml-meta)| chat-completion | - **Input:** text and images (1M tokens) <br /> - **Output:** text (1M tokens) <br /> - **Languages:**`ar`, `en`, `fr`, `de`, `hi`, `id`, `it`, `pt`, `es`, `tl`, `th`, and `vi` <br /> - **Tool calling:** No* <br /> - **Response formats:** Text | - Global standard (all regions) | Foundry, Hub-based |
79
79
|[Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta)| chat-completion | - **Input:** text (128,000 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Languages:**`en`, `de`, `fr`, `it`, `pt`, `hi`, `es`, and `th` <br /> - **Tool calling:** No* <br /> - **Response formats:** Text | - Global standard (all regions) | Foundry, Hub-based |
80
80
81
-
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Meta). There are also several Meta models available [from partners and community](#meta).
81
+
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Meta). You can also find several Meta models available [from partners and community](#meta).
82
82
83
83
### Microsoft models sold directly by Azure
84
84
@@ -88,23 +88,27 @@ Microsoft models include various model groups such as MAI models, Phi models, he
|[MAI-DS-R1](https://ai.azure.com/explore/models/MAI-DS-R1/version/1/registry/azureml)| chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md)| - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:**`en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. |- Global standard (all regions) | Foundry, Hub-based |
90
90
91
-
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). There are also several Microsoft models available [from partners and community](#microsoft).
91
+
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). You can also find several Microsoft models available [from partners and community](#microsoft).
92
92
93
93
### Mistral models sold directly by Azure
94
94
95
95
| Model | Type | Capabilities | Region availability | Project type |
|[mistral-document-ai-2505](https://ai.azure.com/explore/models/mistral-document-ai-2505/version/1/registry/azureml-mistral)| Image-to-Text | - **Input:** image or PDF pages ( 30 pages, max 30MB PDF file) <br /> - **Output:** text <br /> - **Languages:** en <br /> - **Tool calling:** no <br /> - **Response formats:** Text, JSON, Markdown |- Global standard (all regions) <br> - Data zone standard (US) | Foundry |
98
98
99
-
See [the Mistral model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Mistral+AI). There are also several Mistral models available [from partners and community](#mistral-ai).
99
+
See [the Mistral model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Mistral+AI). You can also find several Mistral models available [from partners and community](#mistral-ai).
100
100
101
101
102
102
### xAI models sold directly by Azure
103
103
104
-
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pretrained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
104
+
xAI's Grok 3 and Grok 3 Mini models excel in various enterprise domains.
105
+
106
+
Grok 3 is a non-reasoning model pretrained by the Colossus datacenter. It's tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, so it can handle extensive inputs while maintaining coherence and depth. It's adept at drawing connections across domains and languages.
107
+
108
+
On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents. It excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
105
109
106
110
| Model | Type | Capabilities | Region availability | Project type |
|[grok-3](https://ai.azure.com/explore/models/grok-3/version/1/registry/azureml-xai)<sup>1</sup> | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |- Global standard (all regions) <br> - Data zone standard (US) | Foundry, Hub-based |
109
113
|[grok-3-mini](https://ai.azure.com/explore/models/grok-3-mini/version/1/registry/azureml-xai)<sup>1</sup> | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text | - Global standard (all regions) <br> - Data zone standard (US) | Foundry, Hub-based |
110
114
@@ -113,7 +117,7 @@ See [the xAI model collection in Azure AI Foundry portal](https://ai.azure.com/e
113
117
114
118
## Models from partners and community
115
119
116
-
Models from partners and community constitute the majority of the Azure AI Foundry Models and are provided by trusted third-party organizations, partners, research labs, and community contributors. To learn more about these models, see [Models from Partners and Community](../../concepts/foundry-models-overview.md#models-from-partners-and-community).
120
+
Models from partners and community make up most of the Azure AI Foundry Models. Trusted third-party organizations, partners, research labs, and community contributors provide these models. For more information about these models, see [Models from Partners and Community](../../concepts/foundry-models-overview.md#models-from-partners-and-community).
117
121
118
122
119
123
### Cohere
@@ -141,7 +145,7 @@ See [the Cohere model collection in Azure AI Foundry portal](https://ai.azure.co
141
145
142
146
### Core42
143
147
144
-
Core42 includes autoregressive bi-lingual LLMs for Arabic & English with state-of-the-art capabilities in Arabic.
148
+
Core42 includes autoregressive bilingual LLMs for Arabic and English with state-of-the-art capabilities in Arabic.
145
149
146
150
| Model | Type | Capabilities | Project type |
147
151
| ------ | ---- | ------------ | ------------ |
@@ -151,11 +155,11 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
151
155
152
156
### Meta
153
157
154
-
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range is scale to include:
158
+
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range in scale to include:
155
159
156
160
- Small language models (SLMs) like 1B and 3B Base and Instruct models for on-device and edge inferencing
157
161
- Mid-size large language models (LLMs) like 7B, 8B, and 70B Base and Instruct models
158
-
- High-performant models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
162
+
- High-performance models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
159
163
160
164
| Model | Type | Capabilities | Project type |
161
165
| ------ | ---- | ------------ | ------------ |
@@ -179,7 +183,7 @@ Microsoft models include various model groups such as MAI models, Phi models, he
179
183
|[Phi-4-reasoning](https://aka.ms/azureai/landing/Phi-4-reasoning)| chat-completion with reasoning content | - **Input:** text (32,768 tokens) <br /> - **Output:** text (32,768 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
180
184
|[Phi-4-mini-reasoning](https://aka.ms/azureai/landing/Phi-4-mini-reasoning)| chat-completion with reasoning content | - **Input:** text (128,000 tokens) <br /> - **Output:** text (128,000 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
181
185
182
-
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). There are also Microsoft models available as [models sold directly by Azure](#microsoft-models-sold-directly-by-azure).
186
+
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). Microsoft models are also available as [models sold directly by Azure](#microsoft-models-sold-directly-by-azure).
183
187
184
188
### Mistral AI
185
189
@@ -204,7 +208,7 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
204
208
205
209
### Nixtla
206
210
207
-
Nixtla's TimeGEN-1 is a generative pretrained forecasting and anomaly detection model for time series data. TimeGEN-1 can produce accurate forecasts for new time series without training, using only historical values and exogenous covariates as inputs.
211
+
Nixtla's TimeGEN-1 is a generative pretrained forecasting and anomaly detection model for time series data. TimeGEN-1 produces accurate forecasts for new time series without training, using only historical values and exogenous covariates as inputs.
208
212
209
213
To perform inferencing, TimeGEN-1 requires you to use Nixtla's custom inference API.
210
214
@@ -217,7 +221,7 @@ For more details on pricing for Nixtla models, see [Nixtla](../../concepts/model
217
221
218
222
### NTT Data
219
223
220
-
**tsuzumi** is an autoregressive languageoptimized transformer. The tuned versions use supervised fine-tuning (SFT). tsuzumi handles both Japanese and English language with high efficiency.
224
+
**tsuzumi** is an autoregressive language-optimized transformer. The tuned versions use supervised fine-tuning (SFT). tsuzumi handles both Japanese and English language with high efficiency.
221
225
222
226
| Model | Type | Capabilities | Project type |
223
227
| ------ | ---- | ------------ | ------------ |
@@ -227,7 +231,7 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
227
231
228
232
### Stability AI
229
233
230
-
The Stability AI collection of image generation models include Stable Image Core, Stable Image Ultra, and Stable Diffusion 3.5 Large. Stable Diffusion 3.5 Large allows for an image and text input.
234
+
The Stability AI collection of image generation models includes Stable Image Core, Stable Image Ultra, and Stable Diffusion 3.5 Large. Stable Diffusion 3.5 Large accepts both image and text input.
231
235
232
236
| Model | Type | Capabilities | Project type |
233
237
| ------ | ---- | ------------ | ------------ |
@@ -240,7 +244,7 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
240
244
241
245
## Open and custom models
242
246
243
-
The model catalog offers a larger selection of models, from a bigger range of providers. For these models, you can't use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs; rather, to deploy these models, you might be required to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
247
+
The model catalog offers a larger selection of models from a wider range of providers. For these models, you can't use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs. Instead, to deploy these models, you might need to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
244
248
245
249
Furthermore, these models can be open-access or IP protected. In both cases, you have to deploy them in managed compute offerings in Azure AI Foundry. To get started, see [How-to: Deploy to Managed compute](../../how-to/deploy-models-managed.md).
0 commit comments