Skip to content

Commit 72b7a83

Browse files
committed
fixes
1 parent f9cecfe commit 72b7a83

File tree

1 file changed

+29
-25
lines changed
  • articles/ai-foundry/foundry-models/concepts

1 file changed

+29
-25
lines changed

articles/ai-foundry/foundry-models/concepts/models.md

Lines changed: 29 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ ms.custom:
1919
# Foundry Models and capabilities
2020

2121
Azure AI Foundry Models gives you access to flagship models in Azure AI Foundry to consume them as APIs with flexible deployment options.
22-
This article lists a selection of model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated).
22+
This article lists a selection of model offerings along with their capabilities, [deployment types and regions of availability](deployment-types.md), excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated).
2323

2424
Depending on what [kind of project](../../what-is-azure-ai-foundry.md#work-in-an-azure-ai-foundry-project) you're using in Azure AI Foundry, you might see a different selection of these models. Specifically, if you're using a Foundry project, built on an Azure AI Foundry resource, you see the models that are available for standard deployment to a Foundry resource. Alternatively, if you're using a hub-based project, hosted by an Azure AI Foundry hub, you see models that are available for deployment to managed compute and serverless APIs. These model selections do overlap in many cases, since many models support the multiple [deployment options](../../concepts/deployments-overview.md).
2525

@@ -43,22 +43,22 @@ To learn more about the available Azure OpenAI models and their regions of avail
4343

4444
### Black Forest Labs models sold directly by Azure
4545

46-
The Black Forest Labs collection of image generation models include FLUX.1 Kontext [pro] for in-context generation and editing and FLUX1.1 [pro] for text-to-image generation.
46+
The Black Forest Labs collection of image generation models includes FLUX.1 Kontext [pro] for in-context generation and editing and FLUX1.1 [pro] for text-to-image generation.
4747

4848
| Model | Type | Capabilities | Region availability | Project type |
49-
| ------ | ---- | ------------ | ------------------- | ------------ |
50-
| [FLUX.1-Kontext-pro](https://ai.azure.com/explore/models/FLUX.1-Kontext-pro/version/1/registry/azureml-blackforestlabs) | Image generation | - **Input:** text and image (5000 tokens and 1 image) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats**: Image (PNG and JPG) | - Global standard (all regions) | Foundry, Hub-based |
51-
| [FLUX-1.1-pro](https://ai.azure.com/explore/models/FLUX-1.1-pro/version/1/registry/azureml-blackforestlabs) | Image generation | - **Input:** text (5000 tokens) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats:** Image (PNG and JPG) | - Global standard (all regions) | Hub-based |
49+
| ------ | ---- | ------------ | Deployment type (region availability) | ------------ |
50+
| [FLUX.1-Kontext-pro](https://ai.azure.com/explore/models/FLUX.1-Kontext-pro/version/1/registry/azureml-blackforestlabs) | Image generation | - **Input:** text and image (5,000 tokens and 1 image) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats**: Image (PNG and JPG) | - Global standard (all regions) | Foundry, Hub-based |
51+
| [FLUX-1.1-pro](https://ai.azure.com/explore/models/FLUX-1.1-pro/version/1/registry/azureml-blackforestlabs) | Image generation | - **Input:** text (5,000 tokens) <br /> - **Output:** One Image <br /> - **Tool calling:** No <br /> - **Response formats:** Image (PNG and JPG) | - Global standard (all regions) | Hub-based |
5252

5353

5454
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=black+forest+labs).
5555

5656
### DeepSeek models sold directly by Azure
5757

58-
DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks using a step-by-step training process, such as language, scientific reasoning, and coding tasks.
58+
The DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks by using a step-by-step training process, such as language, scientific reasoning, and coding tasks.
5959

6060
| Model | Type | Capabilities | Region availability | Project type |
61-
| ------ | ---- | ------------ | ------------------- | ------------ |
61+
| ------ | ---- | ------------ | Deployment type (region availability) | ------------ |
6262
| [DeepSeek-R1-0528](https://ai.azure.com/explore/models/deepseek-r1-0528/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | - Global standard (all regions) <br> - Global provisioned (all regions)| Foundry, Hub-based |
6363
| [DeepSeek-V3-0324](https://ai.azure.com/explore/models/deepseek-v3-0324/version/1/registry/azureml-deepseek) | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON | - Global standard (all regions) <br> - Global provisioned (all regions) | Foundry, Hub-based |
6464
| [DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | - Global standard (all regions) <br> - Global provisioned (all regions) | Foundry, Hub-based |
@@ -67,18 +67,18 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
6767

6868
### Meta models sold directly by Azure
6969

70-
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range is scale to include:
70+
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range in scale to include:
7171

7272
- Small language models (SLMs) like 1B and 3B Base and Instruct models for on-device and edge inferencing
7373
- Mid-size large language models (LLMs) like 7B, 8B, and 70B Base and Instruct models
74-
- High-performant models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
74+
- High-performance models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
7575

7676
| Model | Type | Capabilities | Region availability | Project type |
77-
| ------ | ---- | ------------ | ------------------- | ------------ |
77+
| ------ | ---- | ------------ | Deployment type (region availability) | ------------ |
7878
| [Llama-4-Maverick-17B-128E-Instruct-FP8](https://ai.azure.com/explore/models/Llama-4-Maverick-17B-128E-Instruct-FP8/version/1/registry/azureml-meta) | chat-completion | - **Input:** text and images (1M tokens) <br /> - **Output:** text (1M tokens) <br /> - **Languages:** `ar`, `en`, `fr`, `de`, `hi`, `id`, `it`, `pt`, `es`, `tl`, `th`, and `vi` <br /> - **Tool calling:** No* <br /> - **Response formats:** Text | - Global standard (all regions) | Foundry, Hub-based |
7979
| [Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta) | chat-completion | - **Input:** text (128,000 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Languages:** `en`, `de`, `fr`, `it`, `pt`, `hi`, `es`, and `th` <br /> - **Tool calling:** No* <br /> - **Response formats:** Text | - Global standard (all regions) | Foundry, Hub-based |
8080

81-
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Meta). There are also several Meta models available [from partners and community](#meta).
81+
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Meta). You can also find several Meta models available [from partners and community](#meta).
8282

8383
### Microsoft models sold directly by Azure
8484

@@ -88,23 +88,27 @@ Microsoft models include various model groups such as MAI models, Phi models, he
8888
| ------ | ---- | ------------ | ------------------- | ------------ |
8989
| [MAI-DS-R1](https://ai.azure.com/explore/models/MAI-DS-R1/version/1/registry/azureml) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. |- Global standard (all regions) | Foundry, Hub-based |
9090

91-
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). There are also several Microsoft models available [from partners and community](#microsoft).
91+
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). You can also find several Microsoft models available [from partners and community](#microsoft).
9292

9393
### Mistral models sold directly by Azure
9494

9595
| Model | Type | Capabilities | Region availability | Project type |
96-
| ------ | ---- | ------------ | ------------------- | ------------ |
96+
| ------ | ---- | ------------ | Deployment type (region availability) | ------------ |
9797
| [mistral-document-ai-2505](https://ai.azure.com/explore/models/mistral-document-ai-2505/version/1/registry/azureml-mistral) | Image-to-Text | - **Input:** image or PDF pages ( 30 pages, max 30MB PDF file) <br /> - **Output:** text <br /> - **Languages:** en <br /> - **Tool calling:** no <br /> - **Response formats:** Text, JSON, Markdown |- Global standard (all regions) <br> - Data zone standard (US) | Foundry |
9898

99-
See [the Mistral model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Mistral+AI). There are also several Mistral models available [from partners and community](#mistral-ai).
99+
See [the Mistral model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Mistral+AI). You can also find several Mistral models available [from partners and community](#mistral-ai).
100100

101101

102102
### xAI models sold directly by Azure
103103

104-
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pretrained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
104+
xAI's Grok 3 and Grok 3 Mini models excel in various enterprise domains.
105+
106+
Grok 3 is a non-reasoning model pretrained by the Colossus datacenter. It's tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, so it can handle extensive inputs while maintaining coherence and depth. It's adept at drawing connections across domains and languages.
107+
108+
On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents. It excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
105109

106110
| Model | Type | Capabilities | Region availability | Project type |
107-
| ------ | ---- | ------------ | ------------------- | ------------ |
111+
| ------ | ---- | ------------ | Deployment type (region availability) | ------------ |
108112
| [grok-3](https://ai.azure.com/explore/models/grok-3/version/1/registry/azureml-xai)<sup>1</sup> | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:** `en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |- Global standard (all regions) <br> - Data zone standard (US) | Foundry, Hub-based |
109113
| [grok-3-mini](https://ai.azure.com/explore/models/grok-3-mini/version/1/registry/azureml-xai)<sup>1</sup> | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:** `en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text | - Global standard (all regions) <br> - Data zone standard (US) | Foundry, Hub-based |
110114

@@ -113,7 +117,7 @@ See [the xAI model collection in Azure AI Foundry portal](https://ai.azure.com/e
113117

114118
## Models from partners and community
115119

116-
Models from partners and community constitute the majority of the Azure AI Foundry Models and are provided by trusted third-party organizations, partners, research labs, and community contributors. To learn more about these models, see [Models from Partners and Community](../../concepts/foundry-models-overview.md#models-from-partners-and-community).
120+
Models from partners and community make up most of the Azure AI Foundry Models. Trusted third-party organizations, partners, research labs, and community contributors provide these models. For more information about these models, see [Models from Partners and Community](../../concepts/foundry-models-overview.md#models-from-partners-and-community).
117121

118122

119123
### Cohere
@@ -141,7 +145,7 @@ See [the Cohere model collection in Azure AI Foundry portal](https://ai.azure.co
141145

142146
### Core42
143147

144-
Core42 includes autoregressive bi-lingual LLMs for Arabic & English with state-of-the-art capabilities in Arabic.
148+
Core42 includes autoregressive bilingual LLMs for Arabic and English with state-of-the-art capabilities in Arabic.
145149

146150
| Model | Type | Capabilities | Project type |
147151
| ------ | ---- | ------------ | ------------ |
@@ -151,11 +155,11 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
151155

152156
### Meta
153157

154-
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range is scale to include:
158+
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range in scale to include:
155159

156160
- Small language models (SLMs) like 1B and 3B Base and Instruct models for on-device and edge inferencing
157161
- Mid-size large language models (LLMs) like 7B, 8B, and 70B Base and Instruct models
158-
- High-performant models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
162+
- High-performance models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
159163

160164
| Model | Type | Capabilities | Project type |
161165
| ------ | ---- | ------------ | ------------ |
@@ -179,7 +183,7 @@ Microsoft models include various model groups such as MAI models, Phi models, he
179183
| [Phi-4-reasoning](https://aka.ms/azureai/landing/Phi-4-reasoning) | chat-completion with reasoning content | - **Input:** text (32,768 tokens) <br /> - **Output:** text (32,768 tokens) <br /> - **Languages:** `en` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
180184
| [Phi-4-mini-reasoning](https://aka.ms/azureai/landing/Phi-4-mini-reasoning) | chat-completion with reasoning content | - **Input:** text (128,000 tokens) <br /> - **Output:** text (128,000 tokens) <br /> - **Languages:** `en` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
181185

182-
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). There are also Microsoft models available as [models sold directly by Azure](#microsoft-models-sold-directly-by-azure).
186+
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). Microsoft models are also available as [models sold directly by Azure](#microsoft-models-sold-directly-by-azure).
183187

184188
### Mistral AI
185189

@@ -204,7 +208,7 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
204208

205209
### Nixtla
206210

207-
Nixtla's TimeGEN-1 is a generative pretrained forecasting and anomaly detection model for time series data. TimeGEN-1 can produce accurate forecasts for new time series without training, using only historical values and exogenous covariates as inputs.
211+
Nixtla's TimeGEN-1 is a generative pretrained forecasting and anomaly detection model for time series data. TimeGEN-1 produces accurate forecasts for new time series without training, using only historical values and exogenous covariates as inputs.
208212

209213
To perform inferencing, TimeGEN-1 requires you to use Nixtla's custom inference API.
210214

@@ -217,7 +221,7 @@ For more details on pricing for Nixtla models, see [Nixtla](../../concepts/model
217221

218222
### NTT Data
219223

220-
**tsuzumi** is an autoregressive language optimized transformer. The tuned versions use supervised fine-tuning (SFT). tsuzumi handles both Japanese and English language with high efficiency.
224+
**tsuzumi** is an autoregressive language-optimized transformer. The tuned versions use supervised fine-tuning (SFT). tsuzumi handles both Japanese and English language with high efficiency.
221225

222226
| Model | Type | Capabilities | Project type |
223227
| ------ | ---- | ------------ | ------------ |
@@ -227,7 +231,7 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
227231

228232
### Stability AI
229233

230-
The Stability AI collection of image generation models include Stable Image Core, Stable Image Ultra, and Stable Diffusion 3.5 Large. Stable Diffusion 3.5 Large allows for an image and text input.
234+
The Stability AI collection of image generation models includes Stable Image Core, Stable Image Ultra, and Stable Diffusion 3.5 Large. Stable Diffusion 3.5 Large accepts both image and text input.
231235

232236
| Model | Type | Capabilities | Project type |
233237
| ------ | ---- | ------------ | ------------ |
@@ -240,7 +244,7 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
240244

241245
## Open and custom models
242246

243-
The model catalog offers a larger selection of models, from a bigger range of providers. For these models, you can't use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs; rather, to deploy these models, you might be required to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
247+
The model catalog offers a larger selection of models from a wider range of providers. For these models, you can't use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs. Instead, to deploy these models, you might need to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
244248

245249
Furthermore, these models can be open-access or IP protected. In both cases, you have to deploy them in managed compute offerings in Azure AI Foundry. To get started, see [How-to: Deploy to Managed compute](../../how-to/deploy-models-managed.md).
246250

0 commit comments

Comments
 (0)