Skip to content

Commit 501869d

Browse files
Merge pull request #6983 from MicrosoftDocs/main
Auto Publish – main to live - 2025-09-08 22:08 UTC
2 parents 5695e61 + e7d202b commit 501869d

File tree

16 files changed

+252
-157
lines changed

16 files changed

+252
-157
lines changed

.openpublishing.redirection.json

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -300,6 +300,16 @@
300300
"redirect_url": "/azure/ai-foundry",
301301
"redirect_document_id": false
302302
},
303+
{
304+
"source_path": "articles/ai-foundry/openai/concepts/models.md",
305+
"redirect_url": "/azure/ai-foundry/foundry-models/concepts/models-sold-directly-by-azure?pivots=azure-openai",
306+
"redirect_document_id": false
307+
},
308+
{
309+
"source_path": "articles/ai-foundry/foundry-models/concepts/models.md",
310+
"redirect_url": "/azure/ai-foundry/foundry-models/concepts/models-sold-directly-by-azure?pivots=azure-direct-others",
311+
"redirect_document_id": false
312+
},
303313
{
304314
"source_path_from_root": "/articles/ai-services/speech-service/text-to-speech-avatar/custom-avatar-endpoint.md",
305315
"redirect_url": "/azure/ai-services/speech-service/custom-avatar-create",

articles/ai-foundry/foundry-models/concepts/models.md renamed to articles/ai-foundry/foundry-models/concepts/models-from-partners.md

Lines changed: 30 additions & 125 deletions
Large diffs are not rendered by default.
Lines changed: 47 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,47 @@
1+
---
2+
title: Foundry Models sold directly by Azure
3+
titleSuffix: Azure AI Foundry
4+
description: Learn about Azure AI Foundry Models sold directly by Azure, their capabilities, deployment types, and regional availability for AI applications.
5+
author: msakande
6+
ms.author: mopeakande
7+
manager: nitinme
8+
ms.date: 09/05/2025
9+
ms.service: azure-ai-foundry
10+
ms.subservice: azure-ai-foundry-model-inference
11+
ms.topic: conceptual
12+
ms.custom:
13+
- references_regions
14+
- tool_generated
15+
- build-aifnd
16+
- build-2025
17+
zone_pivot_groups: models-sold-directly-by-azure
18+
19+
#CustomerIntent: As a developer or AI practitioner, I want to explore and understand Azure AI Foundry Models sold directly by Azure, including Azure OpenAI models and selected partner models, along with their capabilities and regional availability, so that I can choose the right model for my AI application.
20+
---
21+
22+
# Foundry Models sold directly by Azure
23+
24+
This article lists a selection of Azure AI Foundry Models sold directly by Azure along with their capabilities, [deployment types, and regions of availability](deployment-types.md), excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated).
25+
Models sold directly by Azure include all Azure OpenAI models and specific, selected models from top providers.
26+
27+
[!INCLUDE [models-list-introduction](../includes/models-list-introduction.md)]
28+
29+
To learn more about attributes of Foundry Models sold directly by Azure, see [Explore Azure AI Foundry Models](../../concepts/foundry-models-overview.md#models-sold-directly-by-azure).
30+
31+
> [!NOTE]
32+
> For a list of models from partners and community, see [Foundry Models from partners and community](models-from-partners.md).
33+
34+
::: zone pivot="azure-openai"
35+
36+
[!INCLUDE [models-azure-direct-openai](../../openai/includes/models-azure-direct-openai.md)]
37+
38+
::: zone-end
39+
40+
41+
::: zone pivot="azure-direct-others"
42+
43+
[!INCLUDE [models-azure-direct-others](../includes/models-azure-direct-others.md)]
44+
45+
::: zone-end
46+
47+
Lines changed: 85 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,85 @@
1+
---
2+
title: Other Foundry Models sold directly by Azure
3+
manager: nitinme
4+
ms.service: azure-ai-foundry
5+
ms.subservice: azure-ai-foundry-model-inference
6+
ms.topic: include
7+
ms.date: 09/05/2025
8+
ms.author: mopeakande
9+
author: msakande
10+
---
11+
12+
## DeepSeek models sold directly by Azure
13+
14+
The DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks by using a step-by-step training process, such as language, scientific reasoning, and coding tasks.
15+
16+
| Model | Type | Capabilities | Deployment type (region availability) | Project type |
17+
| ------ | ---- | ------------ | ------------------------------------- | ------------ |
18+
| [DeepSeek-R1-0528](https://ai.azure.com/explore/models/deepseek-r1-0528/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | - Global standard (all regions) <br> - Global provisioned (all regions)| Foundry, Hub-based |
19+
| [DeepSeek-V3-0324](https://ai.azure.com/explore/models/deepseek-v3-0324/version/1/registry/azureml-deepseek) | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON | - Global standard (all regions) <br> - Global provisioned (all regions) | Foundry, Hub-based |
20+
| [DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | - Global standard (all regions) <br> - Global provisioned (all regions) | Foundry, Hub-based |
21+
22+
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=DeepSeek).
23+
24+
## Meta models sold directly by Azure
25+
26+
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range in scale to include:
27+
28+
- Small language models (SLMs) like 1B and 3B Base and Instruct models for on-device and edge inferencing
29+
- Mid-size large language models (LLMs) like 7B, 8B, and 70B Base and Instruct models
30+
- High-performance models like Meta Llama 3.1-405B Instruct for synthetic data generation and distillation use cases.
31+
32+
| Model | Type | Capabilities | Deployment type (region availability) | Project type |
33+
| ------ | ---- | ------------ | ------------------------------------- | ------------ |
34+
| [Llama-4-Maverick-17B-128E-Instruct-FP8](https://ai.azure.com/explore/models/Llama-4-Maverick-17B-128E-Instruct-FP8/version/1/registry/azureml-meta) | chat-completion | - **Input:** text and images (1M tokens) <br /> - **Output:** text (1M tokens) <br /> - **Languages:** `ar`, `en`, `fr`, `de`, `hi`, `id`, `it`, `pt`, `es`, `tl`, `th`, and `vi` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | - Global standard (all regions) | Foundry, Hub-based |
35+
| [Llama-3.3-70B-Instruct](https://ai.azure.com/explore/models/Llama-3.3-70B-Instruct/version/4/registry/azureml-meta) | chat-completion | - **Input:** text (128,000 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Languages:** `en`, `de`, `fr`, `it`, `pt`, `hi`, `es`, and `th` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | - Global standard (all regions) | Foundry, Hub-based |
36+
37+
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Meta). You can also find several Meta models available [from partners and community](../concepts/models-from-partners.md#meta).
38+
39+
## Microsoft models sold directly by Azure
40+
41+
Microsoft models include various model groups such as MAI models, Phi models, healthcare AI models, and more. To see all the available Microsoft models, view [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi).
42+
43+
| Model | Type | Capabilities | Deployment type (region availability) | Project type |
44+
| ------ | ---- | ------------ | ------------------------------------- | ------------ |
45+
| [MAI-DS-R1](https://ai.azure.com/explore/models/MAI-DS-R1/version/1/registry/azureml) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. |- Global standard (all regions) | Foundry, Hub-based |
46+
47+
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Microsoft). You can also find several Microsoft models available [from partners and community](../concepts/models-from-partners.md#microsoft).
48+
49+
## Mistral models sold directly by Azure
50+
51+
| Model | Type | Capabilities | Deployment type (region availability) | Project type |
52+
| ------ | ---- | ------------ | ------------------------------------- | ------------ |
53+
| [mistral-document-ai-2505](https://ai.azure.com/explore/models/mistral-document-ai-2505/version/1/registry/azureml-mistral) | Image-to-Text | - **Input:** image or PDF pages (30 pages, max 30MB PDF file) <br /> - **Output:** text <br /> - **Languages:** en <br /> - **Tool calling:** no <br /> - **Response formats:** Text, JSON, Markdown |- Global standard (all regions) <br> - Data zone standard (US) | Foundry |
54+
55+
See [the Mistral model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=Mistral+AI). You can also find several Mistral models available [from partners and community](../concepts/models-from-partners.md#mistral-ai).
56+
57+
58+
## xAI models sold directly by Azure
59+
60+
xAI's Grok models in Azure AI Foundry Models include a diverse set of models designed to excel in various enterprise domains with different capabilities and price points, including:
61+
62+
- Grok 3, a non-reasoning model pretrained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is adept at drawing connections across domains and languages.
63+
64+
- Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
65+
66+
- Grok Code Fast 1, a fast and efficient reasoning model designed for use in agentic coding applications. It was pre-trained on a coding-focused data mixture, then post-trained on demonstrations of various coding tasks and tool use as well as demonstrations of correct refusal behaviors based on xAI's safety policy. Learn more about Grok Code Fast 1's capabilities, risks, and limitations, in the model card [here](https://ai.azure.com/explore/models/grok-code-fast-1/version/1/registry/azureml-xa).
67+
68+
| Model | Type | Capabilities | Deployment type (region availability) | Project type |
69+
| ------ | ---- | ------------ | ------------------------------------- | ------------ |
70+
| [grok-code-fast-1](https://ai.azure.com/explore/models/grok-code-fast-1/version/1/registry/azureml-xa) | chat-completion | - **Input:** text (256,000 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Languages:** `en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |- Global standard (all regions) | Foundry, Hub-based |
71+
| [grok-3](https://ai.azure.com/explore/models/grok-3/version/1/registry/azureml-xai) | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:** `en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |- Global standard (all regions) <br> - Data zone standard (US) | Foundry, Hub-based |
72+
| [grok-3-mini](https://ai.azure.com/explore/models/grok-3-mini/version/1/registry/azureml-xai) | chat-completion | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:** `en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text | - Global standard (all regions) <br> - Data zone standard (US) | Foundry, Hub-based |
73+
74+
See [the xAI model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=xAI).
75+
76+
77+
[!INCLUDE [models-open-and-custom](models-open-custom.md)]
78+
79+
80+
## Related content
81+
82+
- [Deployment overview for Azure AI Foundry Models](../../concepts/deployments-overview.md)
83+
- [Add and configure models to Azure AI Foundry Models](../how-to/create-model-deployments.md)
84+
- [Deployment types in Azure AI Foundry Models](../concepts/deployment-types.md)
85+
- [Serverless API inference examples for Foundry Models](../../concepts/models-inference-examples.md)
Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,12 @@
1+
---
2+
title: Introduction for list of Foundry Models
3+
manager: nitinme
4+
ms.service: azure-ai-foundry
5+
ms.subservice: azure-ai-foundry-model-inference
6+
ms.topic: include
7+
ms.date: 09/05/2025
8+
ms.author: mopeakande
9+
author: msakande
10+
---
11+
12+
Depending on the [kind of project](../../what-is-azure-ai-foundry.md#work-in-an-azure-ai-foundry-project) you use in Azure AI Foundry, you see a different selection of models. Specifically, if you use a Foundry project built on an Azure AI Foundry resource, you see the models that are available for standard deployment to a Foundry resource. Alternatively, if you use a hub-based project hosted by an Azure AI Foundry hub, you see models that are available for deployment to managed compute and serverless APIs. These model selections often overlap because many models support multiple [deployment options](../../concepts/deployments-overview.md).
Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,16 @@
1+
---
2+
title: Open and custom models
3+
manager: nitinme
4+
ms.service: azure-ai-foundry
5+
ms.subservice: azure-ai-foundry-model-inference
6+
ms.topic: include
7+
ms.date: 09/05/2025
8+
ms.author: mopeakande
9+
author: msakande
10+
---
11+
12+
## Open and custom models
13+
14+
The model catalog offers a larger selection of models from a wider range of providers. For these models, you can't use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs. Instead, to deploy these models, you might need to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
15+
16+
Furthermore, these models can be open-access or IP protected. In both cases, you have to deploy them in managed compute offerings in Azure AI Foundry. To get started, see [How-to: Deploy to Managed compute](../../how-to/deploy-models-managed.md).

0 commit comments

Comments
 (0)