articles/ai-foundry/foundry-models/concepts/models.md
Lines changed: 30 additions & 17 deletions
@@ -21,7 +21,16 @@ ms.custom:
21
21
22
22
Azure AI Foundry Models gives you access to flagship models in Azure AI Foundry, which you can consume as APIs with flexible deployment options.
23
23
24
-
This article lists the current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated)
24
+
This article lists a selection of current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated). It also indicates whether each model can be deployed through a serverless API. For the full list of supported regions, see [region availability](../how-to/deploy-models-serverless-availability.md).
25
+
26
+
27
+
Our catalog is organized into two main categories:
28
+
* [Models sold directly by Azure](#models-sold-directly-by-azure)
29
+
* [Models from Partners and Community](#models-from-partners-and-community)
30
+
31
+
For more information, see [Models sold directly by Azure](foundry-models-overview.md#models-sold-directly-by-azure) and [Models from Partners and Community](foundry-models-overview.md#models-from-partners-and-community).
32
+
33
+
25
34
26
35
## Azure OpenAI
27
36
@@ -47,7 +56,9 @@ Azure OpenAI in Azure AI Foundry Models offers a diverse set of models with diff
47
56
48
57
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=aoai).
49
58
50
-
## AI21 Labs
59
+
## Models from Partners and Community
60
+
61
+
### AI21 Labs
51
62
52
63
The Jamba family models are AI21's production-grade Mamba-based large language models (LLMs), which use AI21's hybrid Mamba-Transformer architecture. They are instruction-tuned versions of AI21's hybrid structured state space model (SSM) transformer Jamba model. The Jamba family models are built for reliable commercial use with respect to quality and performance.
53
64
@@ -58,7 +69,7 @@ The Jamba family models are AI21's production-grade Mamba-based large language m
58
69
59
70
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=ai21).
60
71
61
-
## Cohere
72
+
### Cohere
62
73
63
74
The Cohere family of models includes various models optimized for different use cases, including chat completions and embeddings. Cohere models are optimized for various use cases that include reasoning, summarization, and question answering.
64
75
@@ -73,7 +84,7 @@ The Cohere family of models includes various models optimized for different use
73
84
74
85
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=cohere).
75
86
76
-
## Core42
87
+
### Core42
77
88
78
89
Core42 includes autoregressive bi-lingual LLMs for Arabic & English with state-of-the-art capabilities in Arabic.
79
90
@@ -83,19 +94,7 @@ Core42 includes autoregressive bi-lingual LLMs for Arabic & English with state-o
83
94
84
95
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=core42).
85
96
86
-
## DeepSeek
87
-
88
-
The DeepSeek family of models includes DeepSeek-R1, which is trained with a step-by-step process to excel at reasoning tasks such as language, scientific reasoning, and coding.
89
-
90
-
| Model | Type | Offering | Capabilities | Serverless API Availability |
| --- | --- | --- | --- | --- |
| [DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | Models sold directly by Azure | - **Input:** text (163,840 tokens) <br /> - **Output:** text (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Yes |
95
-
96
-
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
97
-
98
-
## Meta
97
+
### Meta
99
98
100
99
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range in scale to include:
101
100
@@ -158,6 +157,20 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
|[tsuzumi-7b](https://ai.azure.com/explore/models/Tsuzumi-7b/version/1/registry/azureml-nttdata)| chat-completion | Partners and Community | - **Input:** text (8,192 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Languages:**`en` and `jp` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Yes |
160
159
160
+
## Models sold directly by Azure
161
+
162
+
### DeepSeek
163
+
164
+
The DeepSeek family of models includes DeepSeek-R1, which is trained with a step-by-step process to excel at reasoning tasks such as language, scientific reasoning, and coding.
165
+
166
+
| Model | Type | Offering | Capabilities | Serverless API Availability |
| --- | --- | --- | --- | --- |
| [DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | Models sold directly by Azure | - **Input:** text (163,840 tokens) <br /> - **Output:** text (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Yes |
171
+
172
+
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
173
+
161
174
## xAI
162
175
163
176
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.