Commit 74424c7

Update models.md
1 parent 93a69b5 commit 74424c7

1 file changed

+30
-17
lines changed

articles/ai-foundry/foundry-models/concepts/models.md

Lines changed: 30 additions & 17 deletions
@@ -21,7 +21,16 @@ ms.custom:
2121

2222
Azure AI Foundry Models gives you access to flagship models in Azure AI Foundry to consume them as APIs with flexible deployment options.
2323

24-
This article lists the current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated)
24+
This article lists a selection of current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated). It also indicates whether each model can be deployed through a serverless API. For the full list, see [region availability](../how-to/deploy-models-serverless-availability.md).
25+
26+
27+
Our catalog is organized into two main categories:
28+
* [Models sold directly by Azure](#models-sold-directly-by-azure)
29+
* [Models from Partners and Community](#models-from-partners-and-community)
30+
31+
Follow this link for more information on [Models sold directly by Azure](foundry-models-overview.md#models-sold-directly-by-azure). Follow this link for more information on [Models from Partners and Community](foundry-models-overview.md#models-from-partners-and-community).
32+
33+
2534

2635
## Azure OpenAI
2736

@@ -47,7 +56,9 @@ Azure OpenAI in Azure AI Foundry Models offers a diverse set of models with diff
4756

4857
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=aoai).
4958

50-
## AI21 Labs
59+
## Models from Partners and Community
60+
61+
### AI21 Labs
5162

5263
The Jamba family models are AI21's production-grade Mamba-based large language models (LLMs), which use AI21's hybrid Mamba-Transformer architecture. They are instruction-tuned versions of AI21's hybrid structured state space model (SSM) transformer Jamba model. The Jamba family models are built for reliable commercial use with respect to quality and performance.
5364

@@ -58,7 +69,7 @@ The Jamba family models are AI21's production-grade Mamba-based large language m
5869

5970
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=ai21).
6071

61-
## Cohere
72+
### Cohere
6273

6374
The Cohere family of models includes models for chat completions and embeddings, optimized for use cases such as reasoning, summarization, and question answering.
6475

@@ -73,7 +84,7 @@ The Cohere family of models includes various models optimized for different use
7384

7485
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=cohere).
7586

76-
## Core42
87+
### Core42
7788

7889
Core42 includes autoregressive bilingual LLMs for Arabic and English with state-of-the-art capabilities in Arabic.
7990

@@ -83,19 +94,7 @@ Core42 includes autoregressive bi-lingual LLMs for Arabic & English with state-o
8394

8495
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=core42).
8596

86-
## DeepSeek
87-
88-
The DeepSeek family of models includes DeepSeek-R1, which uses a step-by-step training process to excel at reasoning tasks such as language, scientific reasoning, and coding.
89-
90-
| Model | Type | Offering | Capabilities | Serverless API Availability |
91-
| ------ | ---- | ---- | ------------ | --------------------------- |
92-
| [DeepSeek-R1-0528](https://ai.azure.com/explore/models/deepseek-r1-0528/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | Models sold directly by Azure | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | Yes |
93-
| [DeepSeek-V3-0324](https://ai.azure.com/explore/models/deepseek-v3-0324/version/1/registry/azureml-deepseek) | chat-completion | Models sold directly by Azure | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON | Yes |
94-
| [DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | Models sold directly by Azure | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | Yes |
95-
96-
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
97-
98-
## Meta
97+
### Meta
9998

10099
Meta Llama models and tools are a collection of pretrained and fine-tuned generative AI text and image reasoning models. Meta models range in scale to include:
101100

@@ -158,6 +157,20 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
158157
| ------ | ---- | --- | ------------ | --------------------------- |
159158
| [tsuzumi-7b](https://ai.azure.com/explore/models/Tsuzumi-7b/version/1/registry/azureml-nttdata) | chat-completion | Partners and Community | - **Input:** text (8,192 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Languages:** `en` and `jp` <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Yes |
160159

160+
## Models sold directly by Azure
161+
162+
### DeepSeek
163+
164+
The DeepSeek family of models includes DeepSeek-R1, which uses a step-by-step training process to excel at reasoning tasks such as language, scientific reasoning, and coding.
165+
166+
| Model | Type | Offering | Capabilities | Serverless API Availability |
167+
| ------ | ---- | ---- | ------------ | --------------------------- |
168+
| [DeepSeek-R1-0528](https://ai.azure.com/explore/models/deepseek-r1-0528/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | Models sold directly by Azure | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | Yes |
169+
| [DeepSeek-V3-0324](https://ai.azure.com/explore/models/deepseek-v3-0324/version/1/registry/azureml-deepseek) | chat-completion | Models sold directly by Azure | - **Input:** text (131,072 tokens) <br /> - **Output:** (131,072 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** Yes <br /> - **Response formats:** Text, JSON | Yes |
170+
| [DeepSeek-R1](https://ai.azure.com/explore/models/deepseek-r1/version/1/registry/azureml-deepseek) | chat-completion <br /> [(with reasoning content)](../how-to/use-chat-reasoning.md) | Models sold directly by Azure | - **Input:** text (163,840 tokens) <br /> - **Output:** (163,840 tokens) <br /> - **Languages:** `en` and `zh` <br /> - **Tool calling:** No <br /> - **Response formats:** Text. | Yes |
171+
172+
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
173+
161174
## xAI
162175

163176
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
