You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/concepts/models-inference-examples.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,13 +1,13 @@
1
1
---
2
-
title: Azure AI Foundry Models available for serverless API deployment
2
+
title: Serverless API inference examples for Foundry Models
3
3
titleSuffix: Azure AI Foundry
4
-
description: Explore various models available for serverless API deployment in Azure AI Foundry.
4
+
description: Inference examples for Foundry Models that support deployment to serverless APIs in Azure AI Foundry.
5
5
author: msakande
6
6
ms.author: mopeakande
7
7
manager: scottpolly
8
8
reviewer: santiagxf
9
9
ms.reviewer: fasantia
10
-
ms.date: 05/19/2025
10
+
ms.date: 07/10/2025
11
11
ms.service: azure-ai-foundry
12
12
ms.topic: concept-article
13
13
ms.custom:
@@ -16,7 +16,7 @@ ms.custom:
16
16
- build-2025
17
17
---
18
18
19
-
# Azure AI Foundry Models Serverless API Inference Examples
19
+
# Serverless API inference examples for Foundry Models
20
20
21
21
The Azure AI model catalog offers a large selection of Azure AI Foundry Models from a wide range of providers. You have various options for deploying models from the model catalog. This article lists Azure AI Foundry Models that can be deployed via serverless API deployment. For some of these models, you can also host them on your infrastructure for deployment via managed compute.
Copy file name to clipboardExpand all lines: articles/ai-foundry/foundry-models/concepts/models.md
+17-15Lines changed: 17 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ ms.author: mopeakande
7
7
manager: scottpolly
8
8
reviewer: santiagxf
9
9
ms.reviewer: fasantia
10
-
ms.date: 07/03/2025
10
+
ms.date: 07/10/2025
11
11
ms.service: azure-ai-model-inference
12
12
ms.topic: how-to
13
13
ms.custom:
@@ -21,15 +21,15 @@ ms.custom:
21
21
22
22
Azure AI Foundry Models gives you access to flagship models in Azure AI Foundry to consume them as APIs with flexible deployment options.
23
23
24
-
This article lists a selection of current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated). Many of the models listed can also be deployed to serverless APIs, with a few exceptions, which are indicated in the model lists.
24
+
This article lists a selection of current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated). The models listed support [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), and many of them also support deployment to serverless APIs (those that don't support serverless deployment are indicated in the model lists).
25
25
26
26
Foundry models in the model catalog belong to two main categories:
27
27
*[Models sold directly by Azure](#models-sold-directly-by-azure)
28
28
*[Models from Partners and Community](#models-from-partners-and-community)
29
29
30
-
Follow this link for more information on [Models sold directly by Azure](foundry-models-overview.md#models-sold-directly-by-azure). Follow this link for more information on [Models from Partners and Community](foundry-models-overview.md#models-from-partners-and-community).
30
+
To learn more about these two categories, see [Models Sold Directly by Azure](../../concepts/foundry-models-overview.md#models-sold-directly-by-azure) and [Models from Partners and Community](../../concepts/foundry-models-overview.md#models-from-partners-and-community).
31
31
32
-
## Azure OpenAI
32
+
###Azure OpenAI
33
33
34
34
Azure OpenAI in Azure AI Foundry Models offers a diverse set of models with different capabilities and price points. Learn more details at [Azure OpenAI Model availability](../../../ai-services/openai/concepts/models.md). These models include:
35
35
@@ -111,7 +111,7 @@ Meta Llama models and tools are a collection of pretrained and fine-tuned genera
111
111
112
112
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=meta). There are also several Meta models available as [Models Sold Directly by Azure](../../concepts/foundry-models-overview.md#models-sold-directly-by-azure).
113
113
114
-
## Microsoft
114
+
###Microsoft
115
115
116
116
Microsoft models include various model groups such as MAI models, Phi models, healthcare AI models, and more. To see all the available Microsoft models, view [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi).
117
117
@@ -135,7 +135,7 @@ Microsoft models include various model groups such as MAI models, Phi models, he
135
135
136
136
See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi).
137
137
138
-
## Mistral AI
138
+
###Mistral AI
139
139
140
140
Mistral AI offers two categories of models: premium models including Mistral Large and Mistral Small and open models including Mistral Nemo.
141
141
@@ -150,7 +150,7 @@ Mistral AI offers two categories of models: premium models including Mistral Lar
150
150
151
151
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=mistral).
152
152
153
-
## NTT Data
153
+
###NTT Data
154
154
155
155
**tsuzumi** is an autoregressive language optimized transformer. The tuned versions use supervised fine-tuning (SFT). tsuzumi handles both Japanese and English language with high efficiency.
156
156
@@ -172,7 +172,7 @@ DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks
172
172
173
173
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).
174
174
175
-
## xAI
175
+
###xAI
176
176
177
177
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
178
178
@@ -183,13 +183,8 @@ xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise
183
183
184
184
<sup>1</sup> These models do not support deployment to a serverless API.
185
185
186
-
## Open and custom models
187
-
188
-
The model catalog offers a larger selection of models, from a bigger range of providers. For these models, you cannot use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs; rather, to deploy these models, you might be required to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
189
186
190
-
Furthermore, these models can be open-access or IP protected. In both cases, you have to deploy them in Managed Compute offerings in Azure AI Foundry. To get started, see [How-to: Deploy to Managed compute](../../how-to/deploy-models-managed.md).
191
-
192
-
## Other Models available for Serverless API deployment
187
+
## Other Foundry Models available for serverless API deployment
193
188
194
189
This section lists a selection of models available only through Serverless API deployment. For more information on these models, visit the [Azure AI Foundry Models Serverless API Inference Examples](models-featured.md) page.
195
190
@@ -206,7 +201,14 @@ This section lists a selection of models available only through Serverless API d
206
201
|[Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | Partners and Community ||[Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank)|
207
202
|[Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | Partners and Community ||[Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank)|
208
203
204
+
## Open and custom models
205
+
206
+
The model catalog offers a larger selection of models, from a bigger range of providers. For these models, you cannot use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs; rather, to deploy these models, you might be required to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
207
+
208
+
Furthermore, these models can be open-access or IP protected. In both cases, you have to deploy them in Managed Compute offerings in Azure AI Foundry. To get started, see [How-to: Deploy to Managed compute](../../how-to/deploy-models-managed.md).
209
+
209
210
210
211
## Related content
211
212
212
-
- Get started today and [deploy your first model in Azure AI Foundry Models](../../model-inference/how-to/create-model-deployments.md)
213
+
-[Deployment overview for Azure AI Foundry Models](../../concepts/deployments-overview.md)
214
+
-[Add and configure models to Azure AI Foundry Models](../how-to/create-model-deployments.md)
0 commit comments