Commit e73349d

rename featured models file, redirect, update TOC, etc
1 parent 4ece542 commit e73349d

4 files changed: +28 -21 lines changed

articles/ai-foundry/.openpublishing.redirection.ai-studio.json

Lines changed: 5 additions & 0 deletions
@@ -110,6 +110,11 @@
 "redirect_url": "/azure/ai-foundry/how-to/data-add",
 "redirect_document_id": true
 },
+{
+"source_path_from_root": "/articles/ai-foundry/concepts/models-featured.md",
+"redirect_url": "/azure/ai-foundry/concepts/models-inference-examples",
+"redirect_document_id": false
+},
 {
 "source_path_from_root": "/articles/ai-foundry/model-inference/reference/api-version-updates.md",
 "redirect_url": "/rest/api/aifoundry/modelinference",

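As a rough illustration of the entry shape added in the hunk above, the following Python sketch checks one redirection entry for the three keys that appear in this diff. The function name `check_redirect_entry` and the specific rules (leading `/` on the source path, non-empty redirect URL, boolean document ID flag) are assumptions inferred from the visible entries only; they are not the publishing system's actual validation.

```python
import json

# Keys observed in the entries shown in this diff (assumed to be required).
REQUIRED_KEYS = {"source_path_from_root", "redirect_url", "redirect_document_id"}


def check_redirect_entry(entry: dict) -> list:
    """Return a list of problems found in one redirection entry (empty if none)."""
    problems = []

    missing = REQUIRED_KEYS - set(entry)
    if missing:
        problems.append(f"missing keys: {sorted(missing)}")

    # Assumption: source paths are repo-rooted, as in the entries above.
    src = entry.get("source_path_from_root", "")
    if not (isinstance(src, str) and src.startswith("/")):
        problems.append("source_path_from_root should be a string starting with '/'")

    url = entry.get("redirect_url", "")
    if not (isinstance(url, str) and url):
        problems.append("redirect_url should be a non-empty string")

    if not isinstance(entry.get("redirect_document_id"), bool):
        problems.append("redirect_document_id should be true or false")

    return problems


# The entry added by this commit, copied from the hunk above.
new_entry = json.loads("""
{
  "source_path_from_root": "/articles/ai-foundry/concepts/models-featured.md",
  "redirect_url": "/azure/ai-foundry/concepts/models-inference-examples",
  "redirect_document_id": false
}
""")

print(check_redirect_entry(new_entry))
```

An empty list means the entry passed these illustrative checks.
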
articles/ai-foundry/concepts/models-featured.md renamed to articles/ai-foundry/concepts/models-inference-examples.md

Lines changed: 4 additions & 4 deletions
@@ -1,13 +1,13 @@
 ---
-title: Azure AI Foundry Models available for serverless API deployment
+title: Serverless API inference examples for Foundry Models
 titleSuffix: Azure AI Foundry
-description: Explore various models available for serverless API deployment in Azure AI Foundry.
+description: Inference examples for Foundry Models that support deployment to serverless APIs in Azure AI Foundry.
 author: msakande
 ms.author: mopeakande
 manager: scottpolly
 reviewer: santiagxf
 ms.reviewer: fasantia
-ms.date: 05/19/2025
+ms.date: 07/10/2025
 ms.service: azure-ai-foundry
 ms.topic: concept-article
 ms.custom:
@@ -16,7 +16,7 @@ ms.custom:
 - build-2025
 ---

-# Azure AI Foundry Models Serverless API Inference Examples
+# Serverless API inference examples for Foundry Models

 The Azure AI model catalog offers a large selection of Azure AI Foundry Models from a wide range of providers. You have various options for deploying models from the model catalog. This article lists Azure AI Foundry Models that can be deployed via serverless API deployment. For some of these models, you can also host them on your infrastructure for deployment via managed compute.

articles/ai-foundry/foundry-models/concepts/models.md

Lines changed: 17 additions & 15 deletions
@@ -7,7 +7,7 @@ ms.author: mopeakande
 manager: scottpolly
 reviewer: santiagxf
 ms.reviewer: fasantia
-ms.date: 07/03/2025
+ms.date: 07/10/2025
 ms.service: azure-ai-model-inference
 ms.topic: how-to
 ms.custom:
@@ -21,15 +21,15 @@ ms.custom:

 Azure AI Foundry Models gives you access to flagship models in Azure AI Foundry to consume them as APIs with flexible deployment options.

-This article lists a selection of current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated). Many of the models listed can also be deployed to serverless APIs, with a few exceptions, which are indicated in the model lists.
+This article lists a selection of current model offerings and their capabilities, excluding [deprecated and legacy models](../../concepts/model-lifecycle-retirement.md#deprecated). The models listed support [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), and many of them also support deployment to serverless APIs (those that don't support serverless deployment are indicated in the model lists).

 Foundry models in the model catalog belong to two main categories:
 * [Models sold directly by Azure](#models-sold-directly-by-azure)
 * [Models from Partners and Community](#models-from-partners-and-community)

-Follow this link for more information on [Models sold directly by Azure](foundry-models-overview.md#models-sold-directly-by-azure). Follow this link for more information on [Models from Partners and Community](foundry-models-overview.md#models-from-partners-and-community).
+To learn more about these two categories, see [Models Sold Directly by Azure](../../concepts/foundry-models-overview.md#models-sold-directly-by-azure) and [Models from Partners and Community](../../concepts/foundry-models-overview.md#models-from-partners-and-community).

-## Azure OpenAI
+### Azure OpenAI

 Azure OpenAI in Azure AI Foundry Models offers a diverse set of models with different capabilities and price points. Learn more details at [Azure OpenAI Model availability](../../../ai-services/openai/concepts/models.md). These models include:

@@ -111,7 +111,7 @@ Meta Llama models and tools are a collection of pretrained and fine-tuned genera

 See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=meta). There are also several Meta models available as [Models Sold Directly by Azure](../../concepts/foundry-models-overview.md#models-sold-directly-by-azure).

-## Microsoft
+### Microsoft

 Microsoft models include various model groups such as MAI models, Phi models, healthcare AI models, and more. To see all the available Microsoft models, view [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi).

@@ -135,7 +135,7 @@ Microsoft models include various model groups such as MAI models, Phi models, he

 See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi).

-## Mistral AI
+### Mistral AI

 Mistral AI offers two categories of models: premium models including Mistral Large and Mistral Small and open models including Mistral Nemo.

@@ -150,7 +150,7 @@ Mistral AI offers two categories of models: premium models including Mistral Lar

 See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=mistral).

-## NTT Data
+### NTT Data

 **tsuzumi** is an autoregressive language optimized transformer. The tuned versions use supervised fine-tuning (SFT). tsuzumi handles both Japanese and English language with high efficiency.

@@ -172,7 +172,7 @@ DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks

 See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=deepseek).

-## xAI
+### xAI

 xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.

@@ -183,13 +183,8 @@ xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise

 <sup>1</sup> These models do not support deployment to a serverless API.

-## Open and custom models
-
-The model catalog offers a larger selection of models, from a bigger range of providers. For these models, you cannot use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs; rather, to deploy these models, you might be required to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.

-Furthermore, these models can be open-access or IP protected. In both cases, you have to deploy them in Managed Compute offerings in Azure AI Foundry. To get started, see [How-to: Deploy to Managed compute](../../how-to/deploy-models-managed.md).
-
-## Other Models available for Serverless API deployment
+## Other Foundry Models available for serverless API deployment

 This section lists a selection of models available only through Serverless API deployment. For more information on these models, visit the [Azure AI Foundry Models Serverless API Inference Examples](models-featured.md) page.

@@ -206,7 +201,14 @@ This section lists a selection of models available only through Serverless API d
 | [Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | Partners and Community | | [Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank) |
 | [Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | Partners and Community | | [Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank) |

+## Open and custom models
+
+The model catalog offers a larger selection of models, from a bigger range of providers. For these models, you cannot use the option for [standard deployment in Azure AI Foundry resources](../../concepts/deployments-overview.md#standard-deployment-in-azure-ai-foundry-resources), where models are provided as APIs; rather, to deploy these models, you might be required to host them on your infrastructure, create an AI hub, and provide the underlying compute quota to host the models.
+
+Furthermore, these models can be open-access or IP protected. In both cases, you have to deploy them in Managed Compute offerings in Azure AI Foundry. To get started, see [How-to: Deploy to Managed compute](../../how-to/deploy-models-managed.md).
+

 ## Related content

-- Get started today and [deploy your first model in Azure AI Foundry Models](../../model-inference/how-to/create-model-deployments.md)
+- [Deployment overview for Azure AI Foundry Models](../../concepts/deployments-overview.md)
+- [Add and configure models to Azure AI Foundry Models](../how-to/create-model-deployments.md)

articles/ai-foundry/toc.yml

Lines changed: 2 additions & 2 deletions
@@ -112,8 +112,8 @@ items:
 href: how-to/deploy-models-serverless-availability.md
 - name: Guardrails & controls for serverless API
 href: concepts/model-catalog-content-safety.md
-- name: Models available for serverless API
-href: concepts/models-featured.md
+- name: Inference examples for serverless deployments
+href: concepts/models-inference-examples.md
 - name: Gretel Navigator model
 href: how-to/deploy-models-gretel-navigator.md
 - name: Mistral-7B and Mixtral models
