-|[Llama-4-Scout-17B-16E-Instruct](https://aka.ms/aifoundry/landing/llama-4-scout-17b-16e-instruct)|[chat-completion](../foundry-models/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| - **Input:** text and image (128,000 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
+|[Llama-4-Scout-17B-16E-Instruct](https://aka.ms/aifoundry/landing/llama-4-scout-17b-16e-instruct)|[chat-completion](../how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| - **Input:** text and image (128,000 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |

 See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=meta). There are also several Meta models available as [models sold directly by Azure](#meta-models-sold-directly-by-azure).

@@ -173,8 +173,8 @@ Microsoft models include various model groups such as MAI models, Phi models, he
-|[Phi-4-reasoning](https://aka.ms/azureai/landing/Phi-4-reasoning)|[chat-completion with reasoning content](../foundry-models/how-to/use-chat-reasoning.md?context=/azure/ai-foundry/context/context)| - **Input:** text (32,768 tokens) <br /> - **Output:** text (32,768 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
-|[Phi-4-mini-reasoning](https://aka.ms/azureai/landing/Phi-4-mini-reasoning)|[chat-completion with reasoning content](../foundry-models/how-to/use-chat-reasoning.md?context=/azure/ai-foundry/context/context)| - **Input:** text (128,000 tokens) <br /> - **Output:** text (128,000 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
+|[Phi-4-reasoning](https://aka.ms/azureai/landing/Phi-4-reasoning)|[chat-completion with reasoning content](../how-to/use-chat-reasoning.md?context=/azure/ai-foundry/context/context)| - **Input:** text (32,768 tokens) <br /> - **Output:** text (32,768 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |
+|[Phi-4-mini-reasoning](https://aka.ms/azureai/landing/Phi-4-mini-reasoning)|[chat-completion with reasoning content](../how-to/use-chat-reasoning.md?context=/azure/ai-foundry/context/context)| - **Input:** text (128,000 tokens) <br /> - **Output:** text (128,000 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text | Foundry, Hub-based |

 See [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi). There are also several Microsoft models available as [models sold directly by Azure](#microsoft-models-sold-directly-by-azure).

@@ -207,13 +207,13 @@ The Jamba family models are AI21's production-grade Mamba-based large language m
 ## Other Foundry Models available for serverless API deployment

-This section lists a selection of models available only through Serverless API deployment. For more information on these models, visit the [Azure AI Foundry Models Serverless API Inference Examples](models-inference-examples.md) page.
+This section lists a selection of models available only through Serverless API deployment. For more information on these models, visit the [Azure AI Foundry Models Serverless API Inference Examples](../concepts/models-inference-examples.md) page.

 | Model | Type | Offering | Capabilities | API Reference |
@@ -223,7 +223,7 @@ This section lists a selection of models available only through Serverless API d
 |[Stable Image Core](https://ai.azure.com/explore/models/Stable-Image-Core/version/1/registry/azureml-stabilityai)| Image generation | Partners and Community | - **Input:** text (1000 tokens) <br /> - **Output:** 1 Image <br /> - **Tool calling:** No <br /> - **Response formats:** Image (PNG and JPG) |
 |[Stable Image Ultra](https://ai.azure.com/explore/models/Stable-Image-Ultra/version/1/registry/azureml-stabilityai)| Image generation | Partners and Community | - **Input:** text (1000 tokens) <br /> - **Output:** 1 Image <br /> - **Tool calling:** No <br /> - **Response formats:** Image (PNG and JPG) |
 |[Mistral-OCR-2503](https://aka.ms/aistudio/landing/mistral-ocr-2503)|[image to text](../../how-to/use-image-models.md)| Models sold directly by Azure | - **Input:** image or PDF pages (1,000 pages, max 50MB PDF file) <br> - **Output:** text <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON, Markdown |
-|[Mistral-medium-2505](https://aka.ms/aistudio/landing/mistral-medium-2505)|[chat-completion](../foundry-models/how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| Models sold directly by Azure | - **Input:** text (128,000 tokens), image <br /> - **Output:** text (128,000 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON |
+|[Mistral-medium-2505](https://aka.ms/aistudio/landing/mistral-medium-2505)|[chat-completion](../how-to/use-chat-completions.md?context=/azure/ai-foundry/context/context)| Models sold directly by Azure | - **Input:** text (128,000 tokens), image <br /> - **Output:** text (128,000 tokens) <br /> - **Tool calling:** No <br /> - **Response formats:** Text, JSON |
 |[Cohere-rerank-v3.5](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)| rerank <br> text classification | Partners and Community ||[Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank)|
 |[Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | Partners and Community ||[Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank)|
 |[Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | Partners and Community ||[Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank)|