articles/ai-foundry/concepts/models-inference-examples.md
Lines changed: 15 additions & 15 deletions
@@ -18,7 +18,7 @@ ms.custom:
# Serverless API inference examples for Foundry Models
-The Azure AI model catalog offers a large selection of Azure AI Foundry Models from a wide range of providers. You have various options for deploying models from the model catalog. This article lists Azure AI Foundry Models that can be deployed via serverless API deployment. For some of these models, you can also host them on your infrastructure for deployment via managed compute.
+The Azure AI model catalog offers a large selection of Azure AI Foundry Models from a wide range of providers. You have various options for deploying models from the model catalog. This article lists inference examples for serverless API deployments.
@@ -59,13 +59,7 @@ For more examples of how to use Cohere models, see the following examples:
### Cohere rerank
-The following table lists the Cohere rerank models. To perform inferencing with these rerank models, you're required to use Cohere's custom rerank APIs that are listed in the table.
-
-| Model | Type | Inference API |
-| ------ | ---- | --- |
-|[Cohere-rerank-v3.5](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere)| rerank <br> text classification |[Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank)|
+To perform inferencing with Cohere rerank models, you're required to use Cohere's custom rerank APIs. For more information, see the table for [Other Foundry Models available for serverless API deployment](../foundry-models/concepts/models.md#other-foundry-models-available-for-serverless-api-deployment).
#### Pricing for Cohere rerank models
@@ -76,7 +70,8 @@ See the [Cohere model collection in Azure AI Foundry portal](https://ai.azure.co
## Core42
-For more examples of how to use Jais models, see the following examples:
+The following table provides links to examples of how to use Jais models.
@@ -89,7 +84,8 @@ For more examples of how to use Jais models, see the following examples:
DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks using a step-by-step training process, such as language, scientific reasoning, and coding tasks, DeepSeek-V3-0324, a Mixture-of-Experts (MoE) language model, and more.
-For more examples of how to use DeepSeek models, see the following examples:
+The following table provides links to examples of how to use DeepSeek models.
@@ -109,7 +105,8 @@ Meta Llama models and tools are a collection of pretrained and fine-tuned genera
- High-performant natively multimodal models, Llama 4 Scout and Llama 4 Maverick, leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
-For more examples of how to use Meta Llama models, see the following examples:
+The following table provides links to examples of how to use Meta Llama models.
@@ -126,7 +123,8 @@ For more examples of how to use Meta Llama models, see the following examples:
Microsoft models include various model groups such as MAI models, Phi models, healthcare AI models, and more. To see all the available Microsoft models, view [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi).
-For more examples of how to use Microsoft models, see the following examples:
+The following table provides links to examples of how to use Microsoft models.
@@ -148,7 +146,8 @@ Mistral AI offers two categories of models, namely:
- _Open models_: These include Mistral-small-2503, Codestral, and Mistral Nemo (that are available as serverless APIs with pay-as-you-go token-based billing), and [Mixtral-8x7B-Instruct-v01, Mixtral-8x7B-v01, Mistral-7B-Instruct-v01, and Mistral-7B-v01](../how-to/deploy-models-mistral-open.md) (that are available to download and run on self-hosted managed endpoints).
-For more examples of how to use Mistral models, see the following examples and tutorials:
+The following table provides links to examples of how to use Mistral models.