Commit b35a690

clean up inference examples article
1 parent e73349d commit b35a690

1 file changed, 15 additions and 15 deletions

articles/ai-foundry/concepts/models-inference-examples.md

Lines changed: 15 additions & 15 deletions
@@ -18,7 +18,7 @@ ms.custom:
 
 # Serverless API inference examples for Foundry Models
 
-The Azure AI model catalog offers a large selection of Azure AI Foundry Models from a wide range of providers. You have various options for deploying models from the model catalog. This article lists Azure AI Foundry Models that can be deployed via serverless API deployment. For some of these models, you can also host them on your infrastructure for deployment via managed compute.
+The Azure AI model catalog offers a large selection of Azure AI Foundry Models from a wide range of providers. You have various options for deploying models from the model catalog. This article lists inference examples for serverless API deployments.
 
 [!INCLUDE [models-preview](../includes/models-preview.md)]
 
@@ -33,7 +33,7 @@ The Cohere family of models includes various models optimized for different use
 
 #### Inference examples: Cohere command and embed
 
-For more examples of how to use Cohere models, see the following examples:
+The following table provides links to examples of how to use Cohere models.
 
 | Description | Language | Sample |
 |-------------------------------------------|-------------------|-----------------------------------------------------------------|
@@ -59,13 +59,7 @@ For more examples of how to use Cohere models, see the following examples:
 
 ### Cohere rerank
 
-The following table lists the Cohere rerank models. To perform inferencing with these rerank models, you're required to use Cohere's custom rerank APIs that are listed in the table.
-
-| Model | Type | Inference API |
-| ------ | ---- | --- |
-| [Cohere-rerank-v3.5](https://ai.azure.com/explore/models/Cohere-rerank-v3.5/version/1/registry/azureml-cohere) | rerank <br> text classification | [Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) |
-| [Cohere-rerank-v3-english](https://ai.azure.com/explore/models/Cohere-rerank-v3-english/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | [Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank) |
-| [Cohere-rerank-v3-multilingual](https://ai.azure.com/explore/models/Cohere-rerank-v3-multilingual/version/1/registry/azureml-cohere) <br> (deprecated) | rerank <br> text classification | [Cohere's v2/rerank API](https://docs.cohere.com/v2/reference/rerank) <br> [Cohere's v1/rerank API](https://docs.cohere.com/v1/reference/rerank) |
+To perform inferencing with Cohere rerank models, you're required to use Cohere's custom rerank APIs. For more information, see the table for [Other Foundry Models available for serverless API deployment](../foundry-models/concepts/models.md#other-foundry-models-available-for-serverless-api-deployment).
 
 
 #### Pricing for Cohere rerank models
@@ -76,7 +70,8 @@ See the [Cohere model collection in Azure AI Foundry portal](https://ai.azure.co
 
 ## Core42
 
-For more examples of how to use Jais models, see the following examples:
+The following table provides links to examples of how to use Jais models.
+
 
 | Description | Language | Sample |
 |-------------------------------------------|-------------------|-----------------------------------------------------------------|
@@ -89,7 +84,8 @@ For more examples of how to use Jais models, see the following examples:
 
 DeepSeek family of models includes DeepSeek-R1, which excels at reasoning tasks using a step-by-step training process, such as language, scientific reasoning, and coding tasks, DeepSeek-V3-0324, a Mixture-of-Experts (MoE) language model, and more.
 
-For more examples of how to use DeepSeek models, see the following examples:
+The following table provides links to examples of how to use DeepSeek models.
+
 
 | Description | Language | Sample |
 |-------------------------------------------|-------------------|-----------------------------------------------------------------|
@@ -109,7 +105,8 @@ Meta Llama models and tools are a collection of pretrained and fine-tuned genera
 - High-performant natively multimodal models, Llama 4 Scout and Llama 4 Maverick, leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.
 
 
-For more examples of how to use Meta Llama models, see the following examples:
+The following table provides links to examples of how to use Meta Llama models.
+
 
 | Description | Language | Sample |
 |-------------------------------------------|-------------------|------------------------------------------------------------------- |
@@ -126,7 +123,8 @@ For more examples of how to use Meta Llama models, see the following examples:
 
 Microsoft models include various model groups such as MAI models, Phi models, healthcare AI models, and more. To see all the available Microsoft models, view [the Microsoft model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=phi).
 
-For more examples of how to use Microsoft models, see the following examples:
+The following table provides links to examples of how to use Microsoft models.
+
 
 | Description | Language | Sample |
 |-------------------------------------------|-------------------|-----------------------------------------------------------------|
@@ -148,7 +146,8 @@ Mistral AI offers two categories of models, namely:
 - _Open models_: These include Mistral-small-2503, Codestral, and Mistral Nemo (that are available as serverless APIs with pay-as-you-go token-based billing), and [Mixtral-8x7B-Instruct-v01, Mixtral-8x7B-v01, Mistral-7B-Instruct-v01, and Mistral-7B-v01](../how-to/deploy-models-mistral-open.md)(that are available to download and run on self-hosted managed endpoints).
 
 
-For more examples of how to use Mistral models, see the following examples and tutorials:
+The following table provides links to examples of how to use Mistral models.
+
 
 | Description | Language | Sample |
 |-------------------------------------------|-------------------|-----------------------------------------------------------------|
@@ -218,7 +217,8 @@ For examples of how to use Stability AI models, see the following examples:
 ## Related content
 
 - [Deploy models as serverless API deployments](../how-to/deploy-models-serverless.md)
-- [Model catalog and collections in Azure AI Foundry portal](../how-to/model-catalog-overview.md)
+- [Explore Azure AI Foundry Models](foundry-models-overview.md)
+- [Foundry Models and their capabilities](../foundry-models/concepts/models.md)
 - [Region availability for models in serverless API deployments](../how-to/deploy-models-serverless-availability.md)
 - [Content safety for Models Sold Directly by Azure ](model-catalog-content-safety.md)
 