Commit c26dc1d

update model name and steps in procedure
1 parent ebe9b7e commit c26dc1d

1 file changed (+14, -15 lines changed)

articles/ai-studio/how-to/deploy-models-cohere-rerank.md

Lines changed: 14 additions & 15 deletions
@@ -24,12 +24,12 @@ In this article, you learn about the Cohere Rerank models, how to use Azure AI F
 
 Cohere offers two Rerank models in [Azure AI Foundry](https://ai.azure.com). These models are available in the model catalog for deployment as serverless APIs:
 
-* Cohere Rerank 3 - English
-* Cohere Rerank 3 - Multilingual
+* Cohere Rerank v3 - English
+* Cohere Rerank v3 - Multilingual
 
 You can browse the Cohere family of models in the [Model Catalog](model-catalog.md) by filtering on the Cohere collection.
 
-### Cohere Rerank 3 - English
+### Cohere Rerank v3 - English
 
 Cohere Rerank English is a reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank enables you to significantly improve search quality by augmenting traditional keyword-based search systems with a semantic-based reranking system that can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers higher quality results than embedding-based search, lexical search, and even hybrid search, and it requires only adding a single line of code into your application.
 
@@ -42,7 +42,7 @@ Rerank supports JSON objects as documents where users can specify, at query time
 
 Rerank English works well for code retrieval, semi-structured data retrieval, and long context.
 
-### Cohere Rerank 3 - Multilingual
+### Cohere Rerank v3 - Multilingual
 
 Cohere Rerank Multilingual is a reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank Multilingual supports more than 100 languages and can be used to search within a language (for example, to search with a French query on French documents) and across languages (for example, to search with an English query on Chinese documents). Rerank enables you to significantly improve search quality by augmenting traditional keyword-based search systems with a semantic-based reranking system that can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers higher quality results than embedding-based search, lexical search, and even hybrid search, and it requires only adding a single line of code into your application.
 
@@ -74,21 +74,16 @@ You can deploy the previously mentioned Cohere models as a service with pay-as-y
 
 ### Create a new deployment
 
-The following steps demonstrate the deployment of Cohere Rerank 3 - English, but you can use the same steps to deploy Cohere Rerank 3 - Multilingual by replacing the model name.
+The following steps demonstrate the deployment of Cohere Rerank v3 - English, but you can use the same steps to deploy Cohere Rerank v3 - Multilingual by replacing the model name.
 
 To create a deployment:
 
 [!INCLUDE [open-catalog](../includes/open-catalog.md)]
 
-> [!NOTE]
-> For models offered through the Azure Marketplace, ensure that your account has the **Azure AI Developer** role permissions on the resource group, or that you meet the [permissions required to subscribe to model offerings](#permissions-required-to-subscribe-to-model-offerings).
->
-> Models that are offered by non-Microsoft providers (for example, Llama and Mistral models) are billed through the Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Models that are offered by Microsoft (for example, Phi-3 models) don't have this requirement, as billing is done differently. For details about billing for serverless deployment of models in the model catalog, see [Billing for serverless APIs](model-catalog-overview.md#billing).
-
 4. Select the model card of the model you want to deploy. In this article, you select **Cohere-rerank-v3-english** to open the Model Details page.
 
 1. Select **Deploy** to open a serverless API deployment window for the model.
-1. Alternatively, you can initiate a deployment by starting from your project in AI Foundry portal.
+1. Alternatively, you can initiate a deployment from your project in the AI Foundry portal as follows:
 
 1. From the left sidebar of your project, select **Models + Endpoints**.
 1. Select **+ Deploy model**.
@@ -97,7 +92,11 @@ To create a deployment:
 
 1. In the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use.
 1. Select the **Pricing and terms** tab to learn about pricing for the selected model.
-1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering. This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
+1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering.
+
+> [!NOTE]
+> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Cohere models) are billed through the Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
+
 1. Once you subscribe the project for the particular Azure Marketplace offering, subsequent deployments of the _same_ offering in the _same_ project don't require subscribing again. If this scenario applies to you, there's a **Continue to deploy** option to select.
 
 1. Give the deployment a name. This name becomes part of the deployment API URL. This URL must be unique in each Azure region.
@@ -118,11 +117,11 @@ Cohere Rerank models deployed as serverless APIs can be consumed using the Reran
 
 1. Copy the **Target** URL and the **Key** value.
 
-1. Cohere currently exposes `v1/rerank` for inference with the Rerank 3 - English and Rerank 3 - Multilingual models schema. For more information on using the APIs, see the [reference](#rerank-api-reference-for-cohere-rerank-models-deployed-as-a-service) section.
+1. Cohere currently exposes `v1/rerank` for inference with the Rerank v3 - English and Rerank v3 - Multilingual models schema. For more information on using the APIs, see the [reference](#rerank-api-reference-for-cohere-rerank-models-deployed-as-a-service) section.
 
 ## Rerank API reference for Cohere Rerank models deployed as a service
 
-Cohere Rerank 3 - English and Rerank 3 - Multilingual accept the native Cohere Rerank API on `v1/rerank`. This section contains details about the Cohere Rerank API.
+Cohere Rerank v3 - English and Rerank v3 - Multilingual accept the native Cohere Rerank API on `v1/rerank`. This section contains details about the Cohere Rerank API.
 
 #### v1/rerank request
 
@@ -135,7 +134,7 @@ Cohere Rerank 3 - English and Rerank 3 - Multilingual accept the native Cohere R
 
 #### v1/rerank request schema
 
-Cohere Rerank 3 - English and Rerank 3 - Multilingual accept the following parameters for a `v1/rerank` API call:
+Cohere Rerank v3 - English and Rerank v3 - Multilingual accept the following parameters for a `v1/rerank` API call:
 
 | Property | Type | Default | Description |
 | --- | --- | --- | --- |
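
The changed article tells readers to copy the **Target** URL and **Key** and then call `v1/rerank`. A minimal Python sketch of how such a request might be assembled follows; it is not taken from the article or from this commit. It rests on three assumptions: the `v1/rerank` route is appended to the Target URL, the key is sent as a bearer token, and the body uses the `query`, `documents`, and `top_n` fields from Cohere's public Rerank API (the schema table is cut off in this diff). The helper name `build_rerank_request` and the example host are hypothetical.

```python
import json
from urllib import request


def build_rerank_request(target_url, key, query, documents, top_n=None):
    """Assemble (but do not send) a POST request for the v1/rerank route."""
    # Assumption: the copied Target URL is the endpoint base and the
    # v1/rerank route named in the article is appended to it.
    url = target_url.rstrip("/") + "/v1/rerank"

    # Assumption: these field names follow Cohere's public Rerank API;
    # the article's own schema table is truncated in this diff.
    payload = {"query": query, "documents": documents}
    if top_n is not None:
        payload["top_n"] = top_n

    headers = {
        "Content-Type": "application/json",
        # Assumption: the deployment key is passed as a bearer token.
        "Authorization": f"Bearer {key}",
    }
    return request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values only; replace with the Target URL and Key
    # copied from the deployment page.
    req = build_rerank_request(
        "https://example.invalid",
        "<your-key>",
        "What is the capital of France?",
        ["Paris is the capital of France.", "Berlin is in Germany."],
        top_n=1,
    )
    print(req.get_method(), req.full_url)
```

Building the request separately from sending it lets you inspect the URL, headers, and body before any call is made. Passing the returned object to `urllib.request.urlopen` would send it.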
