articles/ai-studio/how-to/deploy-models-cohere-rerank.md
+14 -15 (14 additions & 15 deletions)
@@ -24,12 +24,12 @@ In this article, you learn about the Cohere Rerank models, how to use Azure AI F
Cohere offers two Rerank models in [Azure AI Foundry](https://ai.azure.com). These models are available in the model catalog for deployment as serverless APIs:
-* Cohere Rerank 3 - English
-* Cohere Rerank 3 - Multilingual
+* Cohere Rerank v3 - English
+* Cohere Rerank v3 - Multilingual
You can browse the Cohere family of models in the [Model Catalog](model-catalog.md) by filtering on the Cohere collection.
-### Cohere Rerank 3 - English
+### Cohere Rerank v3 - English
Cohere Rerank English is a reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank enables you to significantly improve search quality by augmenting traditional keyword-based search systems with a semantic-based reranking system that can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers higher quality results than embedding-based search, lexical search, and even hybrid search, and it requires only adding a single line of code into your application.
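As a rough illustration of the reranking step described above, the sketch below reorders a keyword-search candidate list using scores in the shape the Cohere Rerank API is documented to return (a `results` array whose entries carry an `index` into the submitted documents and a `relevance_score`). The function name `apply_rerank`, the sample documents, and the scores are hypothetical and are not part of this change.

```python
from typing import Any, Optional


def apply_rerank(
    candidates: list[str],
    results: list[dict[str, Any]],
    top_n: Optional[int] = None,
) -> list[tuple[str, float]]:
    """Reorder first-stage candidates by rerank relevance score.

    Assumes each result has an "index" (position in `candidates`)
    and a "relevance_score" (higher means more relevant).
    """
    # Sort defensively in case the results are not already ordered.
    ordered = sorted(results, key=lambda r: r["relevance_score"], reverse=True)
    if top_n is not None:
        ordered = ordered[:top_n]
    return [(candidates[r["index"]], r["relevance_score"]) for r in ordered]


# Hypothetical candidates from a keyword search, in keyword-match order.
candidates = [
    "Reset your password from the account settings page.",
    "Our office is closed on public holidays.",
    "If you forgot your password, request a recovery email.",
]

# Hypothetical rerank results for the query "I can't log in".
results = [
    {"index": 2, "relevance_score": 0.98},
    {"index": 0, "relevance_score": 0.41},
    {"index": 1, "relevance_score": 0.07},
]

top = apply_rerank(candidates, results, top_n=2)
for text, score in top:
    print(f"{score:.2f}  {text}")
```

Keeping the reordering separate from the API call means the first-stage search system stays unchanged; only the order of its candidates is adjusted.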
@@ -42,7 +42,7 @@ Rerank supports JSON objects as documents where users can specify, at query time
Rerank English works well for code retrieval, semi-structured data retrieval, and long context.
-### Cohere Rerank 3 - Multilingual
+### Cohere Rerank v3 - Multilingual
Cohere Rerank Multilingual is a reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank Multilingual supports more than 100 languages and can be used to search within a language (for example, to search with a French query on French documents) and across languages (for example, to search with an English query on Chinese documents). Rerank enables you to significantly improve search quality by augmenting traditional keyword-based search systems with a semantic-based reranking system that can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers higher quality results than embedding-based search, lexical search, and even hybrid search, and it requires only adding a single line of code into your application.
@@ -74,21 +74,16 @@ You can deploy the previously mentioned Cohere models as a service with pay-as-y
### Create a new deployment
-The following steps demonstrate the deployment of Cohere Rerank 3 - English, but you can use the same steps to deploy Cohere Rerank 3 - Multilingual by replacing the model name.
+The following steps demonstrate the deployment of Cohere Rerank v3 - English, but you can use the same steps to deploy Cohere Rerank v3 - Multilingual by replacing the model name.
> For models offered through the Azure Marketplace, ensure that your account has the **Azure AI Developer** role permissions on the resource group, or that you meet the [permissions required to subscribe to model offerings](#permissions-required-to-subscribe-to-model-offerings).
->
-> Models that are offered by non-Microsoft providers (for example, Llama and Mistral models) are billed through the Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Models that are offered by Microsoft (for example, Phi-3 models) don't have this requirement, as billing is done differently. For details about billing for serverless deployment of models in the model catalog, see [Billing for serverless APIs](model-catalog-overview.md#billing).
-
4. Select the model card of the model you want to deploy. In this article, you select **Cohere-rerank-v3-english** to open the Model Details page.
1. Select **Deploy** to open a serverless API deployment window for the model.
-1. Alternatively, you can initiate a deployment by starting from your project in AI Foundry portal.
+1. Alternatively, you can initiate a deployment from your project in the AI Foundry portal as follows:
1. From the left sidebar of your project, select **Models + Endpoints**.
1. Select **+ Deploy model**.
@@ -97,7 +92,11 @@ To create a deployment:
1. In the deployment wizard, select the link to **Azure Marketplace Terms** to learn more about the terms of use.
1. Select the **Pricing and terms** tab to learn about pricing for the selected model.
-1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering. This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
+1. Select the **Subscribe and Deploy** button. If this is your first time deploying the model in the project, you have to subscribe your project for the particular offering.
+
+> [!NOTE]
+> This step requires that your account has the **Azure AI Developer role** permissions on the resource group, as listed in the prerequisites. Models that are offered by non-Microsoft providers (for example, Cohere models) are billed through the Azure Marketplace. For such models, you're required to subscribe your project to the particular model offering. Each project has its own subscription to the particular Azure Marketplace offering of the model, which allows you to control and monitor spending. Currently, you can have only one deployment for each model within a project.
+
1. Once you subscribe the project for the particular Azure Marketplace offering, subsequent deployments of the _same_ offering in the _same_ project don't require subscribing again. If this scenario applies to you, there's a **Continue to deploy** option to select.
1. Give the deployment a name. This name becomes part of the deployment API URL. This URL must be unique in each Azure region.
@@ -118,11 +117,11 @@ Cohere Rerank models deployed as serverless APIs can be consumed using the Reran
1. Copy the **Target** URL and the **Key** value.
-1. Cohere currently exposes `v1/rerank` for inference with the Rerank 3 - English and Rerank 3 - Multilingual models schema. For more information on using the APIs, see the [reference](#rerank-api-reference-for-cohere-rerank-models-deployed-as-a-service) section.
+1. Cohere currently exposes `v1/rerank` for inference with the Rerank v3 - English and Rerank v3 - Multilingual models schema. For more information on using the APIs, see the [reference](#rerank-api-reference-for-cohere-rerank-models-deployed-as-a-service) section.
## Rerank API reference for Cohere Rerank models deployed as a service
-Cohere Rerank 3 - English and Rerank 3 - Multilingual accept the native Cohere Rerank API on `v1/rerank`. This section contains details about the Cohere Rerank API.
+Cohere Rerank v3 - English and Rerank v3 - Multilingual accept the native Cohere Rerank API on `v1/rerank`. This section contains details about the Cohere Rerank API.
#### v1/rerank request
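For orientation, here is a minimal sketch of how a `v1/rerank` request might be assembled from the **Target** URL and **Key** copied during deployment. It rests on several assumptions that this diff does not confirm: that the Target value is a base URL to which `/v1/rerank` is appended, that the key is sent as a bearer token in the `Authorization` header, and that the body uses the `query`, `documents`, and `top_n` fields of the native Cohere Rerank API. The helper name `build_rerank_request` and all sample values are hypothetical. The sketch only builds the request and does not send it.

```python
import json
import urllib.request
from typing import Optional


def build_rerank_request(
    target_url: str,
    key: str,
    query: str,
    documents: list[str],
    top_n: Optional[int] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for the v1/rerank route."""
    payload: dict = {"query": query, "documents": documents}
    if top_n is not None:
        payload["top_n"] = top_n  # limit how many results are returned

    return urllib.request.Request(
        # Assumption: the Target URL is a base URL without the route.
        url=target_url.rstrip("/") + "/v1/rerank",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: the deployment key is passed as a bearer token.
            "Authorization": f"Bearer {key}",
        },
        method="POST",
    )


# Placeholder values; substitute the Target URL and Key from your deployment.
request = build_rerank_request(
    "https://example.invalid",
    "PLACEHOLDER_KEY",
    query="What is the capital of the United States?",
    documents=[
        "Carson City is the capital city of the American state of Nevada.",
        "Washington, D.C. is the capital of the United States.",
    ],
    top_n=1,
)
print(request.get_method(), request.full_url)
```

With real values, the request could be sent with `urllib.request.urlopen(request)` and the JSON response decoded with `json.load`.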
@@ -135,7 +134,7 @@ Cohere Rerank 3 - English and Rerank 3 - Multilingual accept the native Cohere R
#### v1/rerank request schema
-Cohere Rerank 3 - English and Rerank 3 - Multilingual accept the following parameters for a `v1/rerank` API call:
+Cohere Rerank v3 - English and Rerank v3 - Multilingual accept the following parameters for a `v1/rerank` API call: