articles/ai-studio/how-to/deploy-models-cohere-rerank.md
Lines changed: 8 additions & 2 deletions
@@ -30,7 +30,7 @@ In this section, you learn about the two Cohere Rerank models that are available
 You can browse the Cohere family of models in the [Model Catalog](model-catalog-overview.md) by filtering on the Cohere collection.
 
 ### Cohere Rerank 3 - English
-Cohere Rerank English is the market’s leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
+Cohere Rerank English is the market's leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
 Rerank should be used as a ranker after initial retrieval (i.e. an initial search system finds the top-100 most relevant documents for a larger corpus of documents).
 
 Rerank supports JSON objects as documents where users can specify at query time the fields (keys) that semantic search should be applied over.
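The context lines of this hunk describe Rerank as a second stage that reorders candidates produced by an initial search, with `rank_fields` selecting the JSON keys to score. A minimal Python sketch of that flow follows, under stated assumptions: `first_stage` is a hypothetical keyword-overlap stand-in for a real retriever such as BM25 or embeddings, `build_rerank_payload` is an illustrative helper, and the `documents` key is assumed because the diff's request example elides that part of the body (`query`, `rank_fields`, and `top_n` do appear there).

```python
import json
import re


def tokenize(text):
    """Lowercase word tokens; a deliberately simple tokenizer."""
    return set(re.findall(r"\w+", text.lower()))


def first_stage(query, corpus, k=100):
    """Hypothetical first-stage retriever: rank by keyword overlap.

    This is a stand-in for BM25 or embedding search, not either of them.
    """
    terms = tokenize(query)

    def overlap(doc):
        doc_terms = tokenize(" ".join(str(v) for v in doc.values()))
        return len(terms & doc_terms)

    return sorted(corpus, key=overlap, reverse=True)[:k]


def build_rerank_payload(query, candidates, rank_fields, top_n):
    """Assemble a rerank request body from first-stage candidates."""
    return {
        "query": query,
        "documents": candidates,     # assumed key name, elided in the diff
        "rank_fields": rank_fields,  # JSON keys the reranker should score
        "top_n": top_n,
    }


corpus = [
    {"Title": "Bread", "Content": "Sourdough needs a long fermentation."},
    {"Title": "Paris", "Content": "Paris is the capital of France."},
    {"Title": "Washington, D.C.",
     "Content": "Washington, D.C. is the capital of the United States."},
]
query = "What is the capital of the United States?"

candidates = first_stage(query, corpus, k=2)
payload = build_rerank_payload(query, candidates, ["Title", "Content"], top_n=2)
print(json.dumps(payload, indent=2))
```

In a real pipeline `k` would be closer to the top-100 mentioned in the hunk, and the resulting payload is what would be posted to the rerank endpoint.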
@@ -41,7 +41,7 @@ Rerank supports JSON objects as documents where users can specify at query time
 Rerank English has SOTA performance on benchmarks in Code Retreival, Semi-structured Data Retreival, and Long Context. Cohere evaluated Rerank English on various configurations with BM25 (lexical search) as the initial retrieval step as well as Embeddings as the initial retrieval step <a href="https://github.com/cohere-ai/notebooks/blob/main/public_rerank_benchmarks/bm25_with_rerank.md">BM25 with Rerank v3.0 General Retreival Evaluation Results</a> and <a href="https://github.com/cohere-ai/notebooks/blob/main/public_rerank_benchmarks/embed_with_rerank.md">Embeddings with Rerank v3.0 General Retreival Evaluation Results</a>
 
 ### Cohere Rerank 3 - Multilingual
-Cohere Rerank Multilingual is the market’s leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank Multilingual supports 100+ languages and can be used to search within a language (e.g., search with a French query on French documents) and across languages (e.g., search with an English query on Chinese documents). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
+Cohere Rerank Multilingual is the market's leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank Multilingual supports 100+ languages and can be used to search within a language (e.g., search with a French query on French documents) and across languages (e.g., search with an English query on Chinese documents). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
 Rerank should be used as a ranker after initial retrieval (i.e. an initial search system finds the top-100 most relevant documents for a larger corpus of documents).
 
 Rerank supports JSON objects as documents where users can specify at query time the fields (keys) that semantic search should be applied over.
@@ -126,10 +126,12 @@ These models can be consumed using the Rerank API.
 
 ### v1/rerank Request
 
+```json
 POST /v1/rerank HTTP/1.1
 Host: <DEPLOYMENT_URI>
 Authorization: Bearer <TOKEN>
 Content-type: application/json
+```
 
 ### v1/rerank Request Schema
 
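The request lines fenced in this hunk map directly onto an HTTP client call. The sketch below builds, but does not send, such a request with Python's standard library. It rests on assumptions: the `https` scheme, treating `<DEPLOYMENT_URI>` as a host name, the placeholder host and token values, and the helper name `build_rerank_request` are all illustrative. The body uses only keys visible in the diff's request example, so the elided document list is left out.

```python
import json
import urllib.request


def build_rerank_request(deployment_uri, token, payload):
    """Build (without sending) a POST /v1/rerank request.

    deployment_uri and token stand in for <DEPLOYMENT_URI> and <TOKEN>.
    """
    return urllib.request.Request(
        url=f"https://{deployment_uri}/v1/rerank",  # https is an assumption
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_rerank_request(
    "example-deployment.invalid",  # placeholder host, not a real endpoint
    "example-token",               # placeholder credential
    {
        "query": "What is the capital of the United States?",
        "rank_fields": ["Title", "Content"],
        "top_n": 3,
    },
)
print(request.get_method(), request.full_url)
```

Passing the object to `urllib.request.urlopen` would send it; that step is omitted here because the host is a placeholder.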
@@ -169,6 +171,7 @@ The `results` object is a dictionary with the following fields:
 
 **Request**
 
+```json
 {
 "query": "What is the capital of the United States?",
 "rank_fields": ["Title", "Content"],
@@ -179,9 +182,11 @@ The `results` object is a dictionary with the following fields:
 ],
 "top_n": 3
 }
+```
 
 **Response**
 
+```json
 {
 "id": "571e6744-3074-457f-8935-08646a3352fb",
 "results": [
@@ -219,6 +224,7 @@ The `results` object is a dictionary with the following fields:
articles/machine-learning/how-to-deploy-models-cohere-rerank.md
Lines changed: 9 additions & 3 deletions
@@ -28,7 +28,7 @@ You can browse the Cohere family of models in the model catalog by filtering on
 In this article, you learn how to use Azure Machine Learning studio to deploy the Cohere models as a serverless API with pay-as you go billing.
 
 ### Cohere Rerank 3 - English
-Cohere Rerank English is the market’s leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
+Cohere Rerank English is the market's leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
 Rerank should be used as a ranker after initial retrieval (i.e. an initial search system finds the top-100 most relevant documents for a larger corpus of documents).
 
 Rerank supports JSON objects as documents where users can specify at query time the fields (keys) that semantic search should be applied over.
@@ -39,7 +39,7 @@ Rerank supports JSON objects as documents where users can specify at query time
 Rerank English has SOTA performance on benchmarks in Code Retreival, Semi-structured Data Retreival, and Long Context. Cohere evaluated Rerank English on various configurations with BM25 (lexical search) as the initial retrieval step as well as Embeddings as the initial retrieval step <a href="https://github.com/cohere-ai/notebooks/blob/main/public_rerank_benchmarks/bm25_with_rerank.md">BM25 with Rerank v3.0 General Retreival Evaluation Results</a> and <a href="https://github.com/cohere-ai/notebooks/blob/main/public_rerank_benchmarks/embed_with_rerank.md">Embeddings with Rerank v3.0 General Retreival Evaluation Results</a>
 
 ### Cohere Rerank 3 - Multilingual
-Cohere Rerank Multilingual is the market’s leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank Multilingual supports 100+ languages and can be used to search within a language (e.g., search with a French query on French documents) and across languages (e.g., search with an English query on Chinese documents). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
+Cohere Rerank Multilingual is the market's leading reranking model used for semantic search and retrieval-augmented generation (RAG). Rerank Multilingual supports 100+ languages and can be used to search within a language (e.g., search with a French query on French documents) and across languages (e.g., search with an English query on Chinese documents). Rerank enables you to significantly improve search quality by augmenting traditional key-word based search systems with a semantic-based reranking system which can contextualize the meaning of a user's query beyond keyword relevance. Cohere's Rerank delivers much higher quality results than just embedding-based search, lexical search and even hybrid search, and it requires only adding a single line of code into your application.
 Rerank should be used as a ranker after initial retrieval (i.e. an initial search system finds the top-100 most relevant documents for a larger corpus of documents).
 
 Rerank supports JSON objects as documents where users can specify at query time the fields (keys) that semantic search should be applied over.
@@ -113,14 +113,16 @@ The previously mentioned Cohere models can be consumed using the chat API.
 
 For more information on using the APIs, see the [reference](#rerank-api-reference-for-cohere-rerank-models-deployed-as-a-serverless-api) section.
 
-## Rerank API reference for Cohere Rerank models deployed as a service
+## Rerank API reference for Cohere Rerank models deployed as a serverless API
 
 ### v1/rerank Request
 
+```json
 POST /v1/rerank HTTP/1.1
 Host: <DEPLOYMENT_URI>
 Authorization: Bearer <TOKEN>
 Content-type: application/json
+```
 
 ### v1/rerank Request Schema
 
@@ -160,6 +162,7 @@ The `results` object is a dictionary with the following fields:
 
 **Request**
 
+```json
 {
 "query": "What is the capital of the United States?",
 "rank_fields": ["Title", "Content"],
@@ -170,9 +173,11 @@ The `results` object is a dictionary with the following fields:
 ],
 "top_n": 3
 }
+```
 
 **Response**
 
+```json
 {
 "id": "571e6744-3074-457f-8935-08646a3352fb",
 "results": [
@@ -210,6 +215,7 @@ The `results` object is a dictionary with the following fields: