|
1 | 1 | ---
|
2 |
| -title: How to deploy Cohere Rerank models with Azure AI Studio |
| 2 | +title: How to deploy Cohere Rerank models as serverless APIs |
3 | 3 | titleSuffix: Azure AI Studio
|
4 |
| -description: Learn how to deploy Cohere Rerank models with Azure AI Studio. |
| 4 | +description: Learn to deploy and use Cohere Rerank models with Azure AI Studio. |
5 | 5 | manager: scottpolly
|
6 | 6 | ms.service: azure-ai-studio
|
7 | 7 | ms.topic: how-to
|
8 |
| -ms.date: 5/21/2024 |
| 8 | +ms.date: 07/24/2024 |
9 | 9 | ms.reviewer: shubhiraj
|
10 | 10 | ms.author: mopeakande
|
11 | 11 | author: msakande
|
@@ -165,8 +165,8 @@ The `results` object is a dictionary with the following fields:
|
165 | 165 | | Key | Type | Description |
|
166 | 166 | | --- | --- | --- |
|
167 | 167 | | `document` | `object` |The document objects or strings that were reranked. |
|
168 |
| -| `index` | `ingeter` |The `index` in the original list of documents to which the ranked document belongs. For example, if the first value in the `results` object has an index value of 3, it means in the list of documents passed in, the document at `index=3` had the highest relevance.| |
169 |
| -| `relevance_score` | `float` |Relevance scores are normalized to be in the range `[0, 1]`. Scores close to one indicate a high relevance to the query, and scores close to zero indicate low relevance. A score of `0.9` _doesn't_ necessarily mean that a document is twice as relevant as another with a score of `0.45` | |
| 168 | +| `index` | `integer` |The `index` in the original list of documents to which the ranked document belongs. For example, if the first value in the `results` object has an index value of 3, it means in the list of documents passed in, the document at `index=3` had the highest relevance.| |
| 169 | +| `relevance_score` | `float` |Relevance scores are normalized to be in the range `[0, 1]`. Scores close to one indicate a high relevance to the query, and scores close to zero indicate low relevance. A score of `0.9` _doesn't_ necessarily mean that a document is twice as relevant as another with a score of `0.45`. | |
170 | 170 |
|
171 | 171 |
|
172 | 172 | ## Examples
|
@@ -236,9 +236,7 @@ The `results` object is a dictionary with the following fields:
|
236 | 236 | |LangChain|[langchain.ipynb](https://aka.ms/samples/cohere-rerank/langchain)|
|
237 | 237 | |Cohere SDK|[cohere-sdk.ipynb](https://aka.ms/samples/cohere-rerank/cohere-python-sdk)|
|
238 | 238 |
|
239 |
| -## Cost and quotas |
240 |
| - |
241 |
| -### Cost and quota considerations for models deployed as a service |
| 239 | +## Cost and quota considerations for models deployed as a service |
242 | 240 |
|
243 | 241 | Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits aren't sufficient for your scenarios.
|
244 | 242 |
|
|
0 commit comments