Skip to content

Commit 803d61d

Browse files
committed
AzureML doc update
1 parent d358feb commit 803d61d

File tree

2 files changed

+58
-52
lines changed

2 files changed

+58
-52
lines changed

articles/ai-studio/how-to/deploy-models-cohere-rerank.md

Lines changed: 6 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,11 @@
11
---
2-
title: How to deploy Cohere Rerank models with Azure AI Studio
2+
title: How to deploy Cohere Rerank models as serverless APIs
33
titleSuffix: Azure AI Studio
4-
description: Learn how to deploy Cohere Rerank models with Azure AI Studio.
4+
description: Learn to deploy and use Cohere Rerank models with Azure AI Studio.
55
manager: scottpolly
66
ms.service: azure-ai-studio
77
ms.topic: how-to
8-
ms.date: 5/21/2024
8+
ms.date: 07/24/2024
99
ms.reviewer: shubhiraj
1010
ms.author: mopeakande
1111
author: msakande
@@ -165,8 +165,8 @@ The `results` object is a dictionary with the following fields:
165165
| Key | Type | Description |
166166
| --- | --- | --- |
167167
| `document` | `object` |The document objects or strings that were reranked. |
168-
| `index` | `ingeter` |The `index` in the original list of documents to which the ranked document belongs. For example, if the first value in the `results` object has an index value of 3, it means in the list of documents passed in, the document at `index=3` had the highest relevance.|
169-
| `relevance_score` | `float` |Relevance scores are normalized to be in the range `[0, 1]`. Scores close to one indicate a high relevance to the query, and scores close to zero indicate low relevance. A score of `0.9` _doesn't_ necessarily mean that a document is twice as relevant as another with a score of `0.45` |
168+
| `index` | `integer` |The `index` in the original list of documents to which the ranked document belongs. For example, if the first value in the `results` object has an index value of 3, it means in the list of documents passed in, the document at `index=3` had the highest relevance.|
169+
| `relevance_score` | `float` |Relevance scores are normalized to be in the range `[0, 1]`. Scores close to one indicate a high relevance to the query, and scores close to zero indicate low relevance. A score of `0.9` _doesn't_ necessarily mean that a document is twice as relevant as another with a score of `0.45`. |
170170

171171

172172
## Examples
@@ -236,9 +236,7 @@ The `results` object is a dictionary with the following fields:
236236
|LangChain|[langchain.ipynb](https://aka.ms/samples/cohere-rerank/langchain)|
237237
|Cohere SDK|[cohere-sdk.ipynb](https://aka.ms/samples/cohere-rerank/cohere-python-sdk)|
238238

239-
## Cost and quotas
240-
241-
### Cost and quota considerations for models deployed as a service
239+
## Cost and quota considerations for models deployed as a service
242240

243241
Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits aren't sufficient for your scenarios.
244242

0 commit comments

Comments
 (0)