Commit af2b68a

Update how-to-deploy-models-cohere-embed.md
1 parent 4a4156e commit af2b68a

1 file changed: +14 -4 lines changed

articles/machine-learning/how-to-deploy-models-cohere-embed.md

Lines changed: 14 additions & 4 deletions
@@ -53,7 +53,7 @@ The previously mentioned Cohere models can be deployed as a service with pay-as-
 - An Azure Machine Learning workspace. If you don't have these, use the steps in the [Quickstart: Create workspace resources](quickstart-create-resources.md) article to create them.

 > [!IMPORTANT]
-> Pay-as-you-go model deployment offering is only available in workspaces created in EastUS, EastUS2 or Sweden Central regions.
+> Pay-as-you-go model deployment offering is only available in workspaces created in EastUS2 or Sweden Central regions.

 - Azure role-based access controls (Azure RBAC) are used to grant access to operations. To perform the steps in this article, your user account must be assigned the __Azure AI Developer role__ on the Resource Group.

@@ -64,7 +64,7 @@ The previously mentioned Cohere models can be deployed as a service with pay-as-
 To create a deployment:

 1. Go to [Azure Machine Learning studio](https://ml.azure.com/home).
-1. Select the workspace in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the EastUS, EastUS2 or Sweden Central region.
+1. Select the workspace in which you want to deploy your models. To use the pay-as-you-go model deployment offering, your workspace must belong to the EastUS2 or Sweden Central region.
 1. Choose the model you want to deploy from the [model catalog](https://ml.azure.com/model/catalog).

 Alternatively, you can initiate deployment by going to your workspace and selecting **Endpoints** > **Serverless endpoints** > **Create**.
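
Reviewer's note: the region requirement changed in the two hunks above could be checked before attempting a deployment. The following is a minimal sketch, not part of the committed article. The helper name `is_supported_region` and the constant `SUPPORTED_REGIONS` are hypothetical, and the normalized names `eastus2` and `swedencentral` assume the usual lowercase, space-free form of Azure location names.

```python
# Regions listed in the updated article text (EastUS2 and Sweden Central),
# written in lowercase without spaces. This set is an assumption for
# illustration and must be kept in sync with the documentation by hand.
SUPPORTED_REGIONS = {"eastus2", "swedencentral"}


def is_supported_region(region: str) -> bool:
    """Return True if `region` matches one of the supported regions.

    The comparison ignores case and whitespace, so "EastUS2" and
    "Sweden Central" are both accepted.
    """
    normalized = "".join(region.split()).lower()
    return normalized in SUPPORTED_REGIONS


if __name__ == "__main__":
    for name in ("EastUS2", "Sweden Central", "EastUS"):
        print(name, is_supported_region(name))
```

With the list as of this commit, `EastUS` is rejected, which matches the removal made in the diff.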
@@ -118,15 +118,15 @@ For more information on using the APIs, see the [reference](#embed-api-reference
 Content-type: application/json
 ```

-#### v1/emebeddings request schema
+#### v1/embeddings request schema

 Cohere Embed v3 - English and Embed v3 - Multilingual accept the following parameters for a `v1/embeddings` API call:

 | Property | Type | Default | Description |
 | --- | --- | --- | --- |
 |`input` |`array of strings` |Required |An array of strings for the model to embed. Maximum number of texts per call is 96. We recommend reducing the length of each text to be under 512 tokens for optimal quality. |

-#### v1/emebeddings response schema
+#### v1/embeddings response schema

 The response payload is a dictionary with the following fields:

@@ -329,6 +329,16 @@ Response:
 | OpenAI SDK (experimental) | [openaisdk.ipynb](https://aka.ms/samples/cohere-embed/openaisdk) |
 | LangChain | [langchain.ipynb](https://aka.ms/samples/cohere-embed/langchain) |
 | Cohere SDK | [cohere-sdk.ipynb](https://aka.ms/samples/cohere-embed/cohere-python-sdk) |
+| LiteLLM SDK | [litellm.ipynb](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/cohere/litellm.ipynb) |
+
+##### Retrieval Augmented Generation (RAG) and Tool-Use samples
+**Description** | **Sample Type** | **Sample Notebook**
+--|--|--
+Create a local (FAISS) vector index using Cohere embeddings - Langchain|`langchain`, `langchain_cohere`|[cohere_faiss_langchain_embed.ipynb](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/cohere/cohere_faiss_langchain_embed.ipynb)
+Use Cohere Command R/R+ to answer questions from data in local (FAISS) vector index - Langchain|`langchain`, `langchain_cohere`|[command_faiss_langchain.ipynb](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/cohere/command_faiss_langchain.ipynb)
+Use Cohere Command R/R+ to answer questions from data in AI search vector index - Langchain|`langchain`, `langchain_cohere`|[cohere-aisearch-langchain-rag.ipynb](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/cohere/cohere-aisearch-langchain-rag.ipynb)
+Use Cohere Command R/R+ to answer questions from data in AI search vector index - Cohere SDK| `cohere`, `azure_search_documents`|[cohere-aisearch-rag.ipynb](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/cohere/cohere-aisearch-rag.ipynb)
+Command R+ tool/function calling using LangChain|`cohere`, `langchain`, `langchain_cohere`|[command_tools-langchain.ipynb](https://github.com/Azure/azureml-examples/blob/main/sdk/python/foundation-models/cohere/command_tools-langchain.ipynb)

 ## Cost and quotas
