You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/how-to-deploy-models-cohere-embed.md
+10-8Lines changed: 10 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -108,7 +108,8 @@ The previously mentioned Cohere models can be consumed using the chat API.
108
108
109
109
## Embed API reference for Cohere Embed models deployed as a service
110
110
111
-
## V1/embeddings request
111
+
## v1/embeddings
112
+
### Request
112
113
113
114
```
114
115
POST /v1/embeddings HTTP/1.1
@@ -117,15 +118,15 @@ The previously mentioned Cohere models can be consumed using the chat API.
117
118
Content-type: application/json
118
119
```
119
120
120
-
### V1/emebeddings request schema
121
+
### v1/emebeddings request schema
121
122
122
123
Cohere Embed v3 - English and Embed v3 - Multilingual accept the following parameters for a `v1/embeddings` API call:
123
124
124
125
| Property | Type | Default | Description |
125
126
| --- | --- | --- | --- |
126
127
|`input`|`array of strings`|Required |An array of strings for the model to embed. Maximum number of texts per call is 96. We recommend reducing the length of each text to be under 512 tokens for optimal quality. |
127
128
128
-
### V1/emebeddings response schema
129
+
### v1/emebeddings response schema
129
130
130
131
The response payload is a dictionary with the following fields:
131
132
@@ -154,7 +155,7 @@ The `usage` object is a dictionary with the following fields:
154
155
|`total_tokens`|`integer`| Total tokens. |
155
156
156
157
157
-
## V1/embeddings examples
158
+
## v1/embeddings examples
158
159
159
160
**Request**
160
161
@@ -190,7 +191,8 @@ The `usage` object is a dictionary with the following fields:
190
191
}
191
192
```
192
193
193
-
## V1/embed request
194
+
## v1/embed
195
+
### Request
194
196
195
197
```
196
198
POST /v1/embed HTTP/1.1
@@ -199,7 +201,7 @@ The `usage` object is a dictionary with the following fields:
199
201
Content-type: application/json
200
202
```
201
203
202
-
### V1/embed request schema
204
+
### v1/embed request schema
203
205
204
206
Cohere Embed v3 - English and Embed v3 - Multilingual accept the following parameters for a `v1/embed` API call:
205
207
@@ -210,7 +212,7 @@ Cohere Embed v3 - English and Embed v3 - Multilingual accept the following param
210
212
|`truncate`|`enum string`|`NONE`|`NONE` – Returns an error when the input exceeds the maximum input token length. <br/> `START` – Discards the start of the input. <br/> `END` – Discards the end of the input. |
211
213
|`embedding_types`|`array of strings`|`float`|Specifies the types of embeddings you want to get back. Can be one or more of the following types. `float`, `int8`, `uint8`, `binary`, `ubinary`|
212
214
213
-
### V1/embed response schema
215
+
### v1/embed response schema
214
216
215
217
Cohere Embed v3 - English and Embed v3 - Multilingual include the following fields in the response:
216
218
@@ -224,7 +226,7 @@ Cohere Embed v3 - English and Embed v3 - Multilingual include the following fiel
224
226
225
227
For more information, see [https://docs.cohere.com/reference/embed](https://docs.cohere.com/reference/embed).
0 commit comments