Skip to content

Commit 9d892e0

Browse files
Merge pull request #732 from mrbullwinkle/mrb_10_09_2024_embeddings
[Azure OpenAI] embeddings update
2 parents 59d9fe9 + 39a398a commit 9d892e0

File tree

2 files changed

+5
-4
lines changed

2 files changed

+5
-4
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -430,10 +430,10 @@ These models can only be used with Embedding API requests.
430430
431431
| Model ID | Max Request (tokens) | Output Dimensions |Training Data (up-to)
432432
|---|---| :---:|:---:|:---:|
433-
| `text-embedding-ada-002` (version 2) |8,191 | 1,536 | Sep 2021 |
433+
| `text-embedding-ada-002` (version 2) |8,192 | 1,536 | Sep 2021 |
434434
| `text-embedding-ada-002` (version 1) |2,046 | 1,536 | Sep 2021 |
435-
| `text-embedding-3-large` | 8,191 | 3,072 |Sep 2021 |
436-
| `text-embedding-3-small` | 8,191| 1,536 | Sep 2021 |
435+
| `text-embedding-3-large` | 8,192 | 3,072 |Sep 2021 |
436+
| `text-embedding-3-small` | 8,192| 1,536 | Sep 2021 |
437437

438438
> [!NOTE]
439439
> When sending an array of inputs for embedding, the max number of input items in the array per call to the embedding endpoint is 2048.

articles/ai-services/openai/how-to/embeddings.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -126,8 +126,9 @@ return $response.data.embedding
126126

127127
### Verify inputs don't exceed the maximum length
128128

129-
- The maximum length of input text for our latest embedding models is 8192 tokens. You should verify that your inputs don't exceed this limit before making a request.
129+
- The maximum length of input text for our latest embedding models is 8,192 tokens. You should verify that your inputs don't exceed this limit before making a request.
130130
- If sending an array of inputs in a single embedding request the max array size is 2048.
131+
- The sum of the token count of the entire array of inputs sent in a single API call is subject to the max token limit of 8192. For example, you cannot send an array of 2,048 inputs with each input having five tokens or more. The total token count of this API request would be 10,240 total tokens, 2,048 tokens over the 8192 per API call token limit.
131132

132133

133134
## Limitations & risks

0 commit comments

Comments
 (0)