Skip to content

Commit 9c9deb8

Browse files
Merge pull request #1516 from mrbullwinkle/patch-7
Update embeddings.md
2 parents 77e440d + e06cab7 commit 9c9deb8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

articles/ai-services/openai/how-to/embeddings.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -128,7 +128,7 @@ return $response.data.embedding
128128

129129
- The maximum length of input text for our latest embedding models is 8,192 tokens. You should verify that your inputs don't exceed this limit before making a request.
130130
- If sending an array of inputs in a single embedding request the max array size is 2048.
131-
- The sum of the token count of the entire array of inputs sent in a single API call is subject to the max token limit of 8192. For example, you cannot send an array of 2,048 inputs with each input having five tokens or more. The total token count of this API request would be 10,240 total tokens, 2,048 tokens over the 8192 per API call token limit.
131+
- When sending an array of inputs in a single request, remember that the number of tokens per minute in your requests must remain below the quota limit that was assigned at model deployment. By default, the latest generation 3 embeddings models are subject to a 350 K TPM per region limit.
132132

133133

134134
## Limitations & risks

0 commit comments

Comments
 (0)