Skip to content

Commit d3f7e57

Browse files
Add missing rate limits for EIS (elastic#3625)
Resolves: elastic/search-team#11614 --------- Co-authored-by: Max Jakob <[email protected]>
1 parent 38b967c commit d3f7e57

File tree

1 file changed

+23
-0
lines changed
  • explore-analyze/elastic-inference

1 file changed

+23
-0
lines changed

explore-analyze/elastic-inference/eis.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -54,3 +54,26 @@ This is particularly relevant when using the [_bulk API](https://www.elastic.co/
5454
## Pricing
5555

5656
All models on EIS incur a charge per million tokens. The pricing details are at our [Pricing page](https://www.elastic.co/pricing/serverless-search) for the Elastic Managed LLM and ELSER.
57+
58+
## Rate Limits
59+
60+
The service enforces rate limits on an ongoing basis. Exceeding a limit will result in HTTP 429 responses from the server until the sliding window moves on further and parts of the limit resets.
61+
62+
### Elastic Managed LLM
63+
64+
- 50 requests per minute
65+
- No rate limit on tokens
66+
67+
### ELSER (Sparse Embeddings)
68+
69+
We limit on both requests per minute and tokens per minute (whichever limit is reached first).
70+
71+
#### Ingest
72+
73+
- 6,000 request per minute
74+
- 6,000,000 tokens per minute
75+
76+
#### Search
77+
78+
- 6,000 requests per minute
79+
- 600,000 tokens per minute

0 commit comments

Comments
 (0)