You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: explore-analyze/elastic-inference/eis.md
+23Lines changed: 23 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -54,3 +54,26 @@ This is particularly relevant when using the [_bulk API](https://www.elastic.co/
54
54
## Pricing
55
55
56
56
All models on EIS incur a charge per million tokens. The pricing details are at our [Pricing page](https://www.elastic.co/pricing/serverless-search) for the Elastic Managed LLM and ELSER.
57
+
58
+
## Rate Limits
59
+
60
+
The service enforces rate limits on an ongoing basis. Exceeding a limit will result in HTTP 429 responses from the server until the sliding window moves on further and parts of the limit resets.
61
+
62
+
### Elastic Managed LLM
63
+
64
+
- 50 requests per minute
65
+
- No rate limit on tokens
66
+
67
+
### ELSER (Sparse Embeddings)
68
+
69
+
We limit on both requests per minute and tokens per minute (whichever limit is reached first).
0 commit comments