Skip to content

Commit 8e0cb46

Browse files
Add usage instructions for EIS. (#3714)
Co-authored-by: Liam Thompson <[email protected]>
1 parent 639923b commit 8e0cb46

File tree

1 file changed

+8
-1
lines changed
  • explore-analyze/elastic-inference

1 file changed

+8
-1
lines changed

explore-analyze/elastic-inference/eis.md

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -68,7 +68,14 @@ Tokens are the fundamental units that language models process for both input and
6868

6969
For example, the sentence "It was the best of times, it was the worst of times." contains 52 characters but would tokenize into approximately 14 tokens with a typical word-based approach, though the exact count varies by tokenizer.
7070

71-
## Rate Limits
71+
### Monitor your token usage
72+
73+
To track your token consumption:
74+
75+
1. Navigate to [**Billing and subscriptions > Usage**](https://cloud.elastic.co/billing/usage) in the {{ecloud}} Console
76+
2. Look for line items where the **Billing dimension** is set to "Inference"
77+
78+
## Rate limits
7279

7380
The service enforces rate limits on an ongoing basis. Exceeding a limit will result in HTTP 429 responses from the server until the sliding window moves on further and parts of the limit resets.
7481

0 commit comments

Comments
 (0)