Skip to content

Commit 5a93997

Browse files
Update explore-analyze/elastic-inference/eis.md
Co-authored-by: Liam Thompson <[email protected]>
1 parent 5cc1bb1 commit 5a93997

File tree

1 file changed

+6
-3
lines changed
  • explore-analyze/elastic-inference

1 file changed

+6
-3
lines changed

explore-analyze/elastic-inference/eis.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -68,11 +68,14 @@ Tokens are the fundamental units that language models process for both input and
6868

6969
For example, the sentence "It was the best of times, it was the worst of times." contains 52 characters but would tokenize into approximately 14 tokens with a typical word-based approach, though the exact count varies by tokenizer.
7070

71-
### Checking Usage
71+
### Monitor your token usage
7272

73-
You can see your token usage by [checking your overall cloud usage](https://cloud.elastic.co/billing/usage) and looking for items that have "Inference" set as the Billing Dimension.
73+
To track your token consumption:
7474

75-
## Rate Limits
75+
1. Navigate to [**Billing and subscriptions > Usage**](https://cloud.elastic.co/billing/usage) in the {{ecloud}} Console
76+
2. Look for line items where the **Billing dimension** is set to "Inference"
77+
78+
## Rate limits
7679

7780
The service enforces rate limits on an ongoing basis. Exceeding a limit will result in HTTP 429 responses from the server until the sliding window moves on further and parts of the limit resets.
7881

0 commit comments

Comments
 (0)