Skip to content

Commit 6e3dc31

Browse files
committed
Adding more info on tokens.
1 parent a831534 commit 6e3dc31

File tree

1 file changed

+12
-0
lines changed
  • explore-analyze/elastic-inference

1 file changed

+12
-0
lines changed

explore-analyze/elastic-inference/eis.md

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -55,6 +55,18 @@ This is particularly relevant when using the [_bulk API](https://www.elastic.co/
5555

5656
All models on EIS incur a charge per million tokens. The pricing details are at our [Pricing page](https://www.elastic.co/pricing/serverless-search) for the Elastic Managed LLM and ELSER.
5757

58+
### Token-based billing
59+
60+
EIS is billed per million "tokens" used. Tokens can be thought of loosely as "words" which are given to a machine learning model to operate upon. The model may also produce a number of tokens in response.
61+
62+
For example, the sentence:
63+
64+
"It was the best of times, it was the worst of times."
65+
66+
contains 52 characters, but would be tokenised into 14 tokens - one for each of the 12 words, one for the comma, and one for the period character.
67+
68+
This is because machine learning models use words to denote meaning.
69+
5870
## Rate Limits
5971

6072
The service enforces rate limits on an ongoing basis. Exceeding a limit will result in HTTP 429 responses from the server until the sliding window moves on further and parts of the limit resets.

0 commit comments

Comments
 (0)