Skip to content

Commit f8e8366

Browse files
committed
update docs
1 parent e29c7c5 commit f8e8366

File tree

2 files changed

+7
-7
lines changed

2 files changed

+7
-7
lines changed

docs/components/frontend/configuration.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -95,7 +95,7 @@ See the [Frontend Guide](frontend-guide.md) for KServe message formats and integ
9595

9696
| CLI Argument | Env Var | Default | Description |
9797
|-------------|---------|---------|-------------|
98-
| `--dyn-tokenizer-backend` | `DYN_TOKENIZER_BACKEND` | `default` | Tokenizer backend: `default` (HuggingFace) or `fasttokens` (fastokens crate for high-performance BPE encoding). See [Tokenizer Backends](tokenizer-backends.md) |
98+
| `--tokenizer` | `DYN_TOKENIZER` | `default` | Tokenizer backend: `default` (HuggingFace) or `fastokens` (fastokens crate for high-performance BPE encoding). See [Tokenizer Backends](tokenizer-backends.md) |
9999

100100
## Experimental
101101

docs/components/frontend/tokenizer-backends.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,9 @@ The Dynamo Frontend supports multiple tokenizer backends for BPE-based models. T
1313
The default backend uses the [HuggingFace `tokenizers`](https://github.com/huggingface/tokenizers) library (Rust).
1414
It supports features in `tokenizer.json` files (normalizers, pre-tokenizers, post-processors, decoders, added tokens with special-token flags, and byte-fallback).
1515

16-
#### `fasttokens` High-Performance BPE Encoding
16+
#### `fastokens` High-Performance BPE Encoding
1717

18-
The `fasttokens` backend uses the [`fastokens`](https://github.com/Atero-ai/fastokens) crate, a purpose-built BPE encoder optimized for throughput.
18+
The `fastokens` backend uses the [`fastokens`](https://github.com/Atero-ai/fastokens) crate, a purpose-built BPE encoder optimized for throughput.
1919
It is a _hybrid_ backend: encoding uses `fastokens` while decoding falls back to HuggingFace so that incremental detokenization, byte-fallback, and special-token handling work correctly.
2020

2121
Use this backend when tokenization is a measurable bottleneck, for example on high-concurrency prefill-heavy workloads.
@@ -32,22 +32,22 @@ Set the backend with a CLI flag or environment variable. The CLI flag takes prec
3232

3333
| CLI Argument | Env Var | Valid values | Default |
3434
|---|---|---|---|
35-
| `--dyn-tokenizer-backend` | `DYN_TOKENIZER_BACKEND` | `default`, `fasttokens` | `default` |
35+
| `--tokenizer` | `DYN_TOKENIZER` | `default`, `fastokens` | `default` |
3636

3737
**Examples:**
3838

3939
```bash
4040
# CLI flag
41-
python -m dynamo.frontend --dyn-tokenizer-backend fasttokens
41+
python -m dynamo.frontend --tokenizer fastokens
4242

4343
# Environment variable
44-
export DYN_TOKENIZER_BACKEND=fasttokens
44+
export DYN_TOKENIZER=fastokens
4545
python -m dynamo.frontend
4646
```
4747

4848
## Dynamo Frontend Behavior
4949

50-
When `DYN_TOKENIZER_BACKEND=fasttokens` is set:
50+
When `DYN_TOKENIZER=fastokens` is set:
5151

5252
1. The frontend passes the environment variable to the Rust runtime.
5353
2. When building the tokenizer for a model, `ModelDeploymentCard::tokenizer()` attempts to load `fastokens::Tokenizer` from the same `tokenizer.json` file.

0 commit comments

Comments
 (0)