docs/components/frontend/tokenizer-backends.md
6 additions & 6 deletions
@@ -13,9 +13,9 @@ The Dynamo Frontend supports multiple tokenizer backends for BPE-based models. T
The default backend uses the [HuggingFace `tokenizers`](https://github.com/huggingface/tokenizers) library (Rust).
It supports features in `tokenizer.json` files (normalizers, pre-tokenizers, post-processors, decoders, added tokens with special-token flags, and byte-fallback).

-#### `fasttokens` High-Performance BPE Encoding
+#### `fastokens` High-Performance BPE Encoding

-The `fasttokens` backend uses the [`fastokens`](https://github.com/Atero-ai/fastokens) crate, a purpose-built BPE encoder optimized for throughput.
+The `fastokens` backend uses the [`fastokens`](https://github.com/Atero-ai/fastokens) crate, a purpose-built BPE encoder optimized for throughput.
It is a _hybrid_ backend: encoding uses `fastokens` while decoding falls back to HuggingFace so that incremental detokenization, byte-fallback, and special-token handling work correctly.
Use this backend when tokenization is a measurable bottleneck, for example on high-concurrency prefill-heavy workloads.
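
The hybrid split described above (one component encodes, another decodes) can be illustrated with a minimal sketch. It does not use the `fastokens` or HuggingFace APIs: `HybridTokenizer`, `toy_encode`, and `toy_decode` are hypothetical names invented for illustration, and the byte-level toy functions only stand in for the real encoder and decoder.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class HybridTokenizer:
    """Hypothetical wrapper: one callable encodes, a different one decodes."""

    fast_encode: Callable[[str], List[int]]       # stand-in for the fast BPE encoder
    reference_decode: Callable[[List[int]], str]  # stand-in for the reference decoder

    def encode(self, text: str) -> List[int]:
        # The throughput-critical path goes to the fast encoder.
        return self.fast_encode(text)

    def decode(self, ids: List[int]) -> str:
        # Decoding is delegated to the reference implementation.
        return self.reference_decode(ids)


# Toy stand-ins so the sketch runs without either library installed.
def toy_encode(text: str) -> List[int]:
    return list(text.encode("utf-8"))


def toy_decode(ids: List[int]) -> str:
    return bytes(ids).decode("utf-8", errors="replace")


tok = HybridTokenizer(fast_encode=toy_encode, reference_decode=toy_decode)
ids = tok.encode("héllo")
text = tok.decode(ids)
```

The point of the design is that both halves must agree on the vocabulary; in the document's case that holds because both are built from the same `tokenizer.json`.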
@@ -32,22 +32,22 @@ Set the backend with a CLI flag or environment variable. The CLI flag takes prec
1. The frontend passes the environment variable to the Rust runtime.
2. When building the tokenizer for a model, `ModelDeploymentCard::tokenizer()` attempts to load `fastokens::Tokenizer` from the same `tokenizer.json` file.
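
The truncated hunk header above appears to say that the CLI flag takes precedence over the environment variable. A minimal sketch of that resolution order follows. The flag name, variable name, and default value are hypothetical placeholders, since the real ones are not shown in this excerpt.

```python
import argparse
from typing import Mapping, Sequence

# Hypothetical names: the real flag and variable are not visible in this diff.
ENV_VAR = "EXAMPLE_TOKENIZER_BACKEND"
DEFAULT_BACKEND = "default"


def resolve_backend(argv: Sequence[str], environ: Mapping[str, str]) -> str:
    """Return the backend name: CLI flag first, then environment, then default."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--example-tokenizer-backend", default=None)
    # parse_known_args ignores unrelated flags instead of exiting.
    args, _unknown = parser.parse_known_args(list(argv))
    if args.example_tokenizer_backend is not None:
        return args.example_tokenizer_backend
    return environ.get(ENV_VAR, DEFAULT_BACKEND)
```

Passing `argv` and `environ` explicitly, rather than reading `sys.argv` and `os.environ` inside the function, keeps the precedence rule easy to test in isolation.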