You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We presented XERV Crayon, a production-grade tokenization system achieving over 40 million tokens per second through principled first-principles engineering. The combination of hyper-fast BPE training, cache-aligned Double-Array Tries, and hardware-native omni-backend acceleration establishes a new performance baseline for tokenization infrastructure.
1349
+
We presented XERV Crayon, a production-grade tokenization system achieving over 5 million tokens per second through principled first-principles engineering. The combination of hyper-fast BPE training, cache-aligned Double-Array Tries, and hardware-native omni-backend acceleration establishes a new performance baseline for tokenization infrastructure.
1358
1350
1359
-
The Cartridge System with five production profiles (lite, science, code, multilingual, arts\_commerce) enables domain-specific optimization without sacrificing the unified API. Adaptive vocabulary management with staged commit/rollback ensures graceful handling of distribution shift.
1351
+
The Cartridge System with two production profiles (lite, standard) enables domain-specific optimization without sacrificing unified API.
1360
1352
1361
1353
Comprehensive benchmarks demonstrate:
1362
1354
\begin{itemize}
1363
-
\item\textbf{56--118$\times$ speedup} over tiktoken and HuggingFace
1364
-
\item\textbf{40+ million tokens/sec} on commodity CPU hardware
1355
+
\item\textbf{18--22$\times$ speedup} over tiktoken on consumer hardware
1356
+
\item\textbf{5+ million tokens/sec} on Intel i3-7020U CPU
1365
1357
\item\textbf{Sub-millisecond} profile loading via zero-copy mapping
0 commit comments