
@DePasqualeOrg
Contributor

swift-transformers PR #303 offers significantly faster tokenizer loading when using AutoTokenizer.from(). It also covers the tokenizer remapping and registration that is currently done in mlx-swift-lm, so we can remove that logic and use the fast path here once that PR is merged.

Changes

  • MLXLMCommon/Tokenizer.swift: loadTokenizer now uses AutoTokenizer.from() directly instead of manually loading configs and calling the PreTrainedTokenizer initializer (see the sketch after this list)
  • Embedders/Tokenizer.swift: Same change, now passes revision to AutoTokenizer.from()
  • Embedders/Models.swift: Added revision parameter to ModelConfiguration for consistency with MLXLMCommon
  • Embedders/Load.swift: Now passes revision to hub.snapshot()
  • Embedders/EmbeddingModel.swift: Uses loadTokenizer instead of inline config loading
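
For illustration, a minimal sketch of the new loading path, assuming the AutoTokenizer.from(pretrained:hubApi:) overload from swift-transformers; the real loadTokenizer in MLXLMCommon takes a ModelConfiguration and also handles local model directories, so this is simplified:

```swift
import Hub
import Tokenizers

// Sketch only: shows just the remote-id case; parameter names are simplified.
func loadTokenizer(modelId: String, hub: HubApi = .shared) async throws -> Tokenizer {
    // Instead of fetching tokenizer_config.json / tokenizer.json manually and
    // calling the PreTrainedTokenizer initializer, delegate to AutoTokenizer,
    // which also performs the tokenizer class remapping and registration.
    try await AutoTokenizer.from(pretrained: modelId, hubApi: hub)
}
```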

API Changes

Deprecated:

  • loadTokenizerConfig: Use LanguageModelConfigurationFromHub from swift-transformers directly, which lets users opt in to the fast path with stripVocabForPerformance: true (see the sketch below).
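
A hedged sketch of the suggested replacement, assuming stripVocabForPerformance lands as an initializer parameter; swift-transformers#303 is not merged yet, so the exact spelling and placement may change:

```swift
import Hub
import Tokenizers

// Replacement for the deprecated loadTokenizerConfig. Whether
// stripVocabForPerformance is an initializer parameter (as assumed here) or is
// exposed elsewhere depends on the final shape of swift-transformers#303.
let hubConfig = LanguageModelConfigurationFromHub(
    modelName: "mlx-community/Qwen2.5-0.5B-Instruct-4bit",  // illustrative model id
    hubApi: .shared,
    stripVocabForPerformance: true  // assumed parameter from PR #303
)
let tokenizerConfig = try await hubConfig.tokenizerConfig
let tokenizerData = try await hubConfig.tokenizerData
```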

Unavailable (breaking change):

  • TokenizerReplacementRegistry / replacementTokenizers: Use AutoTokenizer.register(_:for:) from swift-transformers instead (see the sketch below). These no longer function with the new AutoTokenizer.from() code path.
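
A sketch of the replacement registration, under the assumption that AutoTokenizer.register(_:for:) maps a tokenizer_class name from tokenizer_config.json to a Swift tokenizer type; the exact signature comes from swift-transformers#303 and may change before merge:

```swift
import Hub
import Tokenizers

// Hypothetical registration: map the "CodeLlamaTokenizer" class name to the
// generic PreTrainedTokenizer implementation, as the removed
// TokenizerReplacementRegistry used to do. Argument order and labels are
// assumptions based on the register(_:for:) spelling above.
AutoTokenizer.register(PreTrainedTokenizer.self, for: "CodeLlamaTokenizer")

// AutoTokenizer.from() then resolves that class name to the registered type.
let tokenizer = try await AutoTokenizer.from(
    pretrained: "some-org/some-codellama-model",  // placeholder repo id
    hubApi: .shared
)
```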

Offline Mode

The offline fallback logic has been removed, as it's handled automatically by the swift-transformers Hub API. When offline, HubApi.snapshot() detects the network state via NWPathMonitor and falls back to cached files if available.
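
A minimal sketch of what callers do now, assuming the snapshot(from:matching:) convenience that takes a repo id string; the repo id and glob patterns are illustrative:

```swift
import Hub

// No explicit offline branch needed: snapshot() monitors connectivity via
// NWPathMonitor and falls back to previously downloaded files in the local
// cache when the network is unavailable.
let hub = HubApi()
let modelDirectory = try await hub.snapshot(
    from: "mlx-community/Qwen2.5-0.5B-Instruct-4bit",  // illustrative repo id
    matching: ["*.json", "*.safetensors"]
)
```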

@davidkoski
Collaborator

This looks good -- is it ready to merge?

@DePasqualeOrg
Contributor Author

I think we should wait for huggingface/swift-transformers#303 to be merged. I'll mark this as ready for review at that time.

@davidkoski
Collaborator

davidkoski commented Jan 5, 2026

> I think we should wait for huggingface/swift-transformers#303 to be merged. I'll mark this as ready for review at that time.

Awesome, looking at that one now! Oops, that one is on the swift-transformers side :-)

@DePasqualeOrg force-pushed the fast-tokenizer-loading branch from 8e741f3 to 607860b on January 6, 2026 at 20:06