@MubarakHAlketbi

Enable GGUF conversion for Moonlight-16B-A3B with TikTokenTokenizer

  • Modified get_vocab_base to handle TikTokenTokenizer by iterating over vocab_size and calling decode([i]) for each id, bypassing the .vocab requirement.
  • Added trust_remote_code=True to AutoTokenizer.from_pretrained so the model's custom tokenizer can be loaded.
  • Added pre-tokenizer hash "81212dc7cdb7e0c1074ca62c5aeab0d43c9f52b8a737be7b12a777c953027890" to get_vocab_base_pre as "moonlight-a3b".
  • Tested successfully with a local Moonlight-16B-A3B model.
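
For reviewers, here is a minimal sketch of the idea behind the first and third changes. It is not the actual diff. It assumes only that the tokenizer exposes vocab_size, decode, and encode. ToyTokenizer, build_token_list, pre_tokenizer_hash, and the "[PAD{i}]" placeholder are hypothetical names used for illustration. The hash function assumes the fingerprint is a SHA-256 over the stringified token ids of a fixed check text, which is my understanding of how the conversion script identifies pre-tokenizers.

```python
import hashlib
from typing import List, Optional

# Hash and name taken from this PR description.
KNOWN_PRE_TOKENIZERS = {
    "81212dc7cdb7e0c1074ca62c5aeab0d43c9f52b8a737be7b12a777c953027890": "moonlight-a3b",
}


def build_token_list(tokenizer) -> List[str]:
    """Enumerate token strings by id without touching a `.vocab` mapping."""
    tokens = []
    for i in range(tokenizer.vocab_size):
        try:
            tokens.append(tokenizer.decode([i]))
        except Exception:
            # Hypothetical placeholder for ids the tokenizer cannot decode.
            tokens.append(f"[PAD{i}]")
    return tokens


def pre_tokenizer_hash(tokenizer, check_text: str) -> str:
    """Assumed fingerprint: SHA-256 of the stringified token ids."""
    ids = tokenizer.encode(check_text)
    return hashlib.sha256(str(ids).encode()).hexdigest()


def lookup_pre_tokenizer(digest: str) -> Optional[str]:
    """Return the registered pre-tokenizer name, or None if unknown."""
    return KNOWN_PRE_TOKENIZERS.get(digest)


class ToyTokenizer:
    """Stand-in for a tiktoken-style tokenizer; not the Moonlight tokenizer."""

    def __init__(self, pieces: List[str]):
        self._pieces = pieces
        self.vocab_size = len(pieces)

    def decode(self, ids: List[int]) -> str:
        return "".join(self._pieces[i] for i in ids)

    def encode(self, text: str) -> List[int]:
        ids, pos = [], 0
        while pos < len(text):
            # Greedy longest match over the toy pieces.
            match = max((p for p in self._pieces if text.startswith(p, pos)), key=len)
            ids.append(self._pieces.index(match))
            pos += len(match)
        return ids


toy = ToyTokenizer(["<s>", "Hello", " world"])
tokens = build_token_list(toy)
digest = pre_tokenizer_hash(toy, "Hello world")
name = lookup_pre_tokenizer(digest)  # the toy tokenizer is not registered
```

In the conversion script the tokenizer would instead come from AutoTokenizer.from_pretrained with trust_remote_code=True, as described in the second bullet; the toy class only keeps the sketch runnable without downloading a model.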

@github-actions (bot) added the "python" label (python script changes) on Feb 23, 2025