Qwen3 MoE should also work with tie_word_embeddings #13768

estibi · 2025-05-25T06:25:18Z

For custom trained Qwen3 MoE model with tie_word_embeddings=True I get this error:

load_tensors: loading model tensors, this can take a while... (mmap = true)
llama_model_load: error loading model: missing tensor 'output.weight'
llama_model_load_from_file_impl: failed to load model

This fix makes tie_word_embeddings optional for Qwen3 MoE.

Qwen3 MoE should also work with tie_word_embeddings

07f9269

CISC approved these changes May 25, 2025

View reviewed changes

CISC merged commit 4032ca4 into ggml-org:master May 25, 2025
46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qwen3 MoE should also work with tie_word_embeddings #13768

Qwen3 MoE should also work with tie_word_embeddings #13768

Uh oh!

estibi commented May 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Qwen3 MoE should also work with tie_word_embeddings #13768

Qwen3 MoE should also work with tie_word_embeddings #13768

Uh oh!

Conversation

estibi commented May 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants