
Conversation

@ngxson ngxson commented Mar 21, 2025

Fix #12483

Some fine-tuned Gemma 3 models ship a separate output (lm_head.weight) tensor; we should include it in the converted model to make sure it runs correctly.

While we could drop this tensor as suggested in the issue, I don't think it's worth the risk: we are not sure whether the fine-tuning code produces token_embd and lm_head tensors that differ.
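
For illustration, here is a minimal sketch of the mapping decision described above. It is not the actual convert_hf_to_gguf.py code; the helper name map_output_tensor is hypothetical, and only the Hugging Face and GGUF tensor names are taken from the real formats. If the checkpoint carries a separate lm_head.weight, it is exported as the output tensor; otherwise the loader can fall back to the tied token embedding.

```python
# Minimal sketch (not the actual convert_hf_to_gguf.py code) of the
# head/embedding mapping decision for Gemma 3 checkpoints.
# HF-side names ("lm_head.weight", "model.embed_tokens.weight") and
# GGUF-side names ("output.weight", "token_embd.weight") are the usual
# conventions; map_output_tensor itself is a hypothetical helper.
from typing import Dict

import torch


def map_output_tensor(state_dict: Dict[str, torch.Tensor]) -> Dict[str, torch.Tensor]:
    """Return the GGUF-side tensors derived from the checkpoint's embedding/head."""
    mapped: Dict[str, torch.Tensor] = {}

    # The token embedding is always present and always exported.
    mapped["token_embd.weight"] = state_dict["model.embed_tokens.weight"]

    lm_head = state_dict.get("lm_head.weight")
    if lm_head is not None:
        # Fine-tuned checkpoints may carry an untied lm_head; keep it so the
        # converted model reproduces the original logits.
        mapped["output.weight"] = lm_head
    # Otherwise no output tensor is written and the runtime can reuse the
    # (tied) token embedding as the output projection.
    return mapped
```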

@ngxson ngxson requested a review from ggerganov March 21, 2025 21:12
@github-actions github-actions bot added the python (python script changes) label Mar 21, 2025
@ngxson ngxson merged commit fbdfefe into ggml-org:master Mar 22, 2025
49 of 50 checks passed
Ivy233 pushed a commit to Ivy233/llama.cpp that referenced this pull request Mar 23, 2025
llama : gemma3 : use output tensor if it exists in model weight (ggml-org#12506)

* llama : gemma3 : use output tensor if it exists in model weight

* also add to the llm_tensor_names

Labels

python (python script changes)

Development

Successfully merging this pull request may close these issues.

convert_hf_to_gguf.py: Can not map tensor 'lm_head.weight' on Gemma-3-12b-it