
Conversation

@corebonts
Contributor

@corebonts corebonts commented Mar 15, 2025

(Partially?) resolves #711

I still want to test it further, but I thought an initial review would be great.

The commit is based on the llama.cpp implementation.
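For context, the error behind #711 ("unknown model architecture: 'gemma3'") means the loader has no entry for the gemma3 architecture string. A minimal sketch of the kind of wiring involved, assuming llama.cpp-style names (the exact enum and table names in the vendored sources may differ):

#include <map>

// Sketch only: register the new architecture so GGUF files tagged "gemma3"
// are recognized instead of failing with "unknown model architecture".
enum llm_arch {
    // ... existing architectures ...
    LLM_ARCH_GEMMA,
    LLM_ARCH_GEMMA2,
    LLM_ARCH_GEMMA3,   // new entry for Gemma 3
};

static const std::map<llm_arch, const char *> LLM_ARCH_NAMES = {
    // ...
    { LLM_ARCH_GEMMA3, "gemma3" },   // architecture string read from GGUF metadata
};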

TODO:

  • Test 1B
  • Test 4B
  • Test 12B
  • Test 27B
  • Test image input (I don't think it's working)

@corebonts corebonts changed the title Draft: Initial support for Gemma 3 models Initial support for Gemma 3 models Mar 15, 2025
@corebonts
Contributor Author

@cjpais What is needed for image support? Also, what is the easiest way to debug or test it? Or should we just go with the text only for now?

@corebonts
Contributor Author

corebonts commented Mar 16, 2025

I think I should build on the granite support feature branch because that also introduces the attention scale param.
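(For reference, the attention-scale parameter mentioned here replaces the usual 1/sqrt(head_dim) factor applied to the QK^T logits. A rough sketch of how such an hparam is typically read and applied, where the key name LLM_KV_ATTENTION_SCALE and the field f_attention_scale are assumptions about the vendored llama.cpp code:)

// Sketch only: read an optional attention-scale override from GGUF metadata
// and fall back to the conventional scaled-dot-product factor otherwise.
float f_attention_scale = 0.0f;
ml.get_key(LLM_KV_ATTENTION_SCALE, f_attention_scale, /*required=*/false);

const float kq_scale = f_attention_scale != 0.0f
    ? f_attention_scale                      // model-specific override
    : 1.0f/sqrtf((float) n_embd_head);       // default 1/sqrt(head_dim)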

@cjpais
Collaborator

cjpais commented Mar 16, 2025

Sweet, thank you so much. I will take a look at it tomorrow/Tuesday. I suspect the granite branch will be merged in a day or two, so we will probably wait for that before this comes in. If it's easier to rebase on it now, great; if it's easier later, we can do that.

@cjpais
Collaborator

cjpais commented Mar 17, 2025

Largely this looks good to me.

I will probably do another once-over tomorrow (mostly verifying build_gemma3 and the hparams in llm_load_hparams against llama.cpp).

I've tested it on the 4 sizes and it works.

@corebonts if you don't mind rebasing it, that would be great; otherwise I can resolve it tomorrow afternoon.

@corebonts
Contributor Author

On it

Tested only on text-to-text.
@cjpais
Collaborator

cjpais commented Mar 18, 2025

Thanks @corebonts, I added the n_swa_pattern hparam which was added in llama.cpp #12373 for slightly easier readability between the two repos. It looks like this will be missing some RoPE scaling from upstream as far as I can tell, but this is probably another indication of needing a sync with upstream.
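(For Gemma 3 the reported layout is five sliding-window layers followed by one global-attention layer, so n_swa_pattern would be 6. A minimal sketch of how a per-layer check might look; the helper name and exact indexing here are assumptions, not the merged code:)

// Sketch only: with n_swa_pattern == 6, layers 0..4 use the sliding-window
// (local) mask and layer 5 uses full attention, then the pattern repeats.
static bool is_sliding_window_layer(uint32_t il, uint32_t n_swa_pattern) {
    if (n_swa_pattern == 0) {
        return false;                        // pattern disabled: all layers global
    }
    return (il + 1) % n_swa_pattern != 0;    // every n-th layer is global
}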

@cjpais cjpais merged commit f0d65b6 into mozilla-ai:main Mar 18, 2025
1 check passed
@corebonts
Contributor Author

@cjpais Could you tell me why n_swa is no longer needed? I also see it in the original llama.cpp code, and according to other Gemma 3 implementations it should be 1024 (or 512 for the 1B model), like here: https://github.com/google/gemma_pytorch/blob/014acb7ac4563a5f77c76d7ff98f31b568c16508/gemma/config.py#L230

And it's also mentioned in the technical report:
https://www.reddit.com/r/LocalLLaMA/comments/1j9drfk/gemma_3_technical_report/

@cjpais
Collaborator

cjpais commented Mar 18, 2025

@corebonts it's there! It's set by the call below, from what I can tell. I tested it, and it matches the code currently on the main branch of llama.cpp:

ml.get_key(LLM_KV_ATTENTION_SLIDING_WINDOW, hparams.n_swa);
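(As a rough illustration of what the loaded n_swa value does: with a window of 1024, a query position can only attend to the most recent 1024 key positions. A minimal sketch of the mask condition, not the actual masking code:)

// Sketch only: key position j is visible to query position i when it is
// causal (j <= i) and falls inside the sliding window of size n_swa.
static bool swa_can_attend(int64_t i, int64_t j, uint32_t n_swa) {
    return j <= i && (i - j) < (int64_t) n_swa;
}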

@jart
Collaborator

jart commented Mar 23, 2025

A+++



Development

Successfully merging this pull request may close these issues.

Bug: error loading model: error loading model architecture: unknown model architecture: 'gemma3'
