Updating the Gemma3 verification script to support safetensors checkpoints and numerical verification for FunctionGemma, a variant of Gemma3 270m. #927

Open
copybara-service[bot] wants to merge 1 commit into main from test_865393370

@copybara-service
Updating the Gemma3 verification script to support safetensors checkpoints and numerical verification for FunctionGemma, a variant of Gemma3 270m.

Key changes:
-- Safetensors Support: Implemented a custom_loader within verify_gemma3.py to handle .safetensors files. This loader includes logic to remap tensor keys (e.g., fusing separate q_proj, k_proj, v_proj into qkv_proj and renaming normalization layers) to match the expectations of the litert_torch model builder and gemma_pytorch.
-- Gemma3 270m Support: Added a get_config_for_270m function to define the configuration for the 270m variant (18 layers, 640 embedding dim, 4 heads).
-- FunctionGemma Verification: Enabled verification of FunctionGemma and Gemma3 270m models against the original implementation.
These changes allow for successful numerical verification of Gemma3-1b (in safetensors format), Gemma3-270m, and FunctionGemma checkpoints.
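
The key remapping described under Safetensors Support could look roughly like the sketch below. It is only an illustration under stated assumptions, not the code in verify_gemma3.py: the function name `remap_state_dict`, the checkpoint key layout (`...self_attn.q_proj.weight`), the rename table, and the choice to fuse by stacking q, k, v rows in that order are all hypothetical. Nested Python lists stand in for tensors so the sketch runs without any dependencies; a real loader would concatenate tensors instead.

```python
import re
from typing import Dict, List

# Stand-in for a 2-D weight tensor (one inner list per output row).
Matrix = List[List[float]]

# Hypothetical rename table; the real script's key names may differ.
RENAMES = {"input_layernorm": "pre_attention_norm"}

# Matches assumed checkpoint keys such as
# "model.layers.0.self_attn.q_proj.weight".
_QKV = re.compile(r"^(?P<prefix>.*\.self_attn)\.(?P<which>[qkv])_proj\.weight$")


def remap_state_dict(state: Dict[str, Matrix]) -> Dict[str, Matrix]:
    """Fuse separate q/k/v projection weights and rename norm layers."""
    out: Dict[str, Matrix] = {}
    pending: Dict[str, Dict[str, Matrix]] = {}

    for key, value in state.items():
        match = _QKV.match(key)
        if match:
            # Hold q, k, v back until all three are seen for this layer.
            pending.setdefault(match["prefix"], {})[match["which"]] = value
            continue
        for old, new in RENAMES.items():
            key = key.replace(old, new)
        out[key] = value

    for prefix, parts in pending.items():
        missing = {"q", "k", "v"} - parts.keys()
        if missing:
            raise KeyError(f"{prefix} is missing projections: {sorted(missing)}")
        # Assumed fusion: stack rows in q, k, v order (concatenation along
        # the output dimension). The actual layout may differ.
        out[f"{prefix}.qkv_proj.weight"] = parts["q"] + parts["k"] + parts["v"]

    return out
```

Raising on an incomplete q/k/v set, rather than silently skipping the layer, keeps a malformed checkpoint from producing a model that loads but fails numerical verification later.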

PiperOrigin-RevId: 865393370