Support converting Qwen2.5-based MiniVLA checkpoint to HF format by 360ZMEM · Pull Request #2 · Stanford-ILIAD/openvla-mini

360ZMEM · 2025-12-22T14:50:35Z

Description:
This PR modifies file vla-scripts/extern/convert_openvla_weights_to_hf.py to enable the conversion of MiniVLA checkpoints (Qwen2.5 backbone) to standard HF format.

Changes:

Qwen2.5 Support: Added logic to handle num_key_value_heads (GQA) which is present in Qwen2.5 configs but not in the default Llama mapping.
Extra Tokens: Implemented logic to add <|extra_i|> tokens for models trained with the extra tokenizer configuration.
Vocab Alignment: Ensured vocab_size in the config matches the padded vocabulary size used during Prismatic training.

Scope:

✅ Verified: Tested with prism-qwen25-extra-dinosiglip-224px+0_5b (Standard Action Binning).
This PR does not consider VQActionTokenizer models.

360ZMEM added 2 commits December 22, 2025 22:22

Support converting MiniVLA weights to HF format

32d7cfa

Update note on HF checkpoint conversion support

78a2e3f

360ZMEM marked this pull request as ready for review December 22, 2025 14:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support converting Qwen2.5-based MiniVLA checkpoint to HF format#2

Support converting Qwen2.5-based MiniVLA checkpoint to HF format#2
360ZMEM wants to merge 2 commits intoStanford-ILIAD:mainfrom
360ZMEM:feat/support-hf-checkpoint-export

360ZMEM commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

360ZMEM commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant