-
Notifications
You must be signed in to change notification settings - Fork 154
Open
Labels
Description
Describe the bug
Hi devs! Recently we attempt to train Megatron-Bridge-based Qwen3-VL with slime and find large logprobs diff between train and rollout backends. After deeply investigating the modeling, we find that:
get_rope_indexcalled inmodel.pycannot correctly process packed input_ids._apply_rotary_pos_emb_thdonly support RoPE of traditional mapping without offsets whenapply_rope_fusionis enabled.--no-rope-fusionmust be set to get the correct behavior.
Here is a draft patch fix (for dense and MoE, not strictly tested) on 5c7ebe7 .
zhuzilin and nanjiangwill