-
Notifications
You must be signed in to change notification settings - Fork 198
Open
Description
Hi authors,
I am using GRPO to fine-tune Qwen/Qwen3-VL-2B-Instruct and encoutering the following error:
[rank0]: File "src/trainer/grpo_trainer.py", line 53, in _generate_and_score_completions
[rank0]: prompts = [x["prompt"] for x in inputs]
[rank0]: File "src/trainer/grpo_trainer.py", line 53, in <listcomp>
[rank0]: prompts = [x["prompt"] for x in inputs]
[rank0]: ~^^^^^^^^^^
[rank0]: TypeError: string indices must be integers, not 'str'
It looks like the autoprocessor of Qwen3-VL is turning the inputs into a BatchFeature object with the following keys:
b.data.keys() Out[8]: dict_keys(['input_ids', 'attention_mask', 'pixel_values', 'image_grid_thw'])
Are there any quick fixes I can do? Thanks for the help!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels