-
Notifications
You must be signed in to change notification settings - Fork 208
Open
Description
Hi authors,
I’m having an issue when running GRPO training.
[rank0]: output_reward_func = reward_func( [rank0]: TypeError: accuracy_reward() missing 1 required positional argument: 'assistant'
Is there a quick fix? Any guidance would be greatly appreciated. Thanks for your help!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels