Keep returns separate from advantages in GRPO #773

Open
ebronstein wants to merge 2 commits into NovaSky-AI:main from ebronstein:grpo_returns

Commits