Add support for per token reward/advantages with a custom_reward_post_process_path #1389

Open
vpj wants to merge 3 commits into THUDM:main from vpj:main

Commits

Commits on Jan 12, 2026

Commits on Jan 18, 2026