Refactor advantage computation, and delete RayPPOTrainer.fit #61

Merged
yanxi-chen merged 7 commits into modelscope:algorithm_dev from yanxi-chen:dev/refactor_advantage
Jun 3, 2025