Skip to content

Hybrid Group Relative Policy Optimization (Hybrid GRPO): A Multi-Sample Approach to Reinforcement Learning#275

Open
Soham4001A wants to merge 8 commits intoStable-Baselines-Team:masterfrom
Soham4001A:dev
Open

Hybrid Group Relative Policy Optimization (Hybrid GRPO): A Multi-Sample Approach to Reinforcement Learning#275
Soham4001A wants to merge 8 commits intoStable-Baselines-Team:masterfrom
Soham4001A:dev

Commits

Commits on Jan 29, 2025

Commits on Jan 30, 2025

Commits on Mar 30, 2025