- https://github.com/open-thought/tiny-grpo - https://github.com/aburkov/theLMbook/blob/main/GRPO.py - https://github.com/policy-gradient/GRPO-Zero