Skip to content

Add more training models and RLHF algorithms#6368

Closed
sglucas wants to merge 4 commits intohpcaitech:grpo-latestfrom
sglucas:test-grpo
Closed

Add more training models and RLHF algorithms#6368
sglucas wants to merge 4 commits intohpcaitech:grpo-latestfrom
sglucas:test-grpo

Commits

Commits on Jul 21, 2025

Commits on Jul 23, 2025