Skip to content

Conversation

@kashif
Copy link
Contributor

@kashif kashif commented Jan 1, 2026

Summary

Add various GRPO loss types.

Testing Done

  • Hardware Type:
  • run make test to ensure correctness
  • run make checkstyle to ensure code style
  • run make test-convergence to ensure convergence

@kashif
Copy link
Contributor Author

kashif commented Jan 1, 2026

grpo_comparison_subplots

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant