[GRPO] add grpo loss types #2370
nvi-ci.yml
on: pull_request
nvi-correctness-tests
0s
nvi-convergence-tests
nvi-correctness-tests-with-transformers-4-52-0
nvi-convergence-tests-with-transformers-4-52-0
0s