Skip to content

Fixes

4724824
Select commit
Loading
Failed to load commit list.
Merged

New options for preference tuning: rpo alpha, logprobs normalization, reference-free, simpo gamma #327

Fixes
4724824
Select commit
Loading
Failed to load commit list.