Add support for the Training Method for finetuning, and for Direct-Preference Optimization (DPO) #262

Merged
mryab merged 14 commits into main from Vprov/dpo_python
Mar 11, 2025

Commits

Commits on Feb 28, 2025

Commits on Mar 3, 2025

Commits on Mar 4, 2025

Commits on Mar 5, 2025

Commits on Mar 11, 2025