Replies: 1 comment
DPO is good enough for me; I hope some MLX ninja can make it happen.
Hello!
I was reading this article on fine-tuning Llama 3 with ORPO (https://mlabonne.github.io/blog/posts/2024-04-19_Fine_tune_Llama_3_with_ORPO.html), and I was wondering how to do the same fine-tuning process with MLX.
The article uses ORPOTrainer from https://github.com/huggingface/trl.
Is there similar functionality in MLX? If not, would anyone have pointers on implementing it myself? I've put a rough sketch of my understanding of the loss below.
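For reference, here is how I imagine the ORPO objective could be written with mlx.core, based on my reading of the ORPO paper. It is untested, and the inputs (the token-averaged log-probabilities of the chosen and rejected completions, plus the `beta` weight) are placeholders I would have to compute and tune myself, not anything provided by MLX:

```python
# Rough, untested sketch of the ORPO loss in MLX (my own reading of the paper,
# not an existing mlx / mlx-lm API).
import mlx.core as mx

def orpo_loss(nll_chosen, avg_logp_chosen, avg_logp_rejected, beta=0.1):
    """ORPO objective: SFT negative log-likelihood on the chosen answer,
    plus beta times an odds-ratio penalty that pushes the chosen answer
    to be more likely than the rejected one.

    nll_chosen:        per-example NLL of the chosen completion, shape (batch,)
    avg_logp_chosen:   token-averaged log-prob of the chosen completion, shape (batch,)
    avg_logp_rejected: token-averaged log-prob of the rejected completion, shape (batch,)
    """
    # odds(p) in log space: log(p / (1 - p)) = log p - log(1 - p),
    # with log(1 - p) computed stably as log1p(-exp(log p))
    def log_odds(avg_logp):
        return avg_logp - mx.log1p(-mx.exp(avg_logp))

    log_odds_ratio = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    # -log(sigmoid(x)) == log(1 + exp(-x)), written with logaddexp for stability
    or_penalty = mx.logaddexp(0.0, -log_odds_ratio)
    return mx.mean(nll_chosen) + beta * mx.mean(or_penalty)
```

If something like that is roughly right, I'd guess it could replace the plain cross-entropy loss in the mlx-lm LoRA training loop, with the dataset providing chosen/rejected pairs, but I'm not sure what the cleanest way to wire that up would be.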