ORPO (Or DPO?) #1350
exdownloader started this conversation in General
I've seen a few discussions about DPO for sd-scripts, specifically this and this.
However, from what I can tell, there hasn't been further movement on either.
ORPO is related to DPO, and some even consider it superior, since it folds preference alignment directly into the standard fine-tuning objective and does not need a frozen reference model.
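For concreteness, here is a minimal sketch of the ORPO objective as it was proposed for language models (Hong et al., 2024). The function name, the `lam` default, and the use of average per-token log-probabilities are my own illustrative assumptions, not taken from any fork; how best to adapt the odds-ratio term to diffusion denoising losses is exactly the open question for sd-scripts.

```python
import torch
import torch.nn.functional as F

def orpo_loss(logp_w, logp_l, nll_w, lam=0.1):
    """Illustrative ORPO sketch (not from any sd-scripts fork).

    logp_w / logp_l: average per-token log-likelihoods of the
    preferred and rejected responses under the model being trained.
    nll_w: the ordinary SFT negative log-likelihood on the preferred
    response. Note that no reference model appears anywhere.
    """
    # log odds(y) = log p - log(1 - p), kept in log space for stability
    log_odds_w = logp_w - torch.log1p(-torch.exp(logp_w))
    log_odds_l = logp_l - torch.log1p(-torch.exp(logp_l))
    # Push the odds of the preferred response above the rejected one
    ratio_loss = -F.logsigmoid(log_odds_w - log_odds_l)
    return (nll_w + lam * ratio_loss).mean()
```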
I was recently browsing the various forks of sd-scripts and found the following repo, which appears to be under active development.
However, the branch doesn't work for me, erroring out with the following:

I think any kind of preference training would be interesting to explore, and I'd be happy to see this kind of feature in sd-scripts. However, I haven't been able to reach the developer of that fork, so I'm raising awareness here in case there is a chance to gain traction.
After speaking with other AI/ML researchers and developers, I have been informed that regular DPO training is "easy" to implement.
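To give a sense of why it's called easy: in the diffusion setting, Diffusion-DPO (Wallace et al., 2023) reduces to a few lines on top of the usual denoising MSE. A rough sketch, assuming the per-sample denoising errors for a preferred/rejected image pair have already been computed under both the trained UNet and a frozen reference copy; the function name and the `beta` default are illustrative, not taken from any existing implementation.

```python
import torch.nn.functional as F

def diffusion_dpo_loss(model_err_w, model_err_l, ref_err_w, ref_err_l, beta=2500.0):
    """Illustrative Diffusion-DPO sketch.

    Each *_err tensor holds the per-sample MSE denoising error
    ||eps - eps_theta(x_t, t, c)||^2 for the preferred (w) and
    rejected (l) images, under the trained model and a frozen
    reference copy of it.
    """
    # Reward the model for beating the reference at denoising the
    # preferred image by a larger margin than on the rejected one.
    model_diff = model_err_w - model_err_l
    ref_diff = ref_err_w - ref_err_l
    return -F.logsigmoid(-beta * (model_diff - ref_diff)).mean()
```

Wiring that into sd-scripts (paired winner/loser datasets, keeping a second frozen UNet in memory, etc.) is the real engineering work, which is presumably what the fork above attempts.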