Skip to content

Add Fine-Tuning a Vision Language Model with TRL using MPO recipe#318

Merged
merveenoyan merged 2 commits intohuggingface:mainfrom
sergiopaniego:mpo-recipe
Jul 23, 2025
Merged

Add Fine-Tuning a Vision Language Model with TRL using MPO recipe#318
merveenoyan merged 2 commits intohuggingface:mainfrom
sergiopaniego:mpo-recipe

Conversation

@sergiopaniego
Copy link
Member

What does this PR do?

Add Fine-Tuning a Vision Language Model with TRL using MPO recipe

Fixes #317

Who can review?

@merveenoyan and @stevhliu

@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@review-notebook-app
Copy link

review-notebook-app bot commented Jul 21, 2025

View / edit / reply to this conversation on ReviewNB

stevhliu commented on 2025-07-21T18:47:56Z
----------------------------------------------------------------

I think this loss_type list is more appropriate in section 3.3 and you probably don't need to list all possible types - only list the ones you'll use and refer the reader to the docs for more


@review-notebook-app
Copy link

review-notebook-app bot commented Jul 21, 2025

View / edit / reply to this conversation on ReviewNB

stevhliu commented on 2025-07-21T18:47:57Z
----------------------------------------------------------------

I think maybe this needs to be in English 😁


Copy link
Member

@stevhliu stevhliu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Really cool, thanks for the new recipe!

@sergiopaniego sergiopaniego marked this pull request as ready for review July 22, 2025 13:11
@sergiopaniego
Copy link
Member Author

Ready for review as MPO PR in trl is already merged!

Thanks for the comments @stevhliu. I've addressed them.

@merveenoyan merveenoyan merged commit e0fcb99 into huggingface:main Jul 23, 2025
1 check passed
@sergiopaniego sergiopaniego deleted the mpo-recipe branch July 23, 2025 15:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

🧑‍🍳 New Fine-Tuning a Vision Language Model with TRL using MPO recipe

4 participants