An Error In "Fine-Tuning a Vision Language Model (Qwen2-VL-7B) with the Hugging Face Ecosystem (TRL)" Notebook

In this notebook(https://huggingface.co/learn/cookbook/fine_tuning_vlm_trl), it uses messages column to finetune. However, when I try to reproduce it using Qwen2.5-VL and my own dataset, the model converges to a local minima. I then read from the dataset sturcture and saw that I should use prompt and completion and it works. I think we should make this clear in the notebook.
https://github.com/huggingface/trl/issues/4077

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

An Error In "Fine-Tuning a Vision Language Model (Qwen2-VL-7B) with the Hugging Face Ecosystem (TRL)" Notebook #328

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

An Error In "Fine-Tuning a Vision Language Model (Qwen2-VL-7B) with the Hugging Face Ecosystem (TRL)" Notebook #328

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions