- VLM Supervised Fine-Tuning (SFT)
+ VLM Supervised Fine-Tuning
  ================================

  This document explains how to run **full-parameter supervised fine-tuning (Full-parameter SFT)** for VLM models in RLinf.

  This tutorial mainly focuses on two files:

  - Launch script: ``examples/sft/run_vlm_sft.sh``
- - Training config: ``examples/sft/config/custom_sft_vlm.yaml``
+ - Training config: ``examples/sft/config/qwen2_5_sft_vlm.yaml``

  Launch Script: ``examples/sft/run_vlm_sft.sh``

- - The script uses ``examples/sft/config/custom_sft_vlm.yaml`` by default.
+ - The script uses ``examples/sft/config/qwen2_5_sft_vlm.yaml`` by default.
  - Logs are redirected to: ``<repo>/logs/<timestamp>/``
  - Actual command:

@@ -21,7 +21,7 @@ Launch Script: ``examples/sft/run_vlm_sft.sh``
     --config-name <your_config_name> \
     runner.logger.log_path=<auto_generated_log_dir>

- Config Template: ``examples/sft/config/custom_sft_vlm.yaml``
+ Config Template: ``examples/sft/config/qwen2_5_sft_vlm.yaml``

  The VLM config structure is similar to other RLinf training configs.
  You mainly need to adapt ``data`` and ``actor.model`` for your VLM use case.
@@ -35,11 +35,11 @@ Preparation Before Running
     ``https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct``.
  3. Prepare Robo2VLM dataset:
     ``https://huggingface.co/datasets/keplerccc/Robo2VLM-1``.
- 4. Edit ``examples/sft/config/custom_sft_vlm.yaml`` and run
+ 4. Edit ``examples/sft/config/qwen2_5_sft_vlm.yaml`` and run
     ``examples/sft/run_vlm_sft.sh``.

- Example YAML
- ------------
+ Example of Qwen2_5_VL_4B SFT
+ ----------------------------

  Important note: after downloading Robo2VLM, train and eval parquet files are mixed in one directory
  (e.g., ``train-00000-of-00262.parquet`` and ``test-0000X-of-00003.parquet``).
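
One way to separate the mixed files is sketched below, assuming the config is then pointed at distinct train and eval directories. The helper name ``split_parquet_files`` and its arguments are hypothetical and not part of RLinf; the only thing taken from the note above is the ``train-`` / ``test-`` filename prefix.

```python
from pathlib import Path
import shutil


def split_parquet_files(src_dir, train_dir, eval_dir, move=False):
    """Sort train-*.parquet and test-*.parquet files into separate directories.

    Hypothetical helper (not an RLinf API). Files are copied unless move=True.
    Returns a (train_count, eval_count) tuple.
    """
    src = Path(src_dir)
    counts = []
    # The prefixes follow the Robo2VLM file names quoted in the note above.
    for prefix, dest in (("train-", Path(train_dir)), ("test-", Path(eval_dir))):
        dest.mkdir(parents=True, exist_ok=True)
        # glob() is non-recursive, so only files directly in src are matched.
        files = sorted(src.glob(f"{prefix}*.parquet"))
        for f in files:
            if move:
                shutil.move(str(f), str(dest / f.name))
            else:
                shutil.copy2(f, dest / f.name)
        counts.append(len(files))
    return tuple(counts)
```

Copying is the default so the original download stays intact; pass ``move=True`` only when disk space is tight.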
@@ -153,7 +153,7 @@ Run from repository root:

  Notes:

- - If no argument is provided, the script uses ``custom_sft_vlm`` by default.
+ - If no argument is provided, the script uses ``qwen2_5_sft_vlm`` by default.
  - If your config name is different (e.g., ``my_vlm_config.yaml``), pass it as an argument:

  .. code:: bash
@@ -230,7 +230,7 @@ Update these fields first:
  - ``convertor.ckpt_path``: path to ``full_weights.pt``
  - ``convertor.save_path``: output HF model directory
  - ``model.model_path``: base model path
- - ``model.model_type``: model type (e.g., ``qwen2.5_vl``)
+ - ``model.model_type``: model type (e.g., ``qwen2.5_vl``, ``qwen3_vl`` or ``qwen3_vl_moe``)
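
A quick pre-flight check of these four values can catch path typos before the conversion starts. The sketch below is illustrative only: ``check_convertor_settings`` is a hypothetical helper, not an RLinf function, and it assumes ``model.model_path`` is a local directory. The set of model types contains only the names listed above, so the real converter may accept more.

```python
from pathlib import Path

# Model types named in this document; the actual converter may support others.
KNOWN_MODEL_TYPES = {"qwen2.5_vl", "qwen3_vl", "qwen3_vl_moe"}


def check_convertor_settings(ckpt_path, save_path, model_path, model_type):
    """Return a list of problems found; an empty list means nothing looks wrong.

    Hypothetical helper (not an RLinf API).
    """
    problems = []
    if not Path(ckpt_path).is_file():
        problems.append(f"convertor.ckpt_path is not a file: {ckpt_path}")
    if Path(save_path).is_file():
        problems.append(f"convertor.save_path is a file, expected a directory: {save_path}")
    # Assumption: the base model is stored locally rather than fetched by name.
    if not Path(model_path).is_dir():
        problems.append(f"model.model_path is not a directory: {model_path}")
    if model_type not in KNOWN_MODEL_TYPES:
        problems.append(f"model.model_type not in {sorted(KNOWN_MODEL_TYPES)}: {model_type}")
    return problems
```

Returning a list instead of raising on the first error lets all misconfigured fields be reported in one pass.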

  Run:
