Skip to content

Commit 3bbda33

Browse files
committed
add dpo training scripts
1 parent 6b6833c commit 3bbda33

File tree

1 file changed

+7
-11
lines changed

1 file changed

+7
-11
lines changed

docs/LLaVA_OneVision_Chat.md

Lines changed: 7 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -98,18 +98,14 @@ This iterative process is repeated for `N=3` rounds in total, with each round re
9898

9999
- Example training script: [`/scripts/train/dpo_ov7b.sh`](../scripts/train/dpo_ov7b.sh)
100100
- Format of training data:
101-
102101
~~~json
103-
[
104-
{
105-
"id": "<image-id>",
106-
"image": "<image path under args.image_folder>",
107-
"prompt": "<input prompt/question>",
108-
"chosen": "<chosen model response>",
109-
"rejected": "<rejected model response>"
110-
},
111-
...
112-
]
102+
{
103+
"id": "<image-id>",
104+
"image": "<image path under args.image_folder>",
105+
"prompt": "<input prompt/question>",
106+
"chosen": "<chosen model response>",
107+
"rejected": "<rejected model response>"
108+
}
113109
~~~
114110

115111

0 commit comments

Comments
 (0)