Skip to content

Commit 15bc343

Browse files
committed
add citations
1 parent 4adfd63 commit 15bc343

File tree

1 file changed

+21
-0
lines changed

1 file changed

+21
-0
lines changed

docs/LLaVA_OneVision_Chat.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -94,6 +94,27 @@ Using the feedback data obtained in `Step 2`, we conduct DPO training in an iter
9494

9595
This iterative process is repeated for `N=3` rounds in total, with each round refining the model’s ability to generate high-quality visual chat responses by progressively incorporating feedback from both human and AI assessments.
9696

97+
------
98+
### Citation
99+
100+
If you find it useful for your research and applications, please cite related papers/blogs using this BibTeX:
101+
```bibtex
102+
103+
104+
@article{li2024llava,
105+
title={Llava-onevision: Easy visual task transfer},
106+
author={Li, Bo and Zhang, Yuanhan and Guo, Dong and Zhang, Renrui and Li, Feng and Zhang, Hao and Zhang, Kaichen and Li, Yanwei and Liu, Ziwei and Li, Chunyuan},
107+
journal={arXiv preprint arXiv:2408.03326},
108+
year={2024}
109+
}
110+
111+
@article{2023llavarlhf,
112+
author = {Zhiqing Sun and Sheng Shen and Shengcao Cao and Haotian Liu and Chunyuan Li and Yikang Shen and Chuang Gan and Liang-Yan Gui and Yu-Xiong Wang and Yiming Yang and Kurt Keutzer and Trevor Darrell},
113+
title = {Aligning Large Multimodal Models with Factually Augmented RLHF},
114+
publisher = {arXiv:2309.14525},
115+
year = {2023}
116+
}
117+
```
97118

98119
------
99120

0 commit comments

Comments
 (0)