Commit 4adfd63

committed
update links
1 parent 6f73f7d commit 4adfd63

1 file changed: +1 -1 lines changed

docs/LLaVA_OneVision_Chat.md

Lines changed: 1 addition & 1 deletion
@@ -55,7 +55,7 @@ LLaVA-OV-Chat consistently showcases exceptional visual chat capabilities across

 To optimize LLaVA-OneVision’s in-the-wild conversational abilities, we've employed an iterative Direct Preference Optimization (DPO) process. Through this method, we found that feedback from two primary sources is particularly effective:

-1. **Human Feedback from LLaVA-RLHF**: Real-world human input plays a crucial role in guiding the model toward more intuitive and user-friendly responses.
+1. **Human Feedback from [LLaVA-RLHF](https://llava-rlhf.github.io/)**: Real-world human input plays a crucial role in guiding the model toward more intuitive and user-friendly responses.

 2. **AI Feedback from LLaVA-OV’s Self-Generated Responses**: Additionally, the AI's own self-generated feedback allows it to continuously improve and adapt, making this a valuable source for iterative learning.
