Skip to content

Commit 7a78320

Browse files
Fix link in docstring of RLOOTrainer (#4180)
1 parent 67e83ae commit 7a78320

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

trl/trainer/rloo_trainer.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -97,8 +97,8 @@
9797
class RLOOTrainer(BaseTrainer):
9898
"""
9999
Trainer for the Reinforce Leave One Out (RLOO) method. This algorithm was initially proposed in the paper [Back to
100-
Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs]
101-
(https://huggingface.co/papers/2402.14740).
100+
Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in
101+
LLMs](https://huggingface.co/papers/2402.14740).
102102
103103
Example:
104104

0 commit comments

Comments
 (0)