Skip to content

Commit 883b428

Browse files
Add TRL example notebook to RLHF docs (#26346)
Signed-off-by: sergiopaniego <[email protected]>
1 parent e1098ce commit 883b428

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

docs/training/rlhf.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,4 +12,5 @@ See the following basic examples to get started if you don't want to use an exis
1212

1313
See the following notebooks showing how to use vLLM for GRPO:
1414

15+
- [Efficient Online Training with GRPO and vLLM in TRL](https://huggingface.co/learn/cookbook/grpo_vllm_online_training)
1516
- [Qwen-3 4B GRPO using Unsloth + vLLM](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Qwen3_(4B)-GRPO.ipynb)

0 commit comments

Comments
 (0)