Commit 452284b

Add trainers taxonomy to docs (#4195)
1 parent 6be53e1 commit 452284b

1 file changed: +38 -0 lines changed

docs/source/index.md

Lines changed: 38 additions & 0 deletions
@@ -7,6 +7,44 @@
 TRL is a full stack library where we provide a set of tools to train transformer language models with methods like Supervised Fine-Tuning (SFT), Group Relative Policy Optimization (GRPO), Direct Preference Optimization (DPO), Reward Modeling, and more.
 The library is integrated with 🤗 [transformers](https://github.com/huggingface/transformers).
 
+Below is the current list of TRL trainers, organized by method type (⚡️ = vLLM support).
+
+<div style="display: flex; justify-content: space-between; width: 100%; gap: 2rem;">
+
+<div style="flex: 1; min-width: 0;">
+
+**Online methods**
+- [`GRPOTrainer`] ⚡️
+- [`RLOOTrainer`] ⚡️
+- [`OnlineDPOTrainer`] ⚡️
+- [`NashMDTrainer`] ⚡️
+- [`XPOTrainer`] ⚡️
+- [`PPOTrainer`]
+
+**Reward modeling**
+- [`PRMTrainer`]
+- [`RewardTrainer`]
+
+</div>
+
+<div style="flex: 1; min-width: 0;">
+
+**Offline methods**
+- [`SFTTrainer`]
+- [`DPOTrainer`]
+- [`ORPOTrainer`]
+- [`BCOTrainer`]
+- [`CPOTrainer`]
+- [`KTOTrainer`]
+
+**Knowledge distillation**
+- [`GKDTrainer`]
+
+</div>
+
+</div>
+
 ## 🎉 What's New
 
 **✨ OpenAI GPT OSS Support**: TRL now fully supports fine-tuning the latest [OpenAI GPT OSS models](https://huggingface.co/collections/openai/gpt-oss-68911959590a1634ba11c7a4)! Check out the:
