Skip to content

Commit 92ef009

Browse files
committed
Update trinity-design image, and other minor updates
1 parent 259c180 commit 92ef009

File tree

3 files changed

+10
-6
lines changed

3 files changed

+10
-6
lines changed

README.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -7,17 +7,20 @@
77
</div>
88

99

10+
&nbsp;
11+
12+
1013

1114
**Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).**
1215

1316

14-
Built with a decoupled architecture, seamless integration for agentic workflows, and systematic data processing pipelines, Trinity-RFT can be easily adapted for diverse application scenarios, and serve as a platform for exploring advanced reinforcement learning (RL) paradigms.
17+
Built with a decoupled design, seamless integration for agentic workflows, and systematic data processing pipelines, Trinity-RFT can be easily adapted for diverse application scenarios, and serve as a platform for exploring advanced reinforcement learning (RL) paradigms.
1518

1619

1720

1821

1922

20-
**Vision of this project:**
23+
## Vision of this project
2124

2225

2326
Current RFT approaches, such as RLHF (Reinforcement Learning from Human Feedback) with proxy reward models or training long-CoT reasoning models with rule-based rewards, are limited in their ability to handle dynamic, real-world learning.
@@ -34,7 +37,7 @@ Trinity-RFT offers a path into this future by addressing critical gaps in existi
3437

3538

3639

37-
**Key features of Trinity-RFT:**
40+
## Key features
3841

3942

4043

@@ -102,7 +105,8 @@ e.g., utilizing NCCL (when feasible) for model weight synchronization, sequence
102105
## Getting started
103106

104107

105-
*Note: this project is currently under active development; comments and suggestions are welcome!*
108+
> [!NOTE]
109+
> This project is currently under active development. Comments and suggestions are welcome!
106110
107111

108112

574 KB
Loading

docs/sphinx_doc/source/tutorial/example_data_functionalities.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -137,8 +137,8 @@ All config items in the `data` section can be found [here](trinity_configs.md).
137137

138138

139139

140-
> [!NOTE]
141-
> Only when one of `dj_process_desc` and `dj_config_path` is provided, the data module and the data active iterator will be activated. Otherwise, this part will be skipped and it will enter into the exploring stage directly.
140+
.. note::
141+
Only when one of `dj_process_desc` and `dj_config_path` is provided, the data module and the data active iterator will be activated. Otherwise, this part will be skipped and it will enter into the exploring stage directly.
142142

143143
### Exploring & Training
144144
After preparing the config files of Trinity-RFT, you can start your ray cluster and run the RFT process including the data active iterator part with the following commands:

0 commit comments

Comments
 (0)