@@ -171,7 +163,7 @@ For more details about dataset downloading, please refer to [Huggingface](https:
You may customize the configurations in `scripts/config/{config_name}.yaml` and `scripts/config/{train_config_name}.yaml`. For example, the model and dataset are specified as:
-```plain
+```yaml
model:
  model_path: $MODEL_PATH/{model_name}
@@ -231,11 +223,11 @@ More example config files can be found in `scripts/config`.
For more detailed examples about how to use Trinity-RFT, please refer to the following documents:
-+[A quick example with GSM8k](./example_reasoning_basic.md);
-+[Off-policy / asynchronous modes of RFT](./example_reasoning_advanced.md);
docs/sphinx_doc/source/tutorial/example_data_functionalities.md (2 additions & 2 deletions)
@@ -8,7 +8,7 @@ In this example, you will learn how to apply the data module of Trinity-RFT to p
2. how to configure the data module
3. what the data module can do
-Before getting started, you need to prepare the main environment of Trinity-RFT according to the [installation section of the README file](main.md#getting-started), and you need to install [postgresql](https://www.postgresql.org/docs/current/tutorial-install.html) as well.
+Before getting started, you need to prepare the main environment of Trinity-RFT according to the [installation section of the README file](../main.md), and you need to install [postgresql](https://www.postgresql.org/docs/current/tutorial-install.html) as well.
### Data Preparation
@@ -243,7 +243,7 @@ You can set more config items for this OP (e.g. notification when annotation is
When you start running with the RFT config, the data module will start the OP `human_preference_annotation_mapper`, and then you can find a new project on the "Projects" page of the label-studio server.