Skip to content

Commit 75c7c8b

Browse files
committed
* rename command
1 parent 694de66 commit 75c7c8b

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

docs/sphinx_doc/source/tutorial/example_reasoning_advanced.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ The algorithm design and analysis can be found in this [technical report](./opmd
1717

1818
To try out the OPMD algorithm:
1919
```shell
20-
as-rft run --config scripts/config/gsm8k_opmd.yaml
20+
trinity run --config scripts/config/gsm8k_opmd.yaml
2121
```
2222

2323
Note that in this config file, `sync_iteration_interval` is set to 10, i.e., the model weights of explorer and trainer are synchronized only once every 10 training steps, which leads to a challenging off-policy scenario (potentially with abrupt distribution shift during the RFT process).

0 commit comments

Comments
 (0)