README.md: 2 additions & 2 deletions
@@ -133,7 +133,7 @@ A more technical version:
* **Adaptation to New Scenarios:**
-Implement agent-environment interaction logic in a single `Workflow`/`MultiTurnWorkflow`/`RewardPropagationWorkflow` class ([Example](./docs/sphinx_doc/source/tutorial/example_multi_turn.md)),
+Implement agent-environment interaction logic in a single workflow class ([Example](./docs/sphinx_doc/source/tutorial/example_multi_turn.md)),
or import existing workflows from agent frameworks like AgentScope ([Example](./docs/sphinx_doc/source/tutorial/example_react.md)).
@@ -369,7 +369,7 @@ Tutorials for RL algorithm development/research with Trinity-RFT:
+ [RL algorithm development with Trinity-RFT](./docs/sphinx_doc/source/tutorial/example_mix_algo.md)
-Guidelines for full configurations:
+Guidelines for full configurations:
+ See [this document](./docs/sphinx_doc/source/tutorial/trinity_configs.md)