You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -90,12 +90,12 @@ It is designed to support diverse application scenarios and serve as a unified p
90
90
91
91
***Adaptation to New Scenarios:**
92
92
93
-
Implement agent-environment interaction logic in a single `Workflow` or `MultiTurnWorkflow` class. ([Example](./docs/sphinx_doc/source/tutorial/example_multi_turn.md))
93
+
Implement agent-environment interaction logic in a single `Workflow` or `MultiTurnWorkflow` class. ([Example](/tutorial/example_multi_turn.md))
94
94
95
95
96
96
***RL Algorithm Development:**
97
97
98
-
Develop custom RL algorithms (loss design, sampling, data processing) in compact, plug-and-play classes. ([Example](./docs/sphinx_doc/source/tutorial/example_mix_algo.md))
98
+
Develop custom RL algorithms (loss design, sampling, data processing) in compact, plug-and-play classes. ([Example](/tutorial/example_mix_algo.md))
99
99
100
100
101
101
***Low-Code Usage:**
@@ -301,39 +301,39 @@ For studio users, click "Run" in the web interface.
301
301
302
302
Tutorials for running different RFT modes:
303
303
304
-
+[Quick example: GRPO on GSM8k](./docs/sphinx_doc/source/tutorial/example_reasoning_basic.md)
Copy file name to clipboardExpand all lines: docs/sphinx_doc/source/tutorial/example_data_functionalities.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ In this example, you will learn how to apply the data processor of Trinity-RFT t
8
8
2. how to configure the data processor
9
9
3. what the data processor can do
10
10
11
-
Before getting started, you need to prepare the main environment of Trinity-RFT according to the [installation section of the README file](../main.md),
11
+
Before getting started, you need to prepare the main environment of Trinity-RFT according to the [installation section of Quickstart](example_reasoning_basic.md),
12
12
and store the base url and api key in the environment variables `OPENAI_BASE_URL` and `OPENAI_API_KEY` for some agentic or API-model usages if necessary.
Copy file name to clipboardExpand all lines: docs/sphinx_doc/source/tutorial/example_multi_turn.md
+31-1Lines changed: 31 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,7 +14,37 @@ To run the ALFworld and WebShop env, you need to setup the corresponding environ
14
14
- ALFworld is a text-based interactive environment that simulates household scenarios. Agents need to understand natural language instructions and complete various domestic tasks like finding objects, moving items, and operating devices in a virtual home environment.
15
15
- WebShop is a simulated online shopping environment where AI agents learn to shop based on user requirements. The platform allows agents to browse products, compare options, and make purchase decisions, mimicking real-world e-commerce interactions.
16
16
17
-
You may refer to their original environment to complete the setup.
17
+
<br>
18
+
<details>
19
+
<summary>Guidelines for preparing ALFWorld environment</summary>
20
+
21
+
1. Pip install: `pip install alfworld[full]`
22
+
23
+
2. Export the path: `export ALFWORLD_DATA=/path/to/alfworld/data`
24
+
25
+
3. Download the environment: `alfworld-download`
26
+
27
+
Now you can find the environment in `$ALFWORLD_DATA` and continue with the following steps.
28
+
</details>
29
+
30
+
<details>
31
+
<summary>Guidelines for preparing WebShop environment</summary>
32
+
33
+
1. Install Python 3.8.13
34
+
35
+
2. Install Java
36
+
37
+
3. Download the source code: `git clone https://github.com/princeton-nlp/webshop.git webshop`
38
+
39
+
4. Create a virtual environment: `conda create -n webshop python=3.8.13` and `conda activate webshop`
40
+
41
+
5. Install requirements into the `webshop` virtual environment via the `setup.sh` script: `./setup.sh [-d small|all]`
42
+
43
+
Now you can continue with the following steps.
44
+
</details>
45
+
<br>
46
+
47
+
You may refer to their original environment for more details.
18
48
- For ALFWorld, refer to the [ALFWorld](https://github.com/alfworld/alfworld) repository.
19
49
- For WebShop, refer to the [WebShop](https://github.com/princeton-nlp/WebShop) repository.
0 commit comments