Commit e2a60f1

feat: support alfworld multiturn long thought
1 parent 3cb1202 commit e2a60f1

2 files changed: +106 -1 lines changed

openmanus_rl/agentbench/configs/tasks/alfworld.yaml

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ default:
  name: alfworld-std
  data_path: "/AgentBench/data/alfworld"
  config_path: "src/server/tasks/alfworld/configs/base_config.yaml"
- prompts_path: "src/server/tasks/alfworld/prompts/alfworld_multiturn_plan_first.json"
+ prompts_path: "src/server/tasks/alfworld/prompts/alfworld_multiturn_long_thought.json"
  split: "standard"
  max_step: 35
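Because this hunk only repoints `prompts_path`, whatever reads the config ends up consuming a differently shaped set of few-shot turns. The following is a minimal sketch of a sanity check one might run on such a file, assuming it maps task-type keys to lists that alternate `Observation:` and `Thought:` strings, as the JSON file added in this commit does. The function name `check_prompt_file` and the shortened inline sample are hypothetical and not part of the repository.

```python
import json

# Task-type keys that appear in the prompt file added by this commit.
EXPECTED_KEYS = {"put", "clean", "heat", "cool", "puttwo", "examine"}


def check_prompt_file(prompts: dict) -> list:
    """Return a list of human-readable problems; an empty list means the shape looks fine."""
    problems = []
    missing = EXPECTED_KEYS - set(prompts)
    if missing:
        problems.append(f"missing task types: {sorted(missing)}")
    for task, turns in prompts.items():
        # Each example should be a sequence of (Observation, Thought) pairs.
        if len(turns) % 2 != 0:
            problems.append(f"{task}: odd number of turns ({len(turns)})")
        for i, turn in enumerate(turns):
            expected = "Observation:" if i % 2 == 0 else "Thought:"
            if not turn.startswith(expected):
                problems.append(f"{task}[{i}]: expected a turn starting with {expected!r}")
    return problems


# Hypothetical, heavily shortened stand-in for the real file contents.
sample = json.loads(
    '{"put": ["Observation: ...\\n", "Thought:\\n...\\n\\nAction: go to cabinet 1\\n"]}'
)
problems = check_prompt_file(sample)
```

On the shortened sample the only reported problem is the five missing task types; the real file would be loaded with `json.load` from the path named in the config.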

Lines changed: 105 additions & 0 deletions
@@ -0,0 +1,105 @@
{
"put": [
"Observation: You are in the middle of a room. Looking quickly around you, you see a cabinet 4, a cabinet 3, a cabinet 2, a cabinet 1, a countertop 1, a garbagecan 1, a handtowelholder 2, a handtowelholder 1, a sinkbasin 2, a sinkbasin 1, a toilet 1, a toiletpaperhanger 1, and a towelholder 1.\nYour task is to: put some spraybottle on toilet.\n",
"Thought:\nMemory: [Task: put some spraybottle on toilet]\n[Need to find a spraybottle.]\nPlan: 1. Search for a spraybottle. 2. Take the spraybottle. 3. Go to the toilet. 4. Put the spraybottle on the toilet.\n\nAction: go to cabinet 1\n",
"Observation: On the cabinet 1, you see a cloth 1, a soapbar 1, a soapbottle 1.\n",
"Thought:\nMemory: [Task: put some spraybottle on toilet, On cabinet 1: cloth 1, soapbar 1, soapbottle 1]\n[No spraybottle in cabinet 1. Need to search elsewhere.]\nPlan: 1. Search for a spraybottle in another location.\n\nAction: go to cabinet 2\n",
"Observation: The cabinet 2 is closed.\n",
"Thought:\nMemory: [Task: put some spraybottle on toilet, cabinet 2 is closed]\n[Cabinet 2 is closed. Need to open it to check inside.]\nPlan: 1. Open cabinet 2. 2. Check contents of cabinet 2.\n\nAction: open cabinet 2\n",
"Observation: You open the cabinet 2. The cabinet 2 is open. In it, you see a candle 1, and a spraybottle 2.\n",
"Thought:\nMemory: [Task: put some spraybottle on toilet, In cabinet 2: candle 1, spraybottle 2]\n[Found a spraybottle. Need to take it and go to the target location.]\nPlan: 1. Take the spraybottle. 2. Go to the toilet.\n\nAction: take spraybottle 2 from cabinet 2\n",
"Observation: You pick up the spraybottle 2 from the cabinet 2.\n",
"Thought:\nMemory: [Task: put some spraybottle on toilet, Holding: spraybottle 2]\n[Have the spraybottle. Need to go to the toilet.]\nPlan: 1. Go to the toilet.\n\nAction: go to toilet 1\n",
"Observation: On the toilet 1, you see a soapbottle 2.\n",
"Thought:\nMemory: [Task: put some spraybottle on toilet, Holding: spraybottle 2, On toilet 1: soapbottle 2]\n[At the toilet. Need to put the spraybottle on it.]\nPlan: 1. Put the spraybottle on the toilet.\n\nAction: put spraybottle 2 in/on toilet 1\n",
"Observation: You put the spraybottle 2 in/on the toilet 1.\n",
"Thought:\nMemory: [Task: put some spraybottle on toilet, spraybottle 2 is on toilet 1]\n[Task completed.]\nPlan: 1. Signal task success.\n\nAction: Task succeeded.\n"
],
"clean": [
"Observation: You are in the middle of a room. Looking quickly around you, you see a bathtubbasin 1, a garbagecan 1, a handtowelholder 1, a shelf 3, a shelf 2, a shelf 1, a sinkbasin 1, a toilet 1, a toiletpaperhanger 1, and a towelholder 1.\nYour task is to: clean some soapbar and put it in toilet.\n",
"Thought:\nMemory: [Task: clean some soapbar and put it in toilet]\n[Need to find a soapbar, clean it, and put it in the toilet. Soapbar likely near toilet or sink.]\nPlan: 1. Search for a soapbar near the toilet. 2. Take the soapbar. 3. Go to the sinkbasin. 4. Clean the soapbar. 5. Go to the toilet. 6. Put the soapbar in the toilet.\n\nAction: go to toilet 1\n",
"Observation: On the toilet 1, you see a soapbar 4, and a soapbar 3.\n",
"Thought:\nMemory: [Task: clean some soapbar and put it in toilet, On toilet 1: soapbar 4, soapbar 3]\n[Found soapbars. Need to take one and clean it.]\nPlan: 1. Take a soapbar. 2. Go to the sinkbasin to clean it.\n\nAction: take soapbar 4 from toilet 1\n",
"Observation: You pick up the soapbar 4 from the toilet 1.\n",
"Thought:\nMemory: [Task: clean some soapbar and put it in toilet, Holding: soapbar 4]\n[Have the soapbar. Need to go to the sinkbasin to clean it.]\nPlan: 1. Go to the sinkbasin. 2. Clean the soapbar.\n\nAction: go to sinkbasin 1\n",
"Observation: On the sinkbasin 1, you see nothing.\n",
"Thought:\nMemory: [Task: clean some soapbar and put it in toilet, Holding: soapbar 4, At sinkbasin 1]\n[At the sinkbasin. Need to clean the soapbar.]\nPlan: 1. Clean the soapbar with the sinkbasin.\n\nAction: clean soapbar 4 with sinkbasin 1\n",
"Observation: You clean the soapbar 4 using the sinkbasin 1.\n",
"Thought:\nMemory: [Task: clean some soapbar and put it in toilet, Holding: cleaned soapbar 4]\n[Soapbar is clean. Need to put it in the toilet.]\nPlan: 1. Go to the toilet. 2. Put the soapbar in the toilet.\n\nAction: go to toilet 1\n",
"Observation: On the toilet 1, you see a soapbar 3.\n",
"Thought:\nMemory: [Task: clean some soapbar and put it in toilet, Holding: cleaned soapbar 4, On toilet 1: soapbar 3]\n[At the toilet. Need to put the cleaned soapbar in it.]\nPlan: 1. Put the cleaned soapbar in the toilet.\n\nAction: put soapbar 4 in/on toilet 1\n",
"Observation: You put the soapbar 4 in/on the toilet 1.\n",
"Thought:\nMemory: [Task: clean some soapbar and put it in toilet, cleaned soapbar 4 is on toilet 1]\n[Task completed.]\nPlan: 1. Signal task success.\n\nAction: Task succeeded.\n"
],
"heat": [
"Observation: You are in the middle of a room. Looking quickly around you, you see a cabinet 13, a cabinet 12, a cabinet 11, a cabinet 10, a cabinet 9, a cabinet 8, a cabinet 7, a cabinet 6, a cabinet 5, a cabinet 4, a cabinet 3, a cabinet 2, a cabinet 1, a coffeemachine 1, a countertop 1, a diningtable 1, a drawer 1, a fridge 1, a garbagecan 1, a microwave 1, a shelf 3, a shelf 2, a shelf 1, a sinkbasin 1, a stoveburner 4, a stoveburner 3, a stoveburner 2, a stoveburner 1, and a toaster 1.\nYour task is to: put a hot apple in fridge.\n",
"Thought:\nMemory: [Task: put a hot apple in fridge]\n[Need to find an apple, heat it, and put it in the fridge. Apple likely on dining table or in fridge.]\nPlan: 1. Search for an apple on dining tables. 2. Take the apple. 3. Go to the microwave. 4. Heat the apple. 5. Go to the fridge. 6. Put the apple in the fridge.\n\nAction: go to diningtable 1\n",
"Observation: On the diningtable 1, you see a apple 1, a bread 1, a fork 3, a fork 2, a fork 1, a mug 2, a peppershaker 3, a plate 2, a pot 1, a soapbottle 3, a spatula 1, a spoon 3, a spoon 2, a spoon 1, a winebottle 3, and a winebottle 2.\n",
"Thought:\nMemory: [Task: put a hot apple in fridge, On diningtable 1: apple 1, ...]\n[Found an apple. Need to take it and heat it.]\nPlan: 1. Take the apple. 2. Go to the microwave to heat it.\n\nAction: take apple 1 from diningtable 1\n",
"Observation: You pick up the apple 1 from the diningtable 1.\n",
"Thought:\nMemory: [Task: put a hot apple in fridge, Holding: apple 1]\n[Have the apple. Need to go to the microwave to heat it.]\nPlan: 1. Go to the microwave. 2. Heat the apple.\n\nAction: go to microwave 1\n",
"Observation: The microwave 1 is closed.\n",
"Thought:\nMemory: [Task: put a hot apple in fridge, Holding: apple 1, Microwave 1 is closed]\n[At the microwave. Need to heat the apple.]\nPlan: 1. Heat the apple with the microwave.\n\nAction: heat apple 1 with microwave 1\n",
"Observation: You heat the apple 1 using the microwave 1.\n",
"Thought:\nMemory: [Task: put a hot apple in fridge, Holding: hot apple 1]\n[Apple is hot. Need to go to the fridge and put it in.]\nPlan: 1. Go to the fridge. 2. Put the hot apple in the fridge.\n\nAction: go to fridge 1\n",
"Observation: The fridge 1 is open. In it, you see a cup 1, and a egg 1.\n",
"Thought:\nMemory: [Task: put a hot apple in fridge, Holding: hot apple 1, Fridge 1 is open]\n[At the fridge. Need to put the hot apple in it.]\nPlan: 1. Put the hot apple in the fridge.\n\nAction: put apple 1 in/on fridge 1\n",
"Observation: You put the apple 1 in/on the fridge 1.\n",
"Thought:\nMemory: [Task: put a hot apple in fridge, hot apple 1 is in fridge 1]\n[Task completed.]\nPlan: 1. Signal task success.\n\nAction: Task succeeded.\n"
],
"cool": [
"Observation: You are in the middle of a room. Looking quickly around you, you see a cabinet 16, a cabinet 15, a cabinet 14, a cabinet 13, a cabinet 12, a cabinet 11, a cabinet 10, a cabinet 9, a cabinet 8, a cabinet 7, a cabinet 6, a cabinet 5, a cabinet 4, a cabinet 3, a cabinet 2, a cabinet 1, a coffeemachine 1, a countertop 2, a countertop 1, a diningtable 1, a drawer 5, a drawer 4, a drawer 3, a drawer 2, a drawer 1, a fridge 1, a garbagecan 1, a microwave 1, a safe 1, a sinkbasin 1, a stoveburner 4, a stoveburner 3, a stoveburner 2, a stoveburner 1, and a toaster 1.\nYour task is to: cool some pan and put it in stoveburner.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner]\n[Need to find a pan, cool it, and put it on a stoveburner. Pan likely on a stoveburner.]\nPlan: 1. Search for a pan on stoveburners. 2. Take the pan. 3. Go to the fridge. 4. Cool the pan. 5. Go to a stoveburner. 6. Put the pan on the stoveburner.\n\nAction: go to stoveburner 1\n",
"Observation: On the stoveburner 1, you see nothing.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, On stoveburner 1: nothing]\n[No pan on stoveburner 1. Continue searching.]\nPlan: 1. Search for a pan on stoveburner 2.\n\nAction: go to stoveburner 2\n",
"Observation: On the stoveburner 2, you see a pot 1.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, On stoveburner 2: pot 1]\n[No pan on stoveburner 2. Continue searching.]\nPlan: 1. Search for a pan on stoveburner 3.\n\nAction: go to stoveburner 3\n",
"Observation: On the stoveburner 3, you see a pan 1.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, On stoveburner 3: pan 1]\n[Found a pan. Need to take it and cool it.]\nPlan: 1. Take the pan. 2. Go to the fridge to cool it.\n\nAction: take pan 1 from stoveburner 3\n",
"Observation: You pick up the pan 1 from the stoveburner 3.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, Holding: pan 1]\n[Have the pan. Need to go to the fridge to cool it.]\nPlan: 1. Go to the fridge. 2. Cool the pan.\n\nAction: go to fridge 1\n",
"Observation: The fridge 1 is closed.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, Holding: pan 1, Fridge 1 is closed]\n[At the fridge. Need to cool the pan.]\nPlan: 1. Cool the pan with the fridge.\n\nAction: cool pan 1 with fridge 1\n",
"Observation: You cool the pan 1 using the fridge 1.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, Holding: cooled pan 1]\n[Pan is cooled. Need to return it to a stoveburner.]\nPlan: 1. Go to a stoveburner. 2. Put the pan on the stoveburner.\n\nAction: go to stoveburner 1\n",
"Observation: On the stoveburner 1, you see nothing.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, Holding: cooled pan 1, At stoveburner 1]\n[At a stoveburner. Need to put the cooled pan on it.]\nPlan: 1. Put the cooled pan on the stoveburner.\n\nAction: put pan 1 in/on stoveburner 1\n",
"Observation: You put the pan 1 in/on the stoveburner 1.\n",
"Thought:\nMemory: [Task: cool some pan and put it in stoveburner, cooled pan 1 is on stoveburner 1]\n[Task completed.]\nPlan: 1. Signal task success.\n\nAction: Task succeeded.\n"
],
"puttwo": [
"Observation: You are in the middle of a room. Looking quickly around you, you see a coffeetable 1, a diningtable 1, a drawer 4, a drawer 3, a drawer 2, a drawer 1, a dresser 1, a garbagecan 1, a sidetable 2, a sidetable 1, and a sofa 1.\nYour task is to: put two cellphone in sofa.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa]\n[Need to find two cellphones and put them on the sofa. Will search common areas like tables and drawers.]\nPlan: 1. Search for the first cellphone. 2. Take the first cellphone. 3. Go to the sofa. 4. Put the first cellphone on the sofa. 5. Search for the second cellphone. 6. Take the second cellphone. 7. Go to the sofa. 8. Put the second cellphone on the sofa.\n\nAction: go to coffeetable 1\n",
"Observation: On the coffeetable 1, you see a box 1, a cellphone 3, and a plate 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, On coffeetable 1: cellphone 3, ...]\n[Found a cellphone. Need to take it and move it to the sofa.]\nPlan: 1. Take the cellphone. 2. Go to the sofa. 3. Put the cellphone on the sofa.\n\nAction: take cellphone 3 from coffeetable 1\n",
"Observation: You pick up the cellphone 3 from the coffeetable 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, Holding: cellphone 3]\n[Have one cellphone. Need to put it on the sofa.]\nPlan: 1. Go to the sofa. 2. Put the cellphone on the sofa.\n\nAction: go to sofa 1\n",
"Observation: On the sofa 1, you see a newspaper 2, a pillow 1, a remotecontrol 2, and a remotecontrol 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, Holding: cellphone 3, At sofa 1]\n[At the sofa with the first cellphone. Need to put it on the sofa.]\nPlan: 1. Put the cellphone on the sofa.\n\nAction: put cellphone 3 in/on sofa 1\n",
"Observation: You put the cellphone 3 in/on the sofa 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, cellphone 3 is on sofa 1]\n[One cellphone placed. Need to find the second one.]\nPlan: 1. Search for the second cellphone (continue searching common areas).\n\nAction: go to diningtable 1\n",
"Observation: On the diningtable 1, you see a cellphone 2, a keychain 2, a laptop 1, a statue 2, and a statue 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, On diningtable 1: cellphone 2, ...]\n[Found the second cellphone. Need to take it and move it to the sofa.]\nPlan: 1. Take the second cellphone. 2. Go to the sofa. 3. Put the second cellphone on the sofa.\n\nAction: take cellphone 2 from diningtable 1\n",
"Observation: You pick up the cellphone 2 from the diningtable 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, Holding: cellphone 2]\n[Have the second cellphone. Need to put it on the sofa.]\nPlan: 1. Go to the sofa. 2. Put the cellphone on the sofa.\n\nAction: go to sofa 1\n",
"Observation: On the sofa 1, you see a cellphone 3, a newspaper 2, a pillow 1, a remotecontrol 2, and a remotecontrol 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, Holding: cellphone 2, At sofa 1, cellphone 3 is already there]\n[At the sofa with the second cellphone. Need to put it on the sofa.]\nPlan: 1. Put the second cellphone on the sofa.\n\nAction: put cellphone 2 in/on sofa 1\n",
"Observation: You put the cellphone 2 in/on the sofa 1.\n",
"Thought:\nMemory: [Task: put two cellphone in sofa, cellphone 3 and cellphone 2 are on sofa 1]\n[Both cellphones placed. Task completed.]\nPlan: 1. Signal task success.\n\nAction: Task succeeded.\n"
],
"examine": [
"Observation: You are in the middle of a room. Looking quickly around you, you see a coffeetable 1, a diningtable 1, a drawer 4, a drawer 3, a drawer 2, a drawer 1, a dresser 1, a garbagecan 1, a sidetable 2, a sidetable 1, and a sofa 1.\nYour task is to: look at statue under the desklamp.\n",
"Thought:\nMemory: [Task: look at statue under the desklamp]\n[Need to find a statue and a desklamp, then use the desklamp to look at the statue. Will search common areas.]\nPlan: 1. Search for a statue. 2. Take the statue. 3. Search for a desklamp. 4. Go to the desklamp's location. 5. Use the desklamp.\n\nAction: go to dresser 1\n",
"Observation: On the dresser 1, you see a cellphone 3, a newspaper 2, a statue 1, and a television 1.\n",
"Thought:\nMemory: [Task: look at statue under the desklamp, On dresser 1: statue 1, ...]\n[Found a statue. Need to take it and then find a desklamp.]\nPlan: 1. Take the statue. 2. Search for a desklamp.\n\nAction: take statue 1 from dresser 1\n",
"Observation: You pick up the statue 1 from the dresser 1.\n",
"Thought:\nMemory: [Task: look at statue under the desklamp, Holding: statue 1]\n[Have the statue. Need to find a desklamp.]\nPlan: 1. Search for a desklamp (continue searching common areas).\n\nAction: go to sidetable 1\n",
"Observation: On the sidetable 1, you see nothing.\n",
"Thought:\nMemory: [Task: look at statue under the desklamp, Holding: statue 1, On sidetable 1: nothing]\n[No desklamp on sidetable 1. Continue searching.]\nPlan: 1. Search for a desklamp on sidetable 2.\n\nAction: go to sidetable 2\n",
"Observation: On the sidetable 2, you see a desklamp 3, a newspaper 1, and a statue 2.\n",
"Thought:\nMemory: [Task: look at statue under the desklamp, Holding: statue 1, On sidetable 2: desklamp 3, ...]\n[Found a desklamp. The task requires looking at the statue under the desklamp, which implies using the desklamp.]\nPlan: 1. Use the desklamp.\n\nAction: use desklamp 3\n",
"Observation: You turn on the desklamp 3.\n",
"Thought:\nMemory: [Task: look at statue under the desklamp, desklamp 3 is on]\n[The desklamp is on. Task completed.]\nPlan: 1. Signal task success.\n\nAction: Task succeeded.\n"
]
}
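
Every `Thought:` turn in the file above follows the same layout: a bracketed `Memory:` line, a bracketed reasoning line, a `Plan:` line, a blank line, and an `Action:` line. A minimal sketch of pulling those fields apart is given here, assuming that layout holds for every turn. The layout is read off the examples rather than taken from a documented specification, and `parse_thought` is a hypothetical helper, not a function from the repository.

```python
import re

# One regex per turn; "." does not match newlines, so each group stays on its own line.
THOUGHT_RE = re.compile(
    r"Thought:\n"
    r"Memory: \[(?P<memory>.*)\]\n"
    r"\[(?P<reasoning>.*)\]\n"
    r"Plan: (?P<plan>.*)\n\n"
    r"Action: (?P<action>.*)\n?"
)


def parse_thought(turn: str) -> dict:
    """Split one Thought turn into its memory, reasoning, plan and action strings."""
    match = THOUGHT_RE.fullmatch(turn)
    if match is None:
        raise ValueError("turn does not follow the Memory / [reasoning] / Plan / Action layout")
    return match.groupdict()


# First Thought turn of the "put" example, copied from the file above.
turn = (
    "Thought:\nMemory: [Task: put some spraybottle on toilet]\n"
    "[Need to find a spraybottle.]\n"
    "Plan: 1. Search for a spraybottle. 2. Take the spraybottle. "
    "3. Go to the toilet. 4. Put the spraybottle on the toilet.\n\n"
    "Action: go to cabinet 1\n"
)
parsed = parse_thought(turn)
print(parsed["action"])  # go to cabinet 1
```

The plan is kept as one raw string on purpose: splitting on `N. ` markers would misfire on steps such as `Open cabinet 2. 2. Check contents of cabinet 2.`, where an object index is followed by a period.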
