We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 5ab294e commit 7617e9fCopy full SHA for 7617e9f
README.md
@@ -10,6 +10,14 @@
10
11
[Paper](https://arxiv.org/abs/2412.21033) | [Website](https://gautierdag.github.io/plancraft/)
12
13
+### ⚠️ Important Note on Baseline Results
14
+
15
+The baseline performance metrics reported in the original paper are underreported due to a bug in the environment that has since been fixed.
16
17
+If you are using Plancraft, please re-run the baselines yourself using the code in this repository and use those as your point of comparison.
18
19
+For a full explanation, please see Issue [#2](/../../issues/2).
20
21
### Plancraft was accepted to COLM 2025!
22
23
Plancraft is a minecraft environment that benchmarks planning in LLM agents with an oracle RAG retriever.
0 commit comments