Hi, thanks for the great work!
I'm trying to reproduce the evaluation results in the paper, specifically the experiments on user instructions.
I have a few questions:
- Instruction dataset release: Are there plans to release the full set of 87 user instructions used for evaluation? This would be very helpful for fair comparison and reproducibility.
- Evaluation protocol details: How do the instructions relate to the 8 HM3D datasets?