Diffusion-Based Imaginative Coordination for Bimanual Manipulation

Figure 1: Framework overview of our diffusion-based policy.

Figure 2: Task visualization and results overview (2 ALOHA + 16 RoboTwin + 4 Real-world tasks).

📰 News

June 25th, 2025: Our paper is accepted by ICCV 2025.

May 20th, 2025: We released our code and model.

Clone the source code

https://github.com/return-sleep/Diffusion_based_imaginative_Coordination.git
cd Diffusion_based_imaginative_Coordination

ALOHA

🔧 Installation

Install the required packages, see INSTALLATION_ALOHA.md

📦 Download dataset and Change dataset path

Download the dataset from ALOHA_Data
Modify constants.py Line 5 to your own dataset path

🚀 Model training and evaluation

Training script

cd ALOHA
bash script/train_eval.sh sim_insertion_human 20000 0 0
# bash script/train_eval.sh <task_name> <num_steps> <seed> <cuda_id>

Evaluation script

bash script/eval.sh sim_insertion_human 20000 0 0 0 
# bash script/train_eval.sh <task_name> <num_steps> <seed> <cuda_id> <ckpt_type>

RoboTwin

🔧 Installation

conda create -n RoboTwin python=3.10

Install the required packages for RoboTwin, see INSTALLATION_RoboTwin.md
Install the required packages for Cosmos-Tokenizer and download the checkpoints from Hugging Face, see Cosmos-Tokenizer
Install the required packages for policy deployment

pip install diffusers wandb ipdb gpustat dm_control omegaconf hydra-core==1.2.0 einops==0.4.1 diffusers==0.11.1 numba==0.56.4 moviepy imageio av matplotlib termcolor

📦 Data collection and preprocessing

cd RoboTwin
bash run_task.sh block_hammer_beat 0
# bash run_task.sh ${task_name} ${gpu_id}
python script/pkl2zarr_mypolicy.py block_hammer_beat D435 100
# python script/pkl2zarr_mypolicy.py ${task_name} ${head_camera_type} ${expert_data_num}

🚀 Model training and evaluation

Training script

cd policy/ACT-DP-TP
bash scripts/act_dp_tp/train.sh block_hammer_beat 0 0 
# bash scripts/train.sh ${task_name} ${gpu_id} ${seed}

Evaluation script

bash scripts/act_dp_tp/eval.sh block_hammer_beat 0 0 0
# bash scripts/eval.sh ${task_name} ${gpu_id} ${seed} ${ckpt_type}

🙏 Acknowledgements

Our project builds upon the following excellent repositories:

We sincerely thank the authors for their inspiring work and open-source contributions.

Citation

If you find our work helpful, please cite us:

@misc{xu2025diffusionbasedimaginativecoordinationbimanual,
      title={Diffusion-Based Imaginative Coordination for Bimanual Manipulation}, 
      author={Huilin Xu and Jian Ding and Jiakun Xu and Ruixiang Wang and Jun Chen and Jinjie Mai and Yanwei Fu and Bernard Ghanem and Feng Xu and Mohamed Elhoseiny},
      year={2025},
      eprint={2507.11296},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2507.11296}, 
}

License

All the code, model weights, and data are licensed under MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
ALOHA		ALOHA
RoboTwin		RoboTwin
assets		assets
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diffusion-Based Imaginative Coordination for Bimanual Manipulation

📰 News

Clone the source code

ALOHA

🔧 Installation

📦 Download dataset and Change dataset path

🚀 Model training and evaluation

Training script

Evaluation script

RoboTwin

🔧 Installation

📦 Data collection and preprocessing

🚀 Model training and evaluation

Training script

Evaluation script

🙏 Acknowledgements

Citation

License

About

Uh oh!

Releases

Packages

Languages

License

ChengyangHE/Diffusion_based_imaginative_Coordination

Folders and files

Latest commit

History

Repository files navigation

Diffusion-Based Imaginative Coordination for Bimanual Manipulation

📰 News

Clone the source code

ALOHA

🔧 Installation

📦 Download dataset and Change dataset path

🚀 Model training and evaluation

Training script

Evaluation script

RoboTwin

🔧 Installation

📦 Data collection and preprocessing

🚀 Model training and evaluation

Training script

Evaluation script

🙏 Acknowledgements

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages