This repository provides the official PyTorch implementation for Monte Carlo Tree Diffusion (MCTD) and Fast Monte Carlo Tree Diffusion (Fast-MCTD). Our work demonstrates how to leverage Monte Carlo Tree Search (MCTS) to guide diffusion models during inference, significantly improving planning performance in complex environments such as point mazes and ant mazes.
Monte Carlo Tree Diffusion (MCTD) is a novel framework that improves the inference-time performance of diffusion models by integrating the denoising process with Monte Carlo Tree Search (MCTS).

Authors: Jaesik Yoon, Hyeonseo Cho, Doojin Baek, Yoshua Bengio, Sungjin Ahn

Published in the Proceedings of the International Conference on Machine Learning (ICML) 2025 (Spotlight).

Fast Monte Carlo Tree Diffusion (Fast-MCTD) is an enhanced version of MCTD that improves computational efficiency through parallel tree search and abstract-level diffusion planning.

Authors: Jaesik Yoon*, Hyeonseo Cho*, Yoshua Bengio, Sungjin Ahn (* equal contribution)

Preprint, 2025.
We recommend using Docker to set up the environment for reproducibility.
1. Download MuJoCo binaries: Our Docker setup requires the MuJoCo 2.1.0 binaries. Please download them from this link and place the `mujoco210` directory into `./dockerfile/mujoco/`. The final path should look like `./dockerfile/mujoco/mujoco210`.
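For example, assuming the downloaded archive is the standard Linux tarball (`mujoco210-linux-x86_64.tar.gz`; the exact filename depends on the release you download), the setup might look like:

```bash
# Extract the MuJoCo 2.1.0 binaries into the location expected by the Dockerfile.
mkdir -p ./dockerfile/mujoco
tar -xzf mujoco210-linux-x86_64.tar.gz -C ./dockerfile/mujoco

# Verify the final path exists.
ls ./dockerfile/mujoco/mujoco210
```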
2. Build the Docker image: From the root of the repository, run the following command:

```bash
docker build -t fmctd:0.1 . -f dockerfile/Dockerfile
```
Note on the environment: The Dockerfile installs a customized version of the OGBench benchmark. This customization serves two purposes: it incorporates velocity into the maze environment's observation space and removes randomness from the start and goal positions to reduce performance variance.
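Once the image is built, you can start a container from it. The invocation below is only a sketch (the GPU passthrough and mount flags are our assumptions; adjust them to your setup):

```bash
# Start an interactive container with GPU access and the repository mounted.
docker run --gpus all -it --rm \
    -v "$(pwd)":/workspace -w /workspace \
    fmctd:0.1 bash
```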
3. Log in to Weights & Biases: This project uses Weights & Biases (W&B) for logging. You will need to log in to your W&B account.

```bash
wandb login
```
Download the models from this link.
- `dql_trained_models.tar.gz`: Contains pre-trained models for DQL. Extract this to the `./dql/` directory.
- `planner_trained_models.tar.gz`: Contains pre-trained diffusion models. This requires a specific directory structure to match our W&B logs.
  - First, create the directory path:
    ```bash
    mkdir -p ./output/downloaded/<YOUR_WANDB_ENTITY>/<YOUR_WANDB_PROJECT>
    ```
  - Replace `<YOUR_WANDB_ENTITY>` with your W&B username or entity name.
  - Replace `<YOUR_WANDB_PROJECT>` with a project name of your choice (e.g., `mctd-eval`).
  - Then, extract the archive into that directory, as shown below.
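For example, assuming both archives sit in the repository root, the extraction might look like:

```bash
# Pre-trained DQL models go into ./dql/.
tar -xzf dql_trained_models.tar.gz -C ./dql/

# Pre-trained diffusion planners go into the W&B-style directory created above
# (substitute your own entity and project names).
tar -xzf planner_trained_models.tar.gz \
    -C ./output/downloaded/<YOUR_WANDB_ENTITY>/<YOUR_WANDB_PROJECT>
```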
Example scripts are provided to create evaluation experiments. You need to edit these files to match your W&B setup.
- Open `insert_point_maze_validation_jobs.py` and `insert_antmaze_validation_jobs.py`.
- At the top of each file, set the `WANDB_ENTITY` and `WANDB_PROJECT_NAME` variables to the same values you used in the step above (see the sketch after this list).
- Run the scripts to add the jobs to the queue:
  ```bash
  python insert_point_maze_validation_jobs.py
  python insert_antmaze_validation_jobs.py
  ```
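For illustration, the top of each insertion script might look like the following (the variable names come from the step above; the exact file layout is an assumption):

```python
# Top of insert_point_maze_validation_jobs.py / insert_antmaze_validation_jobs.py.
# Use the same entity and project you created the download directory for.
WANDB_ENTITY = "your-entity"      # your W&B username or team name
WANDB_PROJECT_NAME = "mctd-eval"  # the project name chosen above
```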
Execute the jobs using the `run_jobs.py` and `run_dql_jobs.py` scripts.
- Open the scripts and configure the `available_gpus` variable to specify which GPUs to use (see the sketch below).
- Run the scripts:
  ```bash
  python run_jobs.py
  python run_dql_jobs.py
  ```
The scripts will automatically assign jobs to the available GPUs.
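As a sketch, `available_gpus` might be configured as follows (whether GPU ids are strings or integers is an assumption; check the scripts):

```python
# In run_jobs.py / run_dql_jobs.py: the GPU ids that jobs may be scheduled on.
available_gpus = [0, 1, 2, 3]  # e.g., use the first four GPUs
```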
After all evaluation jobs are complete, aggregate the results.
- Open `summarize_results.py` and set the `group_names` variable to match the experiment groups you want to analyze.
- Run the script:
  ```bash
  python summarize_results.py
  ```
The results will be saved to the `exp_results` directory and printed to the terminal:

```
{'group': 'PMMN-PMCTD', 'success_rate': '100±0', 'planning_time': '11.11±2.13'}
{'group': 'PMLN-PMCTD', 'success_rate': '98±0', 'planning_time': '8.41±1.34'}
{'group': 'PMGN-PMCTD', 'success_rate': '98±0', 'planning_time': '9.68±0.51'}
{'group': 'PMMN-FMCTD', 'success_rate': '100±0', 'planning_time': '1.91±0.20'}
{'group': 'PMLN-FMCTD', 'success_rate': '82±0', 'planning_time': '2.06±0.08'}
{'group': 'PMGN-FMCTD', 'success_rate': '98±0', 'planning_time': '2.71±0.28'}
```
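The `group` field in this output corresponds to the entries of `group_names` in `summarize_results.py`. As a sketch (assuming `group_names` is a plain Python list):

```python
# In summarize_results.py: the experiment groups to aggregate.
group_names = [
    "PMMN-PMCTD", "PMLN-PMCTD", "PMGN-PMCTD",
    "PMMN-FMCTD", "PMLN-FMCTD", "PMGN-FMCTD",
]
```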
The full Weights & Biases logs for the experiments in our paper are publicly available at this link. These logs correspond to the configurations in the example job creation scripts.
To train new models from scratch, follow a similar process:
1. Configure and Create Training Jobs
- Open `insert_diffusion_training_jobs.py` and `insert_dql_training_jobs.py`.
- At the top of each file, set your desired `WANDB_ENTITY`, `WANDB_PROJECT_NAME`, and other training parameters.
- Run the scripts to create the jobs:
  ```bash
  python insert_diffusion_training_jobs.py
  python insert_dql_training_jobs.py
  ```
2. Run the Training
- Configure `available_gpus` in `run_jobs.py`.
- Execute the script to start training:
  ```bash
  python run_jobs.py
  ```
If you find our work useful, please consider citing:
```bibtex
@inproceedings{yoonmonte,
  title={Monte Carlo Tree Diffusion for System 2 Planning},
  author={Yoon, Jaesik and Cho, Hyeonseo and Baek, Doojin and Bengio, Yoshua and Ahn, Sungjin},
  booktitle={Forty-second International Conference on Machine Learning},
  year={2025}
}
```

```bibtex
@article{yoon2025fast,
  title={Fast Monte Carlo Tree Diffusion: 100x Speedup via Parallel Sparse Planning},
  author={Yoon, Jaesik and Cho, Hyeonseo and Bengio, Yoshua and Ahn, Sungjin},
  journal={arXiv preprint arXiv:2506.09498},
  year={2025}
}
```
This repo is forked from Boyuan Chen's research template repo; in particular, it is based on the Diffusion Forcing source code. We thank the authors for making their code publicly available.