MangaDiT: Reference-Guided Line Art Colorization

Official implementation of MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers.

Introduction

We propose MangaDiT, a powerful model for reference-guided line art colorization based on Diffusion Transformers (DiT). Our model takes both line art and reference images as conditional inputs and introduces a hierarchical attention mechanism with a dynamic attention weighting strategy. This mechanism augments the vanilla attention with an additional context-aware path that leverages pooled spatial features, effectively expanding the model’s receptive field and enhancing region-level color alignment.

Update

2025-10-15: Inference code and paper are released.
⭐️ We will open the training code and benchmark datasets publicly upon acceptance of the paper.

Setup

Dependencies

GPU: NVIDIA A100-80G * 1

Install and requirements

conda create -n mangaDiT python=3.10 -y
conda activate mangaDiT
pip install -r requirements.txt
conda install -y ipykernel
python -m ipykernel install --user --name mangaDiT
huggingface-cli login
(using your own huggingface-cli token)

Inference:

Quick demo with gradio

python src/gradio/gradio_demo.py --share

Alternatively, you can use the Jupyter notebook demo: colorize_demo.ipynb

Acknowledgements

This project is developped on the codebase of FLUX and OminiControl. We appreciate their great work!

Citation

@misc{qiu2025mangaditreferenceguidedlineart,
    title={MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers}, 
    author={Qianru Qiu and Jiafeng Mao and Kento Masui and Xueting Wang},
    year={2025},
    eprint={2508.09709},
    archivePrefix={arXiv},
    primaryClass={cs.CV},
    url={https://arxiv.org/abs/2508.09709}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
annotators		annotators
docs		docs
ops/config		ops/config
runs/20250709-084341		runs/20250709-084341
samples		samples
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
colorize_demo.ipynb		colorize_demo.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MangaDiT: Reference-Guided Line Art Colorization

Introduction

Update

Setup

Inference:

Acknowledgements

Citation

About

Uh oh!

Releases

Packages

Languages

License

CyberAgentAILab/MangaDiT

Folders and files

Latest commit

History

Repository files navigation

MangaDiT: Reference-Guided Line Art Colorization

Introduction

Update

Setup

Inference:

Acknowledgements

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages