GitHub - yongliang-wu/DoCo: [AAAI2025] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient

Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient

News

🔥 [2024.12.10] Our paper is accepted by AAAI-2025 !

Introduction

This repository contains the PyTorch implementation for the paper Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient.

In this paper, we propose DoCo (Domain Correction), a novel concept domain correction framework for machine unlearning in text-to-image diffusion models. Our method addresses two major challenges in existing machine unlearning approaches: limited generalization and utility degradation. DoCo achieves comprehensive concept unlearning through:

A domain correction mechanism that aligns the output domains of sensitive and anchor concepts via adversarial training, ensuring effective unlearning across both seen and unseen prompts.
A concept-preserving gradient surgery technique that mitigates conflicting gradient components, maintaining the model's overall utility while selectively removing targeted concepts.

Our experiments demonstrate superior performance in unlearning various types of concepts (instances, styles, and offensive content) while preserving model functionality, even for out-of-distribution prompts.

Getting Started

git clone [email protected]:yongliang-wu/DoCo.git
cd DoCo
conda create -n DoCo python=3.10
conda activate DoCo
pip install -r requirements.txt

Training

Before training, please replace the site-packages/diffusers/schedulers/scheduling_ddpm.py file with the DoCo/scheduling_ddpm.py file provided in this repository. Our version includes an additional step_batch function and modifications to the _get_variance function to enable batch processing.

Note: To achieve the best results, set different hyperparameters such as anchor concept, max_train_steps, warm_up, and learning_rate. These hyperparameters may vary depending on distinct concepts.

Unlearning Style

Setup accelerate config and pretrained model and then launch training.

accelerate config
export MODEL_NAME="CompVis/stable-diffusion-v1-4"
export OUTPUT_DIR="logs_ablation/vangogh"

accelerate launch train.py \
          --pretrained_model_name_or_path=$MODEL_NAME  \
          --output_dir=$OUTPUT_DIR \
          --class_data_dir=./data/samples_painting/ \
          --class_prompt="painting"  \
          --caption_target "van gogh" \
          --concept_type style \
          --resolution=512  \
          --train_batch_size=8 \
          --learning_rate=6e-6  \
          --max_train_steps=2000 \
          --scale_lr --hflip --noaug \
          --parameter_group cross-attn \
          --allow_tf32 \
          --enable_xformers_memory_efficient_attention \
          --warm_up=1000 \
          --with_prior_preservation \
          --lambda_ 1 \
          --gradient_clip \
          --dlr 0.0005

Unlearning Instance

accelerate config
export MODEL_NAME="CompVis/stable-diffusion-v1-4"
export OPENAI_API_KEY="provide-your-openai-api-key"
export OUTPUT_DIR="logs_ablation/parachute"

accelerate launch train.py \
          --pretrained_model_name_or_path=$MODEL_NAME  \
          --output_dir=$OUTPUT_DIR \
          --class_data_dir=./data/samples_airplane/ \
          --class_prompt="airplane" \
          --caption_target "airplane+parachute" \
          --concept_type object \
          --resolution=512  \
          --train_batch_size=8  \
          --learning_rate=6e-6  \
          --max_train_steps=2000 \
          --scale_lr --hflip \
          --parameter_group cross-attn \
          --enable_xformers_memory_efficient_attention \
          --warm_up=1000 \
          --with_prior_preservation \
          --dlr 0.0001 \
          --lambda_ 1 \
          --gradient_clip

Unlearning Nudity

accelerate config
export MODEL_NAME="CompVis/stable-diffusion-v1-4"
export OPENAI_API_KEY="provide-your-openai-api-key"
export OUTPUT_DIR="logs_ablation/nudity"

accelerate launch train.py \
          --pretrained_model_name_or_path=$MODEL_NAME  \
          --output_dir=$OUTPUT_DIR \
          --class_data_dir=./data/samples_clothed/ \
          --class_prompt="clothed" \
          --caption_target "clothed+nudity" \
          --concept_type object \
          --resolution=512  \
          --train_batch_size=8  \
          --learning_rate=6e-6  \
          --max_train_steps=2000 \
          --scale_lr --hflip \
          --parameter_group cross-attn \
          --enable_xformers_memory_efficient_attention \
          --warm_up=1000 \
          --with_prior_preservation \
          --dlr 0.0001 \
          --lambda_ 1 \
          --gradient_clip

Inference

You can download our checkpoint from this Google Drive link.

from DoCo.model_pipeline import CustomDiffusionPipeline
import torch

pipe = CustomDiffusionPipeline.from_pretrained("CompVis/stable-diffusion-v1-4", torch_dtype=torch.float16).to("cuda")
pipe.load_model('logs_ablation/vangogh/delta.bin')
image = pipe("painting of a house in the style of van gogh", num_inference_steps=50, guidance_scale=6., eta=1.).images[0]

image.save("vangogh.png")

Evaluation

Please refer to the GitHub repository of SPM.

Citation

@inproceedings{wu2025unlearning,
  title={Unlearning concepts in diffusion model via concept domain correction and concept preserving gradient},
  author={Wu, Yongliang and Zhou, Shiji and Yang, Mingzhuo and Wang, Lianzhe and Chang, Heng and Zhu, Wenbo and Hu, Xinting and Zhou, Xiao and Yang, Xu},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={8},
  pages={8496--8504},
  year={2025}
}

Acknowledgements

We extend our gratitude to the following repositories for their contributions and resources:

Their works have significantly contributed to the development of our work.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
DoCo		DoCo
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient

News

Introduction

Getting Started

Training

Inference

Evaluation

Citation

Acknowledgements

About

Uh oh!

Uh oh!

Languages

yongliang-wu/DoCo

Folders and files

Latest commit

History

Repository files navigation

Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient

News

Introduction

Getting Started

Training

Inference

Evaluation

Citation

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages