This repository contains the code and a link to the model weights from the work presented in the paper *Sealing the Backdoor: Unlearning Adversarial Text Triggers in Diffusion Models Using Knowledge Distillation* (arXiv).
The code files are:
- `self_kd.py` - Self-Knowledge Distillation
- `attention_guided_kd.py` - Self-Knowledge Distillation with Cross-Attention Guidance (Gaussian noise matching)
- `attention_guided_kd_black.py` - Self-Knowledge Distillation with Cross-Attention Guidance (black image matching)
- `attention_guided_kd_random_words.py` - Self-Knowledge Distillation with Cross-Attention Guidance (random words matching)
- `finetune_rev.py` - Finetuning-based reversal of poisoning
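At their core, the distillation scripts above minimize a matching loss between a trainable student copy of the poisoned model and a frozen teacher target. The following is a minimal conceptual sketch (not the paper's implementation) of that objective, using NumPy arrays as stand-ins for the UNet's predicted noise maps; the function name and shapes are illustrative assumptions:

```python
import numpy as np

def distillation_loss(student_pred: np.ndarray, teacher_pred: np.ndarray) -> float:
    """Mean-squared error between the student's and the (frozen) teacher's
    predicted noise maps -- the usual knowledge-distillation objective
    for diffusion UNets. Function name and shapes are illustrative."""
    return float(np.mean((student_pred - teacher_pred) ** 2))

# Toy stand-ins: the teacher's prediction on a clean prompt, and a
# student prediction that has drifted slightly from it.
rng = np.random.default_rng(0)
teacher_out = rng.standard_normal((4, 8, 8))
student_out = teacher_out + 0.1 * rng.standard_normal((4, 8, 8))

loss = distillation_loss(student_out, teacher_out)
```

In the attention-guided variants, the teacher target for the trigger token's cross-attention map is swapped for an alternative (Gaussian noise, a black image, or random words), while the loss form stays the same.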
The attention-capture mechanism (in the `attention_map` folder) is adapted from https://github.com/wooyeolBaek/attention-map.
Model weights before and after unpoisoning can be found in this Hugging Face repo.