Fine-tuning diffusion models with functions from recommender systems.

# Demographic Diffusion

This repository demonstrates a proof of concept: a diffusion model can be fine-tuned with reinforcement learning using reward functions common in recommender systems. The workflow creates a synthetic dataset, learns a reward function, and then fine-tunes the diffusion model. Code for data generation and reward training lives in the `src` directory; diffusion model fine-tuning is handled in the `ddpo` directory.

This code was developed for a research study, which will be linked here once it becomes publicly available.

## Workflow Overview

To replicate our results, please follow these steps.

1. Create a synthetic dataset from raw product data (using the `Amazon-Products.csv` dataset):

       python src/main.py --task create_data

2. Compute aesthetic scores for images using CLIP embeddings:

       python src/main.py --task aesthetic_inference

3. Expand the dataset to the individual (per-user) level:

       python src/main.py --task expand_dataset

4. Train a reward model to predict user engagement:

       python src/main.py --task train_reward_function

5. Evaluate the trained reward model on a test set:

       python src/main.py --task evaluate

6. Train a classifier to predict gender from CLIP embeddings (used for demographic regularization):

       python src/main.py --task train_clip_gender_probe

7. Train a classifier to predict age group from CLIP embeddings:

       python src/main.py --task train_clip_age_probe_aggregated

8. Fine-tune Stable Diffusion v1.4 using the experiment config. The default config is set up for multi-GPU training and uses all available GPUs:

       accelerate launch scripts/train.py --config config/experiment.py:deep_fm
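Steps 6 and 7 train linear probes on frozen embeddings. As a rough illustration of the idea (this is a hedged sketch, not the repository's actual code — all names, dimensions, and data here are assumptions), a binary probe on fixed image embeddings amounts to logistic regression:

```python
# Illustrative sketch: a linear "probe" trained on frozen image embeddings
# to predict a binary attribute. Toy Gaussian clusters stand in for real
# CLIP embeddings; all names and shapes are hypothetical.
import numpy as np

def train_linear_probe(embeddings, labels, lr=0.1, epochs=200):
    """Logistic-regression probe: embeddings (n, d) -> binary labels (n,)."""
    n, d = embeddings.shape
    w, b = np.zeros(d), 0.0
    for _ in range(epochs):
        logits = embeddings @ w + b
        probs = 1.0 / (1.0 + np.exp(-logits))   # sigmoid
        grad = probs - labels                    # dLoss/dlogits for BCE
        w -= lr * embeddings.T @ grad / n
        b -= lr * grad.mean()
    return w, b

# Two well-separated clusters standing in for embeddings of two groups.
rng = np.random.default_rng(0)
X = np.concatenate([rng.normal(-1.0, 0.3, (50, 8)),
                    rng.normal(1.0, 0.3, (50, 8))])
y = np.concatenate([np.zeros(50), np.ones(50)])
w, b = train_linear_probe(X, y)
acc = (((X @ w + b) > 0) == y).mean()
```

Because the embedding model stays frozen, only this small linear head is trained, which keeps the probes cheap to fit and easy to evaluate.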

## The `ddpo` Directory

This part of the repository fine-tunes diffusion models with reinforcement learning using custom reward functions and prompts. It consumes the learned reward function and demographic probes produced by the `src` pipeline. The code extends the ddpo-pytorch repository; for more details, see the comments in the config files and the original repository.
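One way to picture how a learned engagement reward and a demographic probe can be combined into a single RL reward signal is sketched below. This is a simplified assumption about the shape of such a reward, not the repository's implementation; `combined_reward`, `target_share`, and `penalty` are hypothetical names:

```python
# Hypothetical sketch: combine a learned engagement reward with a
# demographic-probe penalty into one scalar reward for RL fine-tuning.
# Linear reward and probe heads stand in for the trained models.
import numpy as np

def combined_reward(clip_emb, reward_w, probe_w, probe_b,
                    target_share=0.5, penalty=1.0):
    """Mean engagement score, regularized toward a target group share."""
    engagement = clip_emb @ reward_w                       # learned reward head
    p_group = 1.0 / (1.0 + np.exp(-(clip_emb @ probe_w + probe_b)))
    # Penalize deviation of the batch's predicted group share from the target.
    demo_penalty = penalty * abs(float(p_group.mean()) - target_share)
    return float(engagement.mean()) - demo_penalty
```

With the probe output at the target share the penalty vanishes and the reward reduces to mean engagement; as the generated batch drifts toward one group, the penalty grows and pulls the policy back.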


For questions or comments, please contact: timo.stenz@tuebingen.mpg.de
