Name	Name	Last commit message	Last commit date
parent directory ..
coco_eval	coco_eval
README.md	README.md
eval.sh	eval.sh
requirements.txt	requirements.txt
test.py	test.py
train.py	train.py
train.sh	train.sh
utils.py	utils.py

Name

Last commit message

Last commit date

README.md

Guidance Free Training script for Stable Diffusion 1.5

Environment / Dataset Setup

Install the required Python packages:

pip install -r requirements.txt

📂 Evaluation Dataset: COCO2014

The COCO2014 dataset is used as the evaluation benchmark. To prepare it, please download the images and corresponding test captions using the following commands:

wget https://huggingface.co/tianweiy/DMD2/resolve/main/data/coco/captions_coco14_test.pkl?download=true -O coco/captions_coco14_test.pkl
wget https://huggingface.co/tianweiy/DMD2/resolve/main/data/coco/val2014.zip?download=true -O coco/val2014.zip
unzip coco/val2014.zip -d coco

📂 Training Dataset: LAION Aesthetics 5+

A subset of the LAION Aesthetics 5+ dataset is used for training. To prepare the dataset, you need to create a jsonl file containing the image entries with the following format:

{"caption": "Antique Khotan, East Turkestan, late 19th century, 3 feet 6 inches x 5 feet 3 inches. Estimate: $2,000-$4,000. Image courtesy of Nazmiyal Collection.", "image_path": "/path/to/image/0000295806.jpg"}
{"caption": "7 Cookbooks by Black Chefs That Serve Up More Than Just Meals", "image_path": "/path/to/image//0000291187.jpg"}
{"caption": "Circular Oak Occasional Table", "image_path": "/path/to/image/0000295233.jpg"}

Once the jsonl file is ready, you can pass its path to the training script using the --train_data_jsonl argument. You can refer to train.sh for a complete example of how to launch training with all necessary arguments.

Model Evaluation

Pretrained models

model	reso.	FID (w/o CFG)	HF weights🤗
stable diffusion 1.5 (finetune)	512	8.10	SD1.5-GF-finetune

Evaluation

Please refer to stable-diffusion-v1-5/eval.sh for usage reference.

Model Training

Please refer to stable-diffusion-v1-5/train.sh for usage reference.

Acknowledgements

This codebase includes components adapted from the following repositories:

Hugging Face diffusers, specifically the training script train_text_to_image.py.
Tianwei Yin's DMD2, particularly the evaluation script test_folder_sd.py.

We thank the authors for their clear and well-structured implementations.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Guidance Free Training script for Stable Diffusion 1.5

Environment / Dataset Setup

📂 Evaluation Dataset: COCO2014

📂 Training Dataset: LAION Aesthetics 5+

Model Evaluation

Pretrained models

Evaluation

Model Training

Acknowledgements

FilesExpand file tree

stable-diffusion-v1-5

Directory actions

More options

Directory actions

More options

Latest commit

History

stable-diffusion-v1-5

Folders and files

parent directory

README.md

Guidance Free Training script for Stable Diffusion 1.5

Environment / Dataset Setup

📂 Evaluation Dataset: COCO2014

📂 Training Dataset: LAION Aesthetics 5+

Model Evaluation

Pretrained models

Evaluation

Model Training

Acknowledgements