Why does training loss not converge at the first stage?

I cannot find the other curated datasets in StyleGallery; only the wikiart subset seems to be available. Also, why does the loss fail to converge when I train on the wikiart dataset alone? Here is my training command:
accelerate launch --num_processes 3 --multi_gpu --mixed_precision "fp16" \
  tutorial_train_styleshot_stage_1.py \
  --pretrained_model_name_or_path="runwayml/stable-diffusion-v1-5/" \
  --image_encoder_path="StyleShot/laion/CLIP-ViT-H-14-laion2B-s32B-b79K" \
  --image_json_file="StyleShot/wikiart_only.jsonl" \
  --image_root_path="StyleShot/data/wikiart" \
  --mixed_precision="fp16" \
  --resolution=512 \
  --train_batch_size=8 \
  --dataloader_num_workers=4 \
  --learning_rate=1e-05 \
  --weight_decay=0.001 \
  --output_dir="StyleShot/results" 
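
One way to tell whether the loss is really failing to converge, rather than just fluctuating from step to step, is to smooth the logged values before looking at the trend. Below is a minimal sketch, assuming the per-step losses have been collected into a plain Python list. The `ema_smooth` helper and the `losses` values are hypothetical and made up for illustration; they are not part of the StyleShot training script or its logs.

```python
import random


def ema_smooth(values, beta=0.98):
    """Return a bias-corrected exponential moving average of `values`."""
    smoothed = []
    avg = 0.0
    for step, value in enumerate(values, start=1):
        avg = beta * avg + (1.0 - beta) * value
        # Bias correction keeps the early estimates from being pulled toward 0.
        smoothed.append(avg / (1.0 - beta ** step))
    return smoothed


# Synthetic losses (NOT real training output): a slow downward trend
# buried under much larger step-to-step noise.
random.seed(0)
losses = [0.2 - 0.0001 * i + random.uniform(-0.05, 0.05) for i in range(1000)]

smoothed = ema_smooth(losses)
print(f"raw first/last:      {losses[0]:.4f} / {losses[-1]:.4f}")
print(f"smoothed first/last: {smoothed[0]:.4f} / {smoothed[-1]:.4f}")
```

If the smoothed curve is still flat over many thousands of steps, that would point to an actual convergence problem rather than noise.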

Why does training loss not converge at the first stage? #68
