Train sd3 controlnet but result is bad #9675
              
                Unanswered
              
          
                  
                    
                      xduzhangjiayu
                    
                  
                
                  asked this question in
                Q&A
              
            Replies: 2 comments 6 replies
-
| This should belong to discussions. Ccing @PromeAIpro who contributed the script. | 
Beta Was this translation helpful? Give feedback.
                  
                    4 replies
                  
                
            -
Beta Was this translation helpful? Give feedback.
                  
                    2 replies
                  
                
            
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
-
Describe the bug
Hi,
I tried to train sd3 controlnet with
examples\controlnet\train_controlnet_sd3.py, but it seems the validation results are really bad, and the training loss was oscillating all the time, it really confused me for a long time, could you please give some advice? Thanks a lot !Here are some validation results and train loss:


validation output on steps 2900:
loss function in tensorboard
Reproduction
accelerate launch examples/controlnet/train_controlnet_sd3.py --pretrained_model_name_or_path=stable-diffusion-3-medium-diffusers --output_dir=output --resolution=1024 --learning_rate=1e-5 --validation_image "./conditioning_image_1.png" "./conditioning_image_2.png" --validation_prompt "red circle with blue background" "cyan circle with brown floral background" --train_batch_size=1 --gradient_checkpointing --gradient_accumulation_steps=4
Logs
No response
System Info
diffusers: 0.31.0.dev (latest)
torch: 2.4.0
Who can help?
@sayakpaul @DN6 @asomoza
Beta Was this translation helpful? Give feedback.
All reactions