-
Notifications
You must be signed in to change notification settings - Fork 39
Open
Description
Hi Sir, I'm working on a LDM for my thesis and your video was very helpful in figuring out how the DDPM works. I only have a doubt in the training process, right now I'm:
- Sample a Batch of images and related caption
- Pass the images trough the Encoder of che diffusion model (to obtain the latent) and the caption trough the clip encoder
- Sample a random T and add noise to the latent with the scheduler
- Pass the latent in the Unet obtaining the predicted noise
- Calculate the loss between real Noise and predicted Noise
My doubt is, is it all i have to do? During the training process i don't have to do all the steps during forward and reverse project, but i can only limit to the single t i randomly sample?
Metadata
Metadata
Assignees
Labels
No labels