This issue has been discussed before in #3287, but that issue was closed unresolved. Diffusers supports learned variance only at inference time, which is odd given that it does not support training such models in the first place. Improved DDPM has since been used in a lot of other repositories, such as ADM and DiT.
Can this be addressed, please?
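
To make the request concrete, here is a rough, framework-free sketch of the two pieces that, as far as I understand the Improved DDPM paper, training with learned variance needs: the interpolated log-variance and a Gaussian KL term for the variational-bound part of the loss. The function names (`posterior_variance`, `learned_log_variance`, `normal_kl`) are hypothetical and only for illustration; they are not Diffusers APIs, and this works on plain scalars rather than tensors.

```python
import math


def posterior_variance(betas, t):
    """beta_tilde_t = (1 - alpha_bar_{t-1}) / (1 - alpha_bar_t) * beta_t.

    `t` is a 0-based index into `betas`. At t == 0 the posterior variance
    is 0 and its log is undefined, so this sketch only accepts t >= 1.
    """
    if t < 1:
        raise ValueError("this sketch only handles t >= 1")
    alpha_bar_prev = math.prod(1.0 - b for b in betas[:t])
    alpha_bar = alpha_bar_prev * (1.0 - betas[t])
    return (1.0 - alpha_bar_prev) / (1.0 - alpha_bar) * betas[t]


def learned_log_variance(v, betas, t):
    """Interpolate in log space between beta_tilde_t and beta_t.

    `v` is the raw variance output of the model, assumed to lie in [-1, 1].
    """
    frac = (v + 1.0) / 2.0  # map [-1, 1] to [0, 1]
    max_log = math.log(betas[t])
    min_log = math.log(posterior_variance(betas, t))
    return frac * max_log + (1.0 - frac) * min_log


def normal_kl(mean1, logvar1, mean2, logvar2):
    """KL(N(mean1, exp(logvar1)) || N(mean2, exp(logvar2))) for scalars."""
    return 0.5 * (
        -1.0
        + logvar2
        - logvar1
        + math.exp(logvar1 - logvar2)
        + (mean1 - mean2) ** 2 * math.exp(-logvar2)
    )
```

In the paper, the KL term between the true posterior and the predicted distribution is added to the usual noise-prediction MSE with a small weight, and the predicted mean is detached for that term so that only the variance output is trained by it. That weighting and the detach step are not shown here.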