Hi, thank you for providing the code. It was really helpful.
One thing I am curious about: when training a VAE, unlike VanillaAE, the KL loss weight can affect the reconstruction quality.
Adding focal frequency loss without changing the KL loss weight will cause the reconstruction terms to be weighted more heavily relative to the KL term, so the model can trivially reconstruct more details.
Could you share how you weighted these terms? I could not find this described in the paper.
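
To make the question concrete, here is a minimal sketch of the objective I have in mind. The function names, the weights `w_ffl` and `w_kl`, and the loss values are all my own assumptions for illustration. They are not taken from the paper or from this repository.

```python
def vae_objective(recon, ffl, kl, w_ffl=1.0, w_kl=1.0):
    """Hypothetical weighted VAE objective: recon + w_ffl * FFL + w_kl * KL."""
    return recon + w_ffl * ffl + w_kl * kl


def kl_share(recon, ffl, kl, w_ffl=1.0, w_kl=1.0):
    """Fraction of the total objective contributed by the weighted KL term."""
    total = vae_objective(recon, ffl, kl, w_ffl=w_ffl, w_kl=w_kl)
    return (w_kl * kl) / total


# Made-up loss values, only to show how the balance shifts.
recon, ffl, kl = 1.0, 1.0, 0.5

share_without_ffl = kl_share(recon, ffl, kl, w_ffl=0.0)  # baseline VAE
share_with_ffl = kl_share(recon, ffl, kl, w_ffl=1.0)     # FFL added, same w_kl

print(f"KL share without FFL: {share_without_ffl:.3f}")
print(f"KL share with FFL:    {share_with_ffl:.3f}")
```

With `w_kl` held fixed, the KL term makes up a smaller fraction of the total once FFL is added, so better reconstructions might partly come from the weaker relative regularization rather than from FFL itself.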
Thanks in advance!