Hello, I have another question about CFG and was hoping you could help me with it.
In the sampling code, the “unconditional” input bad_cvec is set to the downsampled cvec. However, from the training code, it looks like bad_cvec is not used during training. This raises a concern: could this lead to a train-inference mismatch?
Furthermore, CFG typically relies on contrasting a conditional input with an unconditional one. Since bad_cvec is merely a downsampled version, it arguably still carries conditions. I’m curious to know if using this “imperfect” unconditional input might offer some advantages.
Thanks in advance for your clarification!