Issue: I disagree with how the KL Divergence is derived. I think the description would benefit from expanding on the derivation.
In particular, I don't see how the highlighted intermediate step comes about without already having evaluated the integral.
