-
Notifications
You must be signed in to change notification settings - Fork 195
Open
Description
I came across your implementation of batch re-normalization in the BatchReNormDNNLayer class, and I think there might be an error that might be affecting the model's performance.
My understanding of batch re-norm is that it applies the standard BN normalization first, then applies the r/d correction, and then finally applies the gamma/beta scaling and bias. Something along the lines of this:
normed_x = (x - batch_mean) / batch_std # standard BN
normed_x = normed_x * r + d # The batch renorm correction
normed_x = normed_x * gamma + beta # final scale and biasHowever, this line is applying the r/d correction after the scaling and centering with gamma and beta.
https://github.com/ajbrock/Neural-Photo-Editor/blob/master/layers.py#L128
It probably works anyway, based on the good results you seem to have gotten. I just thought I'd bring it to your attention.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels