Learning Rate finder too strong loss smoothing #13404
hcgasser asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
The learning rate finder slowly increases the learning rate during its search and records how the loss reacts. My understanding is that, in theory, the loss is supposed to stay roughly constant at the beginning and then decrease, before a too-high learning rate leads to divergence.
However, in the callback method _LRCallback.on_batch_end, a smoothed loss is calculated (link below). The problem, in my opinion, is that the smoothing starts with an initial self.avg_loss of zero. This leads to the counterintuitive behavior that the smoothed loss increases with the learning rate at first. If the number of tested learning rates is low, this artifact can cover a wide range of learning rate values, in particular because the default beta is set very high (giving a lot of weight to the past).
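To illustrate what I mean (a minimal sketch of a zero-initialized exponential moving average, not the actual Lightning implementation): even a perfectly flat loss appears to rise over the first steps of the sweep.

```python
# Minimal sketch (not the actual Lightning code): an exponential moving
# average of the loss that starts from zero, as described above.
def smooth_from_zero(losses, beta=0.98):
    avg = 0.0
    smoothed = []
    for loss in losses:
        avg = beta * avg + (1 - beta) * loss
        smoothed.append(avg)
    return smoothed

# A loss that is actually constant at 2.3 ...
print(smooth_from_zero([2.3] * 5))
# ... still "rises" toward 2.3 at first: approx [0.046, 0.091, 0.135, 0.179, 0.221]
```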
I think the self.avg_loss value should be initialized to the first un-smoothed loss instead of zero. What do you think?
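As a sketch of the proposed change (again only illustrative, variable names are my own and not the exact internals):

```python
# Sketch of the proposed fix: seed the moving average with the first
# observed (un-smoothed) loss instead of zero.
def smooth_from_first(losses, beta=0.98):
    avg = losses[0]
    smoothed = []
    for loss in losses:
        avg = beta * avg + (1 - beta) * loss
        smoothed.append(avg)
    return smoothed

print(smooth_from_first([2.3] * 5))  # stays at 2.3, as one would expect
```

An alternative with the same effect would be the usual bias correction (dividing the running average by 1 - beta**(step + 1)), the same way Adam corrects its moment estimates for zero initialization.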
Thank you for looking into this
https://github.com/Lightning-AI/lightning/blob/b84b02400a312240a6429c186cc63514eeb45a82/pytorch_lightning/trainer/lr_finder.py#L374