Don't reset optimizer progress at the end of an epoch #13199
Unanswered
philipbecker asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Basically, I have my optimizers configured with per-optimizer frequencies, like this:
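A minimal sketch of that kind of setup (the submodules, learning rates, and the concrete 1/3 frequency split are placeholders I picked for illustration; one full cycle spans 1 + 3 = 4 training steps):

```python
import torch
from pytorch_lightning import LightningModule


class MyModule(LightningModule):
    def __init__(self):
        super().__init__()
        self.model_a = torch.nn.Linear(32, 32)  # placeholder submodules
        self.model_b = torch.nn.Linear(32, 32)

    def configure_optimizers(self):
        opt_a = torch.optim.Adam(self.model_a.parameters(), lr=1e-3)
        opt_b = torch.optim.Adam(self.model_b.parameters(), lr=1e-3)
        # With the dict format, "frequency" tells Lightning how many consecutive
        # training steps each optimizer should receive before switching to the
        # next one; one full cycle here spans 1 + 3 = 4 batches.
        return [
            {"optimizer": opt_a, "frequency": 1},
            {"optimizer": opt_b, "frequency": 3},
        ]
```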
With this configuration, PyTorch Lightning gives each training step a new batch from the dataset. This behaviour is different from returning just a tuple of optimizers, where every optimizer operates on the same batch. So far, so good.
However, it comes with a behaviour that is (for me) undesirable: Lightning resets the internal optimizer loop at the end of every epoch. In the example above, we need at least 4 batches in the dataset, otherwise the last optimizer(s) will never be executed. More generally, if the number of batches in the dataset is not divisible by 4, the actual frequency of the optimizers does not match the configured frequency, because the last cycle over the optimizers within each epoch is left incomplete.
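To make this concrete, here is a small standalone simulation of that cycling (pure Python, no Lightning internals; it just assumes the frequencies above and that the cycle position is reset at every epoch boundary, which is what I observe):

```python
from collections import Counter

frequencies = [1, 3]   # one full cycle spans sum(frequencies) = 4 batches
num_batches = 6        # batches per epoch, not divisible by 4
num_epochs = 2

counts = Counter()
for _ in range(num_epochs):
    position = 0       # the cycle position is reset at every epoch boundary
    for _ in range(num_batches):
        # Map the position within the cycle to an optimizer index.
        cursor = position % sum(frequencies)
        for opt_idx, freq in enumerate(frequencies):
            if cursor < freq:
                break
            cursor -= freq
        counts[opt_idx] += 1
        position += 1

print(counts)  # Counter({1: 8, 0: 4})
```

With 6 batches per epoch over 2 epochs, the first optimizer gets 4 steps and the second gets 8, a 1:2 ratio instead of the configured 1:3. Without the per-epoch reset, the same 12 batches would give exactly 3 and 9 steps.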
Maybe this is a desirable default behaviour, but I don't care at all about epochs during training; I just want to cycle over these optimizers evenly. Since I do active learning research, my datasets are often small-ish (at least at the beginning of training), so this is an important issue for me. I thought that by writing my own TrainingEpochLoop/OptimizerLoop/OptimizerProgress class I could easily fix this, but so far I have failed to truly understand the workflow.
Here is a colab that demonstrates the problem: https://colab.research.google.com/drive/1LUXjm8r4MODLkkTkPlnjGaxyPtkkXaBj?usp=sharing
I tried removing/modifying certain reset calls in my own Loops, and overwriting how the OptimizerProgress state is set, but to no avail. Could someone give me some pointers on how I can achieve my desired behaviour?
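For reference, here is a rough sketch of the behaviour I am after, expressed with manual optimization (names like compute_loss are placeholders; I would much prefer to keep automatic optimization and the frequency configuration rather than switching to something like this):

```python
import torch
from pytorch_lightning import LightningModule


class ManualCyclingModule(LightningModule):
    def __init__(self):
        super().__init__()
        self.automatic_optimization = False      # take over the cycling myself
        self.model_a = torch.nn.Linear(32, 32)   # placeholder submodules
        self.model_b = torch.nn.Linear(32, 32)
        self.frequencies = [1, 3]
        self._cycle_step = 0                     # never reset at epoch boundaries

    def configure_optimizers(self):
        return [
            torch.optim.Adam(self.model_a.parameters(), lr=1e-3),
            torch.optim.Adam(self.model_b.parameters(), lr=1e-3),
        ]

    def training_step(self, batch, batch_idx):
        # Choose the optimizer from a global step counter, ignoring epochs.
        cursor = self._cycle_step % sum(self.frequencies)
        for opt_idx, freq in enumerate(self.frequencies):
            if cursor < freq:
                break
            cursor -= freq
        self._cycle_step += 1

        opt = self.optimizers()[opt_idx]
        loss = self.compute_loss(batch, opt_idx)  # placeholder loss computation
        opt.zero_grad()
        self.manual_backward(loss)
        opt.step()
```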