Training with dual (optimizer + scheduler): only one learning rate is updated over training steps #14970
Answered by celsofranssa
celsofranssa asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
I am using two optimizers and two schedulers in my PL model:

```python
def configure_optimizers(self):
    # one AdamW optimizer per encoder
    opt1 = torch.optim.AdamW(
        self.encoder_1.parameters(),
        lr=self.hparams.lr_1,
        weight_decay=self.hparams.weight_decay
    )
    opt2 = torch.optim.AdamW(
        self.encoder_2.parameters(),
        lr=self.hparams.lr_2,
        weight_decay=self.hparams.weight_decay
    )

    # cyclical schedulers; the up-phase covers ~7% of the estimated training steps
    step_size_up = round(0.07 * self.trainer.estimated_stepping_batches)
    schdlr_1 = torch.optim.lr_scheduler.CyclicLR(
        opt1, mode='triangular2',
        base_lr=self.hparams.base_lr,
        max_lr=self.hparams.max_lr, step_size_up=step_size_up
    )
    schdlr_2 = torch.optim.lr_scheduler.CyclicLR(
        opt2, mode='triangular2',
        base_lr=self.hparams.base_lr,
        max_lr=self.hparams.max_lr, step_size_up=step_size_up
    )

    return (
        {"optimizer": opt1, "lr_scheduler": schdlr_1, "frequency": self.hparams.frequency_1},
        {"optimizer": opt2, "lr_scheduler": schdlr_2, "frequency": self.hparams.frequency_2},
    )
```

However, inspecting the logs over the training steps revealed that only one learning rate was updated.
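For reference, one way to watch both learning rates during training is Lightning's built-in `LearningRateMonitor` callback, which logs each configured optimizer's learning rate separately. A minimal sketch (the `Trainer` arguments below are placeholders, not values from the original thread):

```python
# Sketch: log every learning rate so it is easy to see whether both
# schedulers actually step. LearningRateMonitor ships with PyTorch Lightning;
# the Trainer arguments here are placeholders.
import pytorch_lightning as pl
from pytorch_lightning.callbacks import LearningRateMonitor

lr_monitor = LearningRateMonitor(logging_interval="step")  # log once per training step

trainer = pl.Trainer(
    max_epochs=10,            # placeholder
    callbacks=[lr_monitor],   # picks up every configured optimizer/scheduler
)
# trainer.fit(model, datamodule=datamodule)
```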
Answered by celsofranssa on Oct 3, 2022:
It happened because I had forgotten to specify the scheduler's interval. The correct, desired config is something like:

```python
return (
    {"optimizer": opt1, "lr_scheduler": {"scheduler": schdlr_1, "interval": "step", "name": "LRS-1"}, "frequency": 1},
    {"optimizer": opt2, "lr_scheduler": {"scheduler": schdlr_2, "interval": "step", "name": "LRS-2"}, "frequency": 1},
)
```
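For context, Lightning steps schedulers once per epoch by default (`"interval": "epoch"`); setting `"interval": "step"` in the nested `lr_scheduler` dict makes each scheduler advance every training batch, which is what a `CyclicLR` sized via `estimated_stepping_batches` expects. A complete `configure_optimizers` with the fix applied could look like the sketch below. Hyperparameter names are reused from the question; `cycle_momentum=False` and the `frequency` values of 1 are my additions for illustration, not part of the original answer:

```python
def configure_optimizers(self):
    # One AdamW optimizer per encoder (names as in the question).
    opt1 = torch.optim.AdamW(
        self.encoder_1.parameters(),
        lr=self.hparams.lr_1,
        weight_decay=self.hparams.weight_decay,
    )
    opt2 = torch.optim.AdamW(
        self.encoder_2.parameters(),
        lr=self.hparams.lr_2,
        weight_decay=self.hparams.weight_decay,
    )

    # Up-phase covers ~7% of the estimated number of optimizer steps.
    step_size_up = round(0.07 * self.trainer.estimated_stepping_batches)
    schdlr_1 = torch.optim.lr_scheduler.CyclicLR(
        opt1, mode="triangular2",
        base_lr=self.hparams.base_lr, max_lr=self.hparams.max_lr,
        step_size_up=step_size_up,
        cycle_momentum=False,  # my addition: cycle only the LR, AdamW has no classic momentum
    )
    schdlr_2 = torch.optim.lr_scheduler.CyclicLR(
        opt2, mode="triangular2",
        base_lr=self.hparams.base_lr, max_lr=self.hparams.max_lr,
        step_size_up=step_size_up,
        cycle_momentum=False,
    )

    return (
        {
            "optimizer": opt1,
            # nested dict: step the scheduler every batch and give it a log name
            "lr_scheduler": {"scheduler": schdlr_1, "interval": "step", "name": "LRS-1"},
            "frequency": 1,
        },
        {
            "optimizer": opt2,
            "lr_scheduler": {"scheduler": schdlr_2, "interval": "step", "name": "LRS-2"},
            "frequency": 1,
        },
    )
```

With the `name` keys set, the two schedules appear as separate curves ("LRS-1", "LRS-2") in the logger, which makes it easy to confirm that both learning rates now cycle.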
Answer selected by celsofranssa