SWA callback with torch.save state_dict() #11997
Unanswered
maxmatical asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 1 comment
I'm trying to experiment with the SWA callback when training PyTorch models using the Lightning Trainer. One problem I'm running into stems from how my LightningModule is structured (see the sketch below).
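Based on the title and the workflow described (torch.save of a state_dict, then evaluating the plain PyTorch model), here is a minimal sketch of this kind of setup; LitModel, the layer sizes, the synthetic data, and the model.pt file name are hypothetical placeholders rather than the original code:

```python
import torch
import torch.nn as nn
import pytorch_lightning as pl
from pytorch_lightning.callbacks import StochasticWeightAveraging
from torch.utils.data import DataLoader, TensorDataset

class LitModel(pl.LightningModule):
    def __init__(self):
        super().__init__()
        # inner plain-torch model; this is what torch.save exports below
        self.model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 2))

    def forward(self, x):
        return self.model(x)

    def training_step(self, batch, batch_idx):
        x, y = batch
        return nn.functional.cross_entropy(self(x), y)

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-3)

# synthetic stand-in data so the sketch runs end to end
train_dl = DataLoader(
    TensorDataset(torch.randn(256, 32), torch.randint(0, 2, (256,))),
    batch_size=32,
)

lit = LitModel()
trainer = pl.Trainer(
    max_epochs=10,
    callbacks=[StochasticWeightAveraging(swa_lrs=1e-2)],
)
trainer.fit(lit, train_dataloaders=train_dl)

# export only the inner torch model for evaluation outside Lightning
torch.save(lit.model.state_dict(), "model.pt")
```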
I trained with and without the SWA callback (using the same seed, for 10 epochs, so I know it is actually averaging models). But after loading the saved PyTorch model and running evaluation, I get the same metric whether training ran with or without SWA.
Does anyone know why this is the case?
Edit: solved the issue. It turns out the trainer was automatically loading the best model checkpoint, which doesn't contain the averaged weights.
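Concretely: Lightning's StochasticWeightAveraging callback copies the averaged weights back into the live LightningModule at the end of training, so the fix is to save the state_dict straight from the in-memory module after trainer.fit() returns, instead of reloading a "best" checkpoint that was written before that swap. A sketch continuing the placeholder names from above (swa_model.pt is also hypothetical):

```python
# After trainer.fit(...) returns, the live module already holds the
# SWA-averaged weights, so save from it directly rather than reloading
# a "best" checkpoint that predates the weight swap.
torch.save(lit.model.state_dict(), "swa_model.pt")

# Evaluate in plain PyTorch, outside Lightning.
eval_model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 2))
eval_model.load_state_dict(torch.load("swa_model.pt"))
eval_model.eval()
```

If you instead load the best checkpoint (for example via trainer.test(ckpt_path="best"), or by reloading the checkpoint file manually), you get the pre-SWA weights, which would explain identical metrics with and without the callback.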
Reply: Could you please show how you were able to retrieve the model with the averaged weights, if you were able to? I can't seem to figure that out.