Scheduler + gradient clipping with AMP16 (manual optimization) #10880
Unanswered
OverLordGoldDragon asked this question in Lightning Trainer API: Trainer, LightningModule, LightningDataModule
Replies: 1 comment
```python
def on_before_optimizer_step(self, optimizer, optimizer_idx):
    self.clip_gradients(
        optimizer,
        gradient_clip_val=self.hparams.gradient_clip_val,
        gradient_clip_algorithm=self.hparams.gradient_clip_algorithm,
    )
```
seems to work.
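Building on that, here is a minimal sketch of how the hook could sit inside a manual-optimization module together with a per-step scheduler, assuming Lightning ~1.5 and `Trainer(precision=16)`; the network, loss, and hyperparameter values are illustrative, not from this thread:

```python
import torch
import pytorch_lightning as pl


class ManualAMPModule(pl.LightningModule):
    def __init__(self, gradient_clip_val=1.0, gradient_clip_algorithm="norm"):
        super().__init__()
        self.save_hyperparameters()
        self.automatic_optimization = False  # manual optimization
        self.net = torch.nn.Linear(32, 1)

    def training_step(self, batch, batch_idx):
        opt = self.optimizers()
        sch = self.lr_schedulers()

        x, y = batch
        loss = torch.nn.functional.mse_loss(self.net(x), y)

        opt.zero_grad()
        self.manual_backward(loss)  # loss scaling handled by the precision plugin
        opt.step()                  # unscaling and scaler.step handled by the precision plugin
        sch.step()                  # stepwise LR schedule

    # As in the reply above: clip here. Per the source links cited in the
    # question below, this point comes after the gradients are unscaled and
    # before scaler.step is called.
    def on_before_optimizer_step(self, optimizer, optimizer_idx):
        self.clip_gradients(
            optimizer,
            gradient_clip_val=self.hparams.gradient_clip_val,
            gradient_clip_algorithm=self.hparams.gradient_clip_algorithm,
        )

    def configure_optimizers(self):
        opt = torch.optim.Adam(self.parameters(), lr=1e-3)
        sch = torch.optim.lr_scheduler.StepLR(opt, step_size=1000, gamma=0.5)
        return [opt], [sch]


# e.g. trainer = pl.Trainer(precision=16, gpus=1)
```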
How should it be done? I couldn't sort it out from the docs. Suppose a typical manual-optimization setup with 16-bit AMP and a per-step LR scheduler. It's unclear where self.clip_gradients should go; an overridden def optimizer_step never gets called with such code. Clipping should follow unscaling, which is how automatic optimization does it: first
https://github.com/PyTorchLightning/pytorch-lightning/blob/20bef8327f52248a02dfc6c013afb90089d01519/pytorch_lightning/plugins/precision/native_amp.py#L87-L88
then
https://github.com/PyTorchLightning/pytorch-lightning/blob/20bef8327f52248a02dfc6c013afb90089d01519/pytorch_lightning/plugins/precision/precision_plugin.py#L127
lastly
https://github.com/PyTorchLightning/pytorch-lightning/blob/20bef8327f52248a02dfc6c013afb90089d01519/pytorch_lightning/plugins/precision/native_amp.py#L93
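For reference, that unscale → clip → step ordering matches the standard torch.cuda.amp recipe; a minimal plain-PyTorch sketch, independent of Lightning (toy model and values, needs a CUDA device):

```python
import torch

model = torch.nn.Linear(32, 1).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
scaler = torch.cuda.amp.GradScaler()

for _ in range(10):
    x = torch.randn(8, 32, device="cuda")
    y = torch.randn(8, 1, device="cuda")

    optimizer.zero_grad()
    with torch.cuda.amp.autocast():
        loss = torch.nn.functional.mse_loss(model(x), y)

    scaler.scale(loss).backward()
    scaler.unscale_(optimizer)                               # 1) unscale gradients in-place
    torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)  # 2) clip the unscaled gradients
    scaler.step(optimizer)                                   # 3) step (skipped if grads are inf/NaN)
    scaler.update()
```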
With manual optimization, however, it seems we'd have to recreate a bunch of internals to insert clipping between unscaling and the weight update. It's also unclear how overriding optimizer_step works, since it's never called when using the docs' code for manual optimization. Related to #9923?
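On that last point: optimizer_step is a hook of the automatic-optimization loop, so with automatic_optimization = False (where training_step calls opt.step() itself) the override being bypassed is expected. A rough sketch of the override as it appears in automatic optimization, with roughly the Lightning ~1.5 signature (illustrative, not from this thread):

```python
import pytorch_lightning as pl


class AutoOptModule(pl.LightningModule):
    # Called by the automatic-optimization loop around every optimizer step.
    # With self.automatic_optimization = False this hook is not invoked,
    # which matches the observation above.
    def optimizer_step(self, epoch, batch_idx, optimizer, optimizer_idx=0,
                       optimizer_closure=None, on_tpu=False,
                       using_native_amp=False, using_lbfgs=False):
        optimizer.step(closure=optimizer_closure)
```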