DL-Pytorch-Workshop/faq.md at main · Arunprakash-A/DL-Pytorch-Workshop

What is the use of `last_epoch` argument in the LRSchedulers like ConstatnLR?

First, the learning rate at the i-th epoch is a function of i (index)
Therefore, it is important to keep track of the index-i (last index). Often it is last epoch
It should be stored while checkpointing model parameters.

I have a model containing multiple layers. I want to set a different learning rate for each layer. Is it possible to achieve that easily in PyTorch?

Yes.

Create parameter groups while creating the model. For example

import torch
import torch.nn as nn
import torch.optim as optim    

class SimpleCNN(nn.Module):
    def __init__(self):
        super(SimpleCNN, self).__init__()
        # Convolutional part
        self.conv = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2)
        )
        # Fully connected part
        self.fc = nn.Sequential(
            nn.Linear(16 * 14 * 14, 10)
        )

    def forward(self, x):
        x = self.conv(x)
        x = x.view(x.size(0), -1)
        x = self.fc(x)
        return x
    
# Initialize model
model = SimpleCNN()

# Create two parameter groups
optimizer = optim.SGD([
    {'params': model.conv.parameters(), 'lr': 0.01},   # Group 1: conv layers
    {'params': model.fc.parameters(), 'lr': 0.1}       # Group 2: fc layers
], momentum=0.9)

# Print parameter groups for clarity
for i, group in enumerate(optimizer.param_groups):
    print(f"Parameter group {i+1}: learning rate = {group['lr']}")

You can also pass a list of schedulers for each group while initializing the learning rate scheduler.
Note: You can use only a learning rate scheduler (i.e., a function of epochs), not a function of validation loss (like ReduceLROnPlateau)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What is the use of `last_epoch` argument in the LRSchedulers like ConstatnLR?

I have a model containing multiple layers. I want to set a different learning rate for each layer. Is it possible to achieve that easily in PyTorch?

FilesExpand file tree

faq.md

Latest commit

History

faq.md

File metadata and controls

What is the use of last_epoch argument in the LRSchedulers like ConstatnLR?

I have a model containing multiple layers. I want to set a different learning rate for each layer. Is it possible to achieve that easily in PyTorch?

What is the use of `last_epoch` argument in the LRSchedulers like ConstatnLR?