torchzero provides efficient implementations of 300+ optimization algorithms with a PyTorch optimizer interface, covering many classes of unconstrained optimization - convex and non-convex, local and global, derivative-free, gradient-based and second-order, least squares, etc.
The algorithms are designed to be as modular as possible - they can be freely combined; for example, all second-order-like methods can be combined with any line search or trust region algorithm. Techniques like gradient clipping, weight decay, sharpness-aware minimization, cautious updates, and gradient accumulation are their own modules and can be used with anything else.
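For instance, the Newton module used later in this README can be dropped inside the cubic-regularization trust region, with decoupled weight decay stacked on top (a minimal sketch; any other combination of modules follows the same pattern):

```python
import torch.nn as nn
import torchzero as tz

model = nn.Linear(10, 1)

# compose a second-order step with a trust region and decoupled weight decay
opt = tz.Modular(
    model.parameters(),
    tz.m.CubicRegularization(tz.m.Newton()),  # Newton step inside cubic regularization
    tz.m.WeightDecay(1e-2),                   # applied to the final update -> decoupled
)
```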
Note: this project is under active development and the API may change, although at this point I am very happy with the API.
Install with pip:

```bash
pip install torchzero
```
The GitHub version may be a bit more recent and less tested:

```bash
pip install git+https://github.com/inikishev/torchzero
```
Each module represents a distinct step in the optimization process. A list of modules implemented in torchzero is available on the wiki.
Construct a `tz.Modular` optimizer with the desired modules and use it like any other PyTorch optimizer:

```python
optimizer = tz.Modular(
    model.parameters(),
    tz.m.ClipValue(1),
    tz.m.Adam(),
    tz.m.WeightDecay(1e-2),
    tz.m.LR(1e-1)
)
```
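None of these modules need a closure, so the usual PyTorch training pattern should apply (a minimal sketch; `model`, `inputs`, `targets` and the `F` alias are assumed to be defined as in the closure example further below):

```python
# standard loop: compute gradients, then let the modular optimizer transform them
for _ in range(100):
    optimizer.zero_grad()
    loss = F.mse_loss(model(inputs), targets)
    loss.backward()
    optimizer.step()
```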
Here is what happens:

- The gradient is passed to the `ClipValue(1)` module, which returns the gradient with magnitudes clipped to be no larger than 1.
- The clipped gradient is passed to `Adam()`, which updates the Adam momentum buffers and returns the Adam update.
- The Adam update is passed to `WeightDecay(1e-2)`, which adds a weight decay penalty to it. Since it is placed after `Adam()`, the weight decay is decoupled; moving `WeightDecay(1e-2)` before `Adam()` gives coupled weight decay (see the sketch after this list).
- Finally, the update is passed to `LR(1e-1)`, which multiplies it by the learning rate of 0.1.
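The coupled variant is a minimal reordering of the same modules (a sketch using only the modules from the example above):

```python
# coupled weight decay: the penalty is added to the (clipped) gradient
# before Adam, so it passes through Adam's momentum buffers
optimizer = tz.Modular(
    model.parameters(),
    tz.m.ClipValue(1),
    tz.m.WeightDecay(1e-2),  # placed before Adam -> coupled
    tz.m.Adam(),
    tz.m.LR(1e-1)
)
```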
Certain modules, such as line searches and trust regions, require a closure, similar to L-BFGS in PyTorch. Some modules also require the closure to accept an additional `backward` argument, as in the example below:
```python
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchzero as tz

model = nn.Sequential(nn.Linear(10, 10), nn.ELU(), nn.Linear(10, 1))
inputs = torch.randn(100, 10)
targets = torch.randn(100, 1)

optimizer = tz.Modular(
    model.parameters(),
    tz.m.CubicRegularization(tz.m.Newton()),
)

for i in range(1, 51):
    def closure(backward=True):
        preds = model(inputs)
        loss = F.mse_loss(preds, targets)
        # if backward=True, the closure should call
        # optimizer.zero_grad() and loss.backward()
        if backward:
            optimizer.zero_grad()
            loss.backward()
        return loss

    loss = optimizer.step(closure)
    if i % 10 == 0:
        print(f"step: {i}, loss: {loss.item():.4f}")
```
The code above will also work with any other optimizer because all PyTorch optimizers and most custom ones support closures, so there is no need to rewrite the training loop.
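For example, swapping in `torch.optim.SGD` keeps the closure-based loop intact; built-in optimizers call `closure()` with no arguments, so `backward` simply defaults to `True`:

```python
# same closure-based loop, but with a built-in PyTorch optimizer
optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)

for i in range(1, 51):
    def closure(backward=True):
        loss = F.mse_loss(model(inputs), targets)
        if backward:
            optimizer.zero_grad()
            loss.backward()
        return loss

    loss = optimizer.step(closure)
```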
Rosenbrock minimization example:
```python
import torch
import torchzero as tz

def rosen(x, y):
    return (1 - x) ** 2 + 100 * (y - x ** 2) ** 2

X = torch.tensor([-1.1, 2.5], requires_grad=True)

def closure(backward=True):
    loss = rosen(*X)
    if backward:
        X.grad = None  # same as opt.zero_grad()
        loss.backward()
    return loss

opt = tz.Modular([X], tz.m.NewtonCGSteihaug())

for step in range(24):
    loss = opt.step(closure)
    print(f'{step} - {loss}')
```
To learn more about how to use torchzero, check Basics.
An overview of the optimization algorithms in torchzero, along with visualizations, explanations, and benchmarks, is available in the overview section.
If you just want to see which algorithms are implemented, check the API reference.