Skip to content
Discussion options

You must be logged in to vote

Hi @Brayan532 — you can enable MuSGD in standard Ultralytics training by setting optimizer="MuSGD" (or leaving optimizer="auto", which will select MuSGD for longer runs based on total iterations); for the exact behavior (which params get Muon vs plain SGD, and the internal Muon/SGD blend used), see the MuSGD/Muon API reference and the trainer’s optimizer construction logic.

from ultralytics import YOLO

model = YOLO("yolo26n.pt")
model.train(data="coco8.yaml", epochs=100, imgsz=640, optimizer="MuSGD")

If you’re training a fully custom PyTorch loop (not model.train()), you can also import MuSGD directly from ultralytics.optim.muon and pass explicit param groups (with use_muon=True only on …

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@glenn-jocher
Comment options

Answer selected by Brayan532
@Brayan532
Comment options

@glenn-jocher
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
documentation Improvements or additions to documentation enhancement New feature or request advice needed Non-issue requests for advice, recommendations, and best practices
3 participants