Request for More Detailed Documentation on MuSGD Optimizer Usage #23595
-
|
Hi Ultralytics team, First of all, thank you for releasing YOLO26 and all the significant advancements you’ve brought to computer vision workflows. Thanks very much for your continued investment in the Ultralytics ecosystem and for making cutting-edge CV technology more accessible! Sincerely, |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
|
👋 Hello @Brayan532, thank you for your interest in Ultralytics 🚀! This is an automated response to help you get unblocked quickly—an Ultralytics engineer will also take a look and assist soon. We recommend a visit to the Docs for new users where you can find many Python and CLI usage examples and where many of the most common questions may already be answered. If this is a 🐛 Bug Report (for example, if MuSGD isn’t behaving as expected or you’re seeing errors enabling it), please provide a minimum reproducible example to help us debug it (including your command/script, a small config snippet, and the exact logs). If this is a custom training ❓ Question (like recommended MuSGD usage patterns, configuration knobs, or best-practice settings for custom models), please provide as much information as possible, including dataset image examples and training logs, and verify you are following our Tips for Best Training Results ✅. Join the Ultralytics community where it suits you best. For real-time chat, head to Discord 🎧. Prefer in-depth discussions? Check out Discourse. Or dive into threads on our Subreddit to share knowledge with the community. UpgradeUpgrade to the latest pip install -U ultralyticsEnvironmentsYOLO may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):
StatusIf this badge is green, all Ultralytics CI tests are currently passing. CI tests verify correct operation of all YOLO Modes and Tasks on macOS, Windows, and Ubuntu every 24 hours and on every commit. |
Beta Was this translation helpful? Give feedback.
Hi @Brayan532 — you can enable MuSGD in standard Ultralytics training by setting
optimizer="MuSGD"(or leavingoptimizer="auto", which will select MuSGD for longer runs based on total iterations); for the exact behavior (which params get Muon vs plain SGD, and the internal Muon/SGD blend used), see the MuSGD/Muon API reference and the trainer’s optimizer construction logic.If you’re training a fully custom PyTorch loop (not
model.train()), you can also importMuSGDdirectly fromultralytics.optim.muonand pass explicit param groups (withuse_muon=Trueonly on …