Adaptive learning rate for Muon: NorMuon and AdaMuon #76

Merged
skyw merged 32 commits into NVIDIA-NeMo:main from mkhona-nvidia:adaptive_orthogonalized_optimizer
Nov 18, 2025

Commits

Commits on Nov 18, 2025