fix: pass layer_wise_distributed_optimizer via config instead of kwarg
The test was passing layer_wise_distributed_optimizer as a keyword arg
to get_megatron_muon_optimizer(), but that function doesn't accept it.
Set it on the OptimizerConfig object instead.
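A minimal sketch of the before/after call pattern, for illustration only. The
OptimizerConfig dataclass and get_megatron_muon_optimizer() below are
stand-ins with assumed signatures and a placeholder return value, not the
real Megatron definitions; only the names come from this commit.

```python
from dataclasses import dataclass


@dataclass
class OptimizerConfig:
    """Stand-in config (assumption): only the field relevant to this fix."""
    layer_wise_distributed_optimizer: bool = False


def get_megatron_muon_optimizer(config, model_chunks):
    """Stand-in (assumed signature): takes no layer_wise_distributed_optimizer kwarg."""
    # Placeholder return value so the effect of the config flag is observable.
    return {
        "layer_wise": config.layer_wise_distributed_optimizer,
        "chunks": model_chunks,
    }


# Before: the flag is passed as a keyword argument, which the function rejects.
try:
    get_megatron_muon_optimizer(
        OptimizerConfig(), [], layer_wise_distributed_optimizer=True
    )
    kwarg_rejected = False
except TypeError:
    kwarg_rejected = True

# After: the flag is set on the config object and the call has no extra kwarg.
config = OptimizerConfig()
config.layer_wise_distributed_optimizer = True
optimizer = get_megatron_muon_optimizer(config, [])
```

With these stand-ins, the first call raises TypeError because of the
unexpected keyword argument, while the second call picks the flag up from
the config object.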
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>