[#10780][feat] AutoDeploy: Support per-expert scales in FP8 and NVFP4 MoE #11322

Merged
galagam merged 8 commits into NVIDIA:main from nv-auto-deploy:gagam/handle-non-identical-moe-scales-v2 on Feb 9, 2026

Commits

Commits on Feb 8, 2026