Commit e781c54

yuantailing authored and dominicshanshan committed
[None][fix] Fix is_post_quant_all2all_supported for MNNVL (NVIDIA#8355)
Signed-off-by: Tailing Yuan <[email protected]>
1 parent 4e4617c commit e781c54

1 file changed: +1 -1 lines changed

tensorrt_llm/_torch/modules/fused_moe/fused_moe_wide_ep.py

Lines changed: 1 addition & 1 deletion
@@ -445,7 +445,7 @@ def is_post_quant_all2all_supported(self):
         if not self.use_postquant_alltoall:
             return False
         if self.alltoall_method_type == AlltoallMethodType.MNNVL:
-            return False
+            return True
         elif self.alltoall_method_type == AlltoallMethodType.DeepEP:
             return self.has_nvfp4
         elif self.alltoall_method_type == AlltoallMethodType.DeepEPLowLatency:
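
The effect of the one-line change can be checked in isolation with a small stand-in. The sketch below is not the TensorRT-LLM module: the enum is a hypothetical local copy holding only the members named in the hunk, the method is rewritten as a free function taking the three attributes as arguments, and the branches the hunk does not show are left unimplemented rather than guessed.

```python
from enum import Enum, auto


class AlltoallMethodType(Enum):
    # Hypothetical stand-in: only the members visible in the hunk.
    MNNVL = auto()
    DeepEP = auto()
    DeepEPLowLatency = auto()


def is_post_quant_all2all_supported(use_postquant_alltoall: bool,
                                    alltoall_method_type: AlltoallMethodType,
                                    has_nvfp4: bool) -> bool:
    """Mirror of the post-commit logic for the branches shown in the diff."""
    if not use_postquant_alltoall:
        return False
    if alltoall_method_type == AlltoallMethodType.MNNVL:
        # This branch returned False before commit e781c54.
        return True
    elif alltoall_method_type == AlltoallMethodType.DeepEP:
        return has_nvfp4
    # The body of the DeepEPLowLatency branch (and anything after it) is
    # outside the hunk, so it is deliberately not reproduced here.
    raise NotImplementedError("branch not shown in the diff")
```

With this stand-in, MNNVL reports support whenever `use_postquant_alltoall` is set, regardless of `has_nvfp4`, while DeepEP still depends on `has_nvfp4`.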
