Skip to content

Commit 48206f3

Browse files
authored
[https://nvbugs/5850094][fix] Fix MoE cost estimation for auto multi-stream scheduling (#11160)
Signed-off-by: Yi Zhang <187001205+yizhang-nv@users.noreply.github.com>
1 parent 50379d0 commit 48206f3

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

tensorrt_llm/_torch/compilation/multi_stream/auto_multi_stream.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,7 @@ def estimate_time(node: Node) -> int:
3939
moe_ops = {
4040
torch.ops.trtllm.fp4_block_scale_moe_runner.default,
4141
torch.ops.trtllm.fused_moe.default,
42+
torch.ops.trtllm.moe_custom_op.default,
4243
}
4344

4445
gemm_ops = {

0 commit comments

Comments
 (0)