Skip to content

Commit 74832a1

Browse files
authored
[https://nvbugs/5766986][fix] fixed the shard_all_unprocessed default value to align with the default.yml (#10271)
Signed-off-by: Eran Geva <[email protected]>
1 parent 1f0365d commit 74832a1

File tree

2 files changed

+1
-11
lines changed

2 files changed

+1
-11
lines changed

tensorrt_llm/_torch/auto_deploy/transform/library/sharding.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -136,7 +136,7 @@ class ShardingTransformConfig(TransformConfig):
136136
default_factory=lambda: [ShardingDim.TP, ShardingDim.EP, ShardingDim.BMM]
137137
)
138138
shard_all_unprocessed: bool = Field(
139-
default=False,
139+
default=True,
140140
description="When True, apply simple shard (column split + all_gather) to "
141141
"'leftover' linear nodes that are not part of any layer subgraph.",
142142
)

tests/integration/test_lists/waives.txt

Lines changed: 0 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -496,16 +496,6 @@ disaggregated/test_disaggregated.py::test_disaggregated_benchmark_on_diff_backen
496496
accuracy/test_llm_api_pytorch.py::TestDeepSeekR1::test_nvfp4_multi_gpus[latency_trtllmgen] SKIP (https://nvbugs/5715568)
497497
unittest/executor/test_rpc_proxy.py SKIP (https://nvbugs/5605741)
498498
unittest/executor/test_rpc_worker.py SKIP (https://nvbugs/5605741)
499-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_dist_backend.py::test_dist_backend_all_gather[torch] SKIP (https://nvbugs/5766986)
500-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_dist_backend.py::test_dist_backend_all_gather[trtllm] SKIP (https://nvbugs/5766986)
501-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding[Linear-torch_dist_all_gather-False-False-2] SKIP (https://nvbugs/5766982)
502-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding[Linear-torch_dist_all_gather-False-True-2] SKIP (https://nvbugs/5766982)
503-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding[Linear-torch_dist_all_gather-True-False-2] SKIP (https://nvbugs/5766982)
504-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding[Linear-torch_dist_all_gather-True-True-2] SKIP (https://nvbugs/5766982)
505-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding_pattern_detection[Linear-torch_dist_all_gather-False-False-8] SKIP (https://nvbugs/5766974)
506-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding_pattern_detection[Linear-torch_dist_all_gather-False-True-8] SKIP (https://nvbugs/5766974)
507-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding_pattern_detection[Linear-torch_dist_all_gather-True-False-8] SKIP (https://nvbugs/5766974)
508-
unittest/_torch/auto_deploy/unit/multigpu/transformations/library/test_tp_sharding.py::test_sharding_pattern_detection[Linear-torch_dist_all_gather-True-True-8] SKIP (https://nvbugs/5766974)
509499
triton_server/test_triton.py::test_gpt_gather_logits[gpt-gather-logits] SKIP (https://nvbugs/5766960)
510500
stress_test/stress_test.py::test_run_stress_test[llama-v3-8b-instruct-hf_tp1-stress_time_300s_timeout_450s-GUARANTEED_NO_EVICT-pytorch-stress-test] SKIP (https://nvbugs/5766952)
511501
accuracy/test_llm_api_pytorch.py::TestQwen3NextInstruct::test_bf16_4gpu[tp4ep4_cudagraph_overlap] SKIP (https://nvbugs/5722618)

0 commit comments

Comments
 (0)