Skip to content

Commit 2d423a3

Browse files
xiaohuguo2023naromero77amd
authored andcommitted
add one more reduction config to avoid large vgpr spills
1 parent 4b62333 commit 2d423a3

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

torch/_inductor/runtime/triton_heuristics.py

Lines changed: 8 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2564,6 +2564,14 @@ def make_config(x, r, num_warps=None, num_stages=1, register_intensive=False):
25642564
num_stages=1,
25652565
waves_per_eu=2
25662566
))
2567+
result_configs.append(triton_config_reduction(
2568+
size_hints,
2569+
512,
2570+
8,
2571+
num_warps=4,
2572+
num_stages=1,
2573+
waves_per_eu=1
2574+
))
25672575

25682576
return result_configs
25692577

0 commit comments

Comments
 (0)