Skip to content

Commit 2e79001

Browse files
AmdSampsapytorchmergebot
authored andcommitted
a few more configs for 2d grids (#2649)
Added two nice grid configs for the 2d pointwise kernel cases for WRT5 workload. Confirmed that they were picked up when using max autotune. (cherry picked from commit f1eac49)
1 parent 10af207 commit 2e79001

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

torch/_inductor/runtime/triton_heuristics.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2543,6 +2543,8 @@ def pointwise(
25432543
triton_config_with_settings(size_hints, 64, 64), # ~8% better for fp16
25442544
triton_config_with_settings(size_hints, 256, 16),
25452545
triton_config_with_settings(size_hints, 16, 256),
2546+
triton_config_with_settings(size_hints, 128, 16), # wrt: +10% for some kernels
2547+
triton_config_with_settings(size_hints, 32, 512), # wrt: +30% for some kernels
25462548
triton_config_with_settings(size_hints, bs, 1),
25472549
triton_config_with_settings(size_hints, 1, bs),
25482550
*hinted_configs,

0 commit comments

Comments
 (0)