Skip to content

Commit d13e652

Browse files
AmdSampsanaromero77amd
authored andcommitted
a few more configs for 2d grids (#2649)
Added two nice grid configs for the 2d pointwise kernel cases for WRT5 workload. Confirmed that they were picked up when using max autotune. (cherry picked from commit f1eac49) (cherry picked from commit 2e79001) (cherry picked from commit 04aa3e4)
1 parent c269eb3 commit d13e652

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

torch/_inductor/runtime/triton_heuristics.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1913,6 +1913,8 @@ def pointwise(
19131913
triton_config_with_settings(size_hints, 64, 64), # ~8% better for fp16
19141914
triton_config_with_settings(size_hints, 256, 16),
19151915
triton_config_with_settings(size_hints, 16, 256),
1916+
triton_config_with_settings(size_hints, 128, 16), # wrt: +10% for some kernels
1917+
triton_config_with_settings(size_hints, 32, 512), # wrt: +30% for some kernels
19161918
triton_config_with_settings(size_hints, bs, 1),
19171919
triton_config_with_settings(size_hints, 1, bs),
19181920
*hinted_configs,

0 commit comments

Comments
 (0)