Skip to content

Commit 8ca9cbd

Browse files
AmdSampsanaromero77amd
authored andcommitted
a few more configs for 2d grids (#2649)
Added two nice grid configs for the 2d pointwise kernel cases for WRT5 workload. Confirmed that they were picked up when using max autotune. (cherry picked from commit f1eac49) (cherry picked from commit 2e79001)
1 parent 32bb1da commit 8ca9cbd

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

torch/_inductor/runtime/triton_heuristics.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2601,6 +2601,8 @@ def pointwise(
26012601
triton_config_with_settings(size_hints, 64, 64), # ~8% better for fp16
26022602
triton_config_with_settings(size_hints, 256, 16),
26032603
triton_config_with_settings(size_hints, 16, 256),
2604+
triton_config_with_settings(size_hints, 128, 16), # wrt: +10% for some kernels
2605+
triton_config_with_settings(size_hints, 32, 512), # wrt: +30% for some kernels
26042606
triton_config_with_settings(size_hints, bs, 1),
26052607
triton_config_with_settings(size_hints, 1, bs),
26062608
*hinted_configs,

0 commit comments

Comments
 (0)