Skip to content

Commit 2c417db

Browse files
AmdSampsanaromero77amd
authored andcommitted
a few more configs for 2d grids (#2649)
Added two nice grid configs for the 2d pointwise kernel cases for WRT5 workload. Confirmed that they were picked up when using max autotune. (cherry picked from commit f1eac49)
1 parent 40d40ab commit 2c417db

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

torch/_inductor/runtime/triton_heuristics.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2505,6 +2505,8 @@ def pointwise(
25052505
triton_config_with_settings(size_hints, 64, 64), # ~8% better for fp16
25062506
triton_config_with_settings(size_hints, 256, 16),
25072507
triton_config_with_settings(size_hints, 16, 256),
2508+
triton_config_with_settings(size_hints, 128, 16), # wrt: +10% for some kernels
2509+
triton_config_with_settings(size_hints, 32, 512), # wrt: +30% for some kernels
25082510
triton_config_with_settings(size_hints, bs, 1),
25092511
triton_config_with_settings(size_hints, 1, bs),
25102512
*hinted_configs,

0 commit comments

Comments
 (0)