Skip to content

Commit 04aa3e4

Browse files
AmdSampsajataylo
authored andcommitted
a few more configs for 2d grids (ROCm#2649)
Added two nice grid configs for the 2d pointwise kernel cases for WRT5 workload. Confirmed that they were picked up when using max autotune. (cherry picked from commit f1eac49) (cherry picked from commit 2e79001)
1 parent b9e0182 commit 04aa3e4

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

torch/_inductor/runtime/triton_heuristics.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2502,6 +2502,8 @@ def pointwise(
25022502
triton_config_with_settings(size_hints, 64, 64), # ~8% better for fp16
25032503
triton_config_with_settings(size_hints, 256, 16),
25042504
triton_config_with_settings(size_hints, 16, 256),
2505+
triton_config_with_settings(size_hints, 128, 16), # wrt: +10% for some kernels
2506+
triton_config_with_settings(size_hints, 32, 512), # wrt: +30% for some kernels
25052507
triton_config_with_settings(size_hints, bs, 1),
25062508
triton_config_with_settings(size_hints, 1, bs),
25072509
*hinted_configs,

0 commit comments

Comments
 (0)