1 parent da569f1 commit 1499d4f
benchmarks/triton_kernels_benchmark/gemm_benchmark.py
@@ -32,7 +32,7 @@
 ] + [
 triton.Config(
 {'BLOCK_SIZE_M': 256, 'BLOCK_SIZE_N': 128, 'BLOCK_SIZE_K': 32, 'GROUP_SIZE_M': 4, 'grf_mode': 'large'},
-num_stages=s, num_warps=32) for s in [2, 3]
+num_stages=s, num_warps=32) for s in [2, 3, 4]
 {'BLOCK_SIZE_M': 64, 'BLOCK_SIZE_N': 128, 'BLOCK_SIZE_K': 32, 'GROUP_SIZE_M': 4, 'grf_mode': 'large'},
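The effect of this change can be illustrated with a minimal sketch. It assumes the changed line sits inside a list comprehension that emits one autotune config per `num_stages` value. `Config` here is a hypothetical stand-in dataclass rather than `triton.Config`, so the snippet runs without Triton installed, and `build_configs` is an illustrative helper that does not exist in the benchmark file.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Config:
    """Hypothetical stand-in for triton.Config (assumption, not the real class)."""
    kwargs: dict
    num_stages: int
    num_warps: int


def build_configs(stages):
    # Mirrors the comprehension in the diff: one config per num_stages value,
    # with the tile sizes and warp count held fixed.
    return [
        Config(
            {'BLOCK_SIZE_M': 256, 'BLOCK_SIZE_N': 128, 'BLOCK_SIZE_K': 32,
             'GROUP_SIZE_M': 4, 'grf_mode': 'large'},
            num_stages=s, num_warps=32) for s in stages
    ]


before = build_configs([2, 3])     # stage list before the commit
after = build_configs([2, 3, 4])   # stage list after the commit

print(len(before), len(after))
print([c.num_stages for c in after])
```

Under these assumptions the commit adds exactly one candidate to this group: the same 256x128x32 tile with `num_stages=4`. Each extra candidate is one more configuration the autotuner has to evaluate.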