Commit 217594f

JChunX authored and facebook-github-bot committed
Zen LLATTE CoFormer Triton FP8 tune (#4951)
Summary:
Pull Request resolved: #4951

X-link: facebookresearch/FBGEMM#1971

Tune these FP8 shapes:
```
m,n,k,context
3072,4096,4096,"call__kernel_matmul_fp8_row_non_persistent_0"
3072,5120,5120,"call__kernel_matmul_fp8_row_non_persistent_2"
3072,10752,5120,"call__kernel_matmul_fp8_row_non_persistent_3"
```

Reviewed By: pranavsharma

Differential Revision: D83583235

fbshipit-source-id: 21b68ecbbfa163f39b9b7709ac651944ab74dfdc
1 parent f5a86c6 commit 217594f
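
For a sense of scale, the three tuned shapes can be turned into FLOP counts and output-tile counts. The sketch below is not part of the commit. It parses the shape list from the summary, uses the standard 2·m·n·k FLOP count for a GEMM, and assumes (this is not confirmed anywhere in the commit) that the first two fields of each tuple added in the diff are the M and N tile sizes. The helper names `load_shapes`, `gemm_flops`, and `output_tiles` are hypothetical and are not FBGEMM APIs.

```python
import csv
import io
import math

# Shapes copied verbatim from the commit summary.
SHAPES_CSV = '''m,n,k,context
3072,4096,4096,"call__kernel_matmul_fp8_row_non_persistent_0"
3072,5120,5120,"call__kernel_matmul_fp8_row_non_persistent_2"
3072,10752,5120,"call__kernel_matmul_fp8_row_non_persistent_3"
'''

# The two tuples added by this commit.
# ASSUMPTION: fields 0 and 1 are the M and N tile sizes; the commit does not
# name the tuple fields, so the remaining fields are left uninterpreted.
ADDED_CONFIGS = [
    (128, 256, 128, 2, 1, 2, 16, 2, 4, 1),
    (256, 128, 64, 2, 1, 2, 16, 1, 4, 2),
]


def load_shapes(text):
    """Parse the m,n,k,context table into (m, n, k, context) tuples."""
    rows = csv.DictReader(io.StringIO(text))
    return [(int(r["m"]), int(r["n"]), int(r["k"]), r["context"]) for r in rows]


def gemm_flops(m, n, k):
    """Multiply-add count for an (m x k) @ (k x n) product, as 2*m*n*k."""
    return 2 * m * n * k


def output_tiles(m, n, block_m, block_n):
    """Number of output tiles when the m x n result is cut into block_m x block_n tiles."""
    return math.ceil(m / block_m) * math.ceil(n / block_n)


if __name__ == "__main__":
    for m, n, k, context in load_shapes(SHAPES_CSV):
        print(f"{context}: m={m} n={n} k={k} flops={gemm_flops(m, n, k):.3e}")
        for cfg in ADDED_CONFIGS:
            block_m, block_n = cfg[0], cfg[1]  # assumed meaning, see above
            tiles = output_tiles(m, n, block_m, block_n)
            print(f"  tile {block_m}x{block_n}: {tiles} output tiles")
```

If the tile-size assumption holds, the tile count gives a rough idea of how much parallel work each candidate configuration exposes for a given shape. It says nothing about which configuration is actually faster, which only benchmarking can show.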

1 file changed: +2 -0 lines changed

fbgemm_gpu/experimental/gemm/triton_gemm/fp8_gemm.py

Lines changed: 2 additions & 0 deletions
@@ -3838,6 +3838,8 @@ def get_full_non_persistent_tuning_space():
 (128, 64, 64, 4, 1, 0, 16, 2, 4, 2),
 (128, 64, 64, 1, 1, 0, 16, 2, 4, 2),
 (256, 128, 128, 1, 1, 2, 16, 1, 8, 2),
+(128, 256, 128, 2, 1, 2, 16, 2, 4, 1),
+(256, 128, 64, 2, 1, 2, 16, 1, 4, 2),
 ]