Skip to content

Commit 0d243f2

Browse files
authored
[ROCm][MoE] mi300 mixtral8x7B perf for specific BS (#13577)
Signed-off-by: Divakar Verma <[email protected]>
1 parent 88f6ba3 commit 0d243f2

File tree

3 files changed

+5
-5
lines changed

3 files changed

+5
-5
lines changed

vllm/model_executor/layers/fused_moe/configs/E=8,N=14336,device_name=AMD_Instinct_MI300X.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -45,8 +45,8 @@
4545
},
4646
"16": {
4747
"BLOCK_SIZE_M": 16,
48-
"BLOCK_SIZE_N": 16,
49-
"BLOCK_SIZE_K": 256,
48+
"BLOCK_SIZE_N": 64,
49+
"BLOCK_SIZE_K": 64,
5050
"GROUP_SIZE_M": 1,
5151
"num_warps": 2,
5252
"num_stages": 2,

vllm/model_executor/layers/fused_moe/configs/E=8,N=3584,device_name=AMD_Instinct_MI300X.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -45,8 +45,8 @@
4545
},
4646
"16": {
4747
"BLOCK_SIZE_M": 16,
48-
"BLOCK_SIZE_N": 32,
49-
"BLOCK_SIZE_K": 256,
48+
"BLOCK_SIZE_N": 64,
49+
"BLOCK_SIZE_K": 128,
5050
"GROUP_SIZE_M": 1,
5151
"num_warps": 2,
5252
"num_stages": 2,

vllm/model_executor/layers/fused_moe/configs/E=8,N=7168,device_name=AMD_Instinct_MI300X.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -128,7 +128,7 @@
128128
"num_warps": 8,
129129
"num_stages": 2,
130130
"waves_per_eu": 0,
131-
"matrix_instr_nonkdim": 32,
131+
"matrix_instr_nonkdim": 16,
132132
"kpack": 2
133133
},
134134
"512": {

0 commit comments

Comments
 (0)