Skip to content

Commit 099ba67

Browse files
liqunfurohan11235813
authored andcommitted
Fix typos so to call correct vnni functions under vnni condition (#21625)
### Description Fix 2 typos in mlas avx 4bit gemm implementation to call correct vnni functions under vnni condition ### Motivation and Context needed for 1.19.0 release Signed-off-by: liqunfu <[email protected]>
1 parent 7377a72 commit 099ba67

File tree

2 files changed

+3
-3
lines changed

2 files changed

+3
-3
lines changed

onnxruntime/core/mlas/lib/sqnbitgemm_kernel_avx512_int8_blklen16.h

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -679,9 +679,9 @@ Q4Int8GemmR1xC1BlkLen16Avx512(
679679
const __m512i av_01_epi8 = _mm512_loadu_si512((const __m512i*)(QuantAPtr + 64));
680680

681681
if constexpr (vnni) {
682-
accumulate_blklen16_r1c1blk8_avx512(av_00_epi8, av_01_epi8, QuantBDataPtr, QuantAScalePtr, QuantBScalePtr, acc0);
683-
} else {
684682
accumulate_blklen16_r1c1blk8_avx512vnni(av_00_epi8, av_01_epi8, QuantBDataPtr, QuantAScalePtr, QuantBScalePtr, acc0);
683+
} else {
684+
accumulate_blklen16_r1c1blk8_avx512(av_00_epi8, av_01_epi8, QuantBDataPtr, QuantAScalePtr, QuantBScalePtr, acc0);
685685
}
686686

687687
QuantAPtr += BlkLen16 * PerAccuBlk8;

onnxruntime/core/mlas/lib/sqnbitgemm_kernel_avx512_int8_blklen32.h

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -721,7 +721,7 @@ Q4Int8GemmR1xC1BlkLen32Avx512(
721721
accumulate_blklen32_r1c1blk4_avx512vnni(av_00_epi8, av_01_epi8, QuantBDataPtr, QuantAScalePtr, QuantBScalePtr, acc0);
722722
}
723723
else {
724-
accumulate_blklen32_r1c1blk4_avx512vnni(av_00_epi8, av_01_epi8, QuantBDataPtr, QuantAScalePtr, QuantBScalePtr, acc0);
724+
accumulate_blklen32_r1c1blk4_avx512(av_00_epi8, av_01_epi8, QuantBDataPtr, QuantAScalePtr, QuantBScalePtr, acc0);
725725
}
726726

727727
QuantAPtr += BlkLen32 * PerAccuBlk4;

0 commit comments

Comments
 (0)