Skip to content
Discussion options

You must be logged in to vote

your understanding is correct. it is not optimized for avx. We have optimized kernel for ARM neon chips and in the process of refining that:
https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/core/mlas/lib/halfgemm_kernel_neon.cpp

Replies: 2 comments 1 reply

Comment options

You must be logged in to vote
1 reply
@jeyblu
Comment options

Answer selected by sophies927
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
3 participants