float16 support for AMD64 target #23536
Unanswered
georgen117
asked this question in
Other Q&A
Replies: 0 comments
I am working on adding MLAS_TARGET_AMD64 support to the fp16_common.h header.
That header currently implements many inline functions as wrappers around ARM NEON intrinsics.
I have implemented AMD64 versions of those functions using intrinsics from immintrin.h.
What is the preferred way to organize the new code?
Does the community prefer:
Option 1: implement all the AMD64 versions of the functions in one block, followed by all the ARM NEON versions in another block.
Or is this still preferred?
Option 2: interleave the AMD64 and ARM NEON code, function by function.
I hope it is clear what I am asking.
For the float32 version of similar code, found in mlasi.h, Option 2 was used.
However, the code I have been implementing currently uses Option 1, so I wanted to know if there is a preference so that I can switch if needed.
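To make the two layouts concrete, here is a minimal sketch of what I mean. It is not the actual fp16_common.h code: the `SKETCH_*` macros, the `grouped` and `interleaved` namespaces, and the `Add4`/`Mul4` functions are hypothetical names, and float32 lanes stand in for fp16 so the example builds without extra compiler flags.

```cpp
// Layout sketch only. All names here are hypothetical, and float32 is used
// in place of fp16 so that no special compiler flags are required.
#if defined(__x86_64__) || defined(_M_X64)
#define SKETCH_TARGET_AMD64  // stand-in for MLAS_TARGET_AMD64
#include <immintrin.h>
#elif defined(__aarch64__) || defined(_M_ARM64)
#define SKETCH_TARGET_ARM64  // stand-in for the NEON guard
#include <arm_neon.h>
#endif

// Option 1: one block per target, each holding every function.
namespace grouped {
#if defined(SKETCH_TARGET_AMD64)
inline void Add4(const float* a, const float* b, float* out) {
    _mm_storeu_ps(out, _mm_add_ps(_mm_loadu_ps(a), _mm_loadu_ps(b)));
}
inline void Mul4(const float* a, const float* b, float* out) {
    _mm_storeu_ps(out, _mm_mul_ps(_mm_loadu_ps(a), _mm_loadu_ps(b)));
}
#elif defined(SKETCH_TARGET_ARM64)
inline void Add4(const float* a, const float* b, float* out) {
    vst1q_f32(out, vaddq_f32(vld1q_f32(a), vld1q_f32(b)));
}
inline void Mul4(const float* a, const float* b, float* out) {
    vst1q_f32(out, vmulq_f32(vld1q_f32(a), vld1q_f32(b)));
}
#else  // scalar fallback so the sketch builds on other targets
inline void Add4(const float* a, const float* b, float* out) {
    for (int i = 0; i < 4; ++i) out[i] = a[i] + b[i];
}
inline void Mul4(const float* a, const float* b, float* out) {
    for (int i = 0; i < 4; ++i) out[i] = a[i] * b[i];
}
#endif
}  // namespace grouped

// Option 2: one definition per function, targets interleaved inside it.
namespace interleaved {
inline void Add4(const float* a, const float* b, float* out) {
#if defined(SKETCH_TARGET_AMD64)
    _mm_storeu_ps(out, _mm_add_ps(_mm_loadu_ps(a), _mm_loadu_ps(b)));
#elif defined(SKETCH_TARGET_ARM64)
    vst1q_f32(out, vaddq_f32(vld1q_f32(a), vld1q_f32(b)));
#else
    for (int i = 0; i < 4; ++i) out[i] = a[i] + b[i];
#endif
}
inline void Mul4(const float* a, const float* b, float* out) {
#if defined(SKETCH_TARGET_AMD64)
    _mm_storeu_ps(out, _mm_mul_ps(_mm_loadu_ps(a), _mm_loadu_ps(b)));
#elif defined(SKETCH_TARGET_ARM64)
    vst1q_f32(out, vmulq_f32(vld1q_f32(a), vld1q_f32(b)));
#else
    for (int i = 0; i < 4; ++i) out[i] = a[i] * b[i];
#endif
}
}  // namespace interleaved
```

Both layouts compile to the same functions. The difference is only in how easy it is to compare the AMD64 and NEON versions of a single function (Option 2) versus reading one target's implementation from top to bottom (Option 1).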