Skip to content

b6515

Latest

Choose a tag to compare

@github-actions github-actions released this 18 Sep 21:51
3edd87c
opencl: optimize mxfp4 kernels (#16037)

- flatten mxfp4 and packed fp4->fp16 bit-wise convert function (replace lut)
- MoE kernel optimizations

---------

Co-authored-by: Li He <[email protected]>