b4770

github-actions released this 25 Feb 10:10
58d07a8
metal : copy kernels for quant to F32/F16 conversions (#12017)

metal: use dequantize_q templates

---------

Co-authored-by: Georgi Gerganov <[email protected]>