CoreML is used only for the encoder; decoding is handled by GGML. CoreML doesn't support these specialized quantization methods, but because the decoder runs in GGML rather than CoreML, it should in theory still be possible to run the decoder from a quantized model.

In other words, the combination would be: unquantized CoreML encoder model + quantized GGML model.
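
To make the pairing concrete, here is a minimal sketch of how a quantized GGML file and an unquantized CoreML encoder could be matched up by file name. The naming convention is an assumption on my part, not something stated in this thread: the CoreML encoder is assumed to sit next to the GGML file as `<name>-encoder.mlmodelc`, and a trailing `-qX_Y` quantization suffix on the GGML file name is assumed to be ignored. The helper `coreml_encoder_path` is hypothetical and only illustrates the idea; check the actual loader before relying on it.

```python
import re
from pathlib import PurePosixPath

# Assumption: quantized GGML files end in a suffix such as "-q5_0" or "-q8_0".
_QUANT_SUFFIX = re.compile(r"-q\d_\w$")


def coreml_encoder_path(ggml_model_path: str) -> str:
    """Hypothetical helper: derive the CoreML encoder path for a GGML model.

    Under the assumed convention, a quantized and an unquantized GGML file
    for the same model both resolve to the same (unquantized) CoreML encoder.
    """
    path = PurePosixPath(ggml_model_path)
    stem = path.stem                    # drop the ".bin" extension
    stem = _QUANT_SUFFIX.sub("", stem)  # drop an optional "-qX_Y" suffix
    return str(path.with_name(stem + "-encoder.mlmodelc"))


if __name__ == "__main__":
    for model in ("models/ggml-base.en.bin", "models/ggml-base.en-q5_0.bin"):
        print(model, "->", coreml_encoder_path(model))
```

Under that assumption, both model files map to one shared encoder directory, so only the GGML side of the pair changes when you switch to a quantized model.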

Answer selected by ggerganov