Add support for missing quants in CPY (Metal & CUDA). #11987

gcp · 2025-02-20T22:45:09Z

ggerganov · 2025-02-21T06:27:08Z

Could you make a separate PR just for the Metal changes? Btw, I think the copy kernels could be implemented by reusing the dequantize_qX_X functions, likely with a single template + 4 instantiations. Would result in much smaller code change and allows to generalize in the future to other quantizations.

gcp · 2025-02-21T09:10:36Z

Btw, I think the copy kernels could be implemented by reusing the dequantize_qX_X functions, likely with a single template + 4 instantiations. Would result in much smaller code change and allows to generalize in the future to other quantizations.

Reusing the dequantize_qX_Y functions works, but doing it with templates is a bit tricky because dequantize_q8_0 swizzles its results differently than all the others (boo!). It would've been nice if this had just been a ggml_get_to_fp32_cuda call but that doesn't deal with the permutations the CPY code is expected to handle 😢

gcp added 2 commits February 19, 2025 23:45

cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (ggml-…

01d4b59

…org#10976)

metal: Copy kernels for quant to F32 conversions (ggml-org#10976).

71cef96

github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning Apple Metal https://en.wikipedia.org/wiki/Metal_(API) labels Feb 20, 2025

gcp closed this Feb 21, 2025

stephen-ebenezar mentioned this pull request Apr 25, 2025

Misc. bug: Unsupported op "CPY" / SIGABRT on Apple CPU #13112

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add support for missing quants in CPY (Metal & CUDA). #11987

Add support for missing quants in CPY (Metal & CUDA). #11987

Uh oh!

gcp commented Feb 20, 2025 •

edited

Loading

Uh oh!

ggerganov commented Feb 21, 2025

Uh oh!

gcp commented Feb 21, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add support for missing quants in CPY (Metal & CUDA). #11987

Add support for missing quants in CPY (Metal & CUDA). #11987

Uh oh!

Conversation

gcp commented Feb 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ggerganov commented Feb 21, 2025

Uh oh!

gcp commented Feb 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

gcp commented Feb 20, 2025 •

edited

Loading

gcp commented Feb 21, 2025 •

edited

Loading