ggml : fix condition of im2col on Metal backend #15460
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixes #15426
Important
Unfortunately, this fix does NOT improve the performance of Gemma 3. Need to dig deeper...
Correct the condition in
ggml_metal_supports_op
to match theGGML_ASSERT
insideggml_metal_encode_node
Also remove the check for
src0->type
(aka kernel dtype), as we only need the shape of the kernel, not its type. It's also the same for other backends. For example, CUDA:llama.cpp/ggml/src/ggml-cuda/im2col.cu
Lines 76 to 86 in 1bc664a