kperreau

I might have found a small mistake in ggml/src/ggml-cpu/ops.cpp, in the non-contiguous path of ggml_compute_forward_dup_f16 when dst->type == GGML_TYPE_F16.

The carry after each element uses ne00/ne01/ne02/ne03 (source dims). Since the indices i10..i13 point to dst, I think it should compare against ne0/ne1/ne2/ne3 (destination dims). The adjacent F32 branch seems to use ne0..ne3.
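To illustrate the concern, here is a minimal standalone model of the index carry. It is a sketch, not the code from `ops.cpp`: the helper names (`advance`, `offset`, `visited_offsets`) are hypothetical, the destination is assumed contiguous, and offsets are counted in elements, whereas the real function works with byte strides. It records which destination offsets get written when the carry compares against a given set of dims.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <vector>

using Dims = std::array<std::int64_t, 4>;

// Advance a 4-D index by one element, carrying into the next dimension
// whenever the current one reaches its extent in `ne`.
void advance(Dims & idx, const Dims & ne) {
    for (int d = 0; d < 4; ++d) {
        assert(ne[d] > 0);
        if (++idx[d] < ne[d]) {
            return;
        }
        idx[d] = 0; // wrap and carry into dimension d + 1
    }
}

// Element offset of `idx` in a contiguous tensor with extents `ne`
// (assumption: no padding or permutation in the destination).
std::int64_t offset(const Dims & idx, const Dims & ne) {
    return idx[0] + ne[0] * (idx[1] + ne[1] * (idx[2] + ne[2] * idx[3]));
}

// Walk `n` elements and return the destination offsets that would be written
// when the destination index is carried against `carry_ne`.
std::vector<std::int64_t> visited_offsets(const Dims & dst_ne, const Dims & carry_ne, std::int64_t n) {
    Dims idx = {0, 0, 0, 0};
    std::vector<std::int64_t> out;
    for (std::int64_t i = 0; i < n; ++i) {
        out.push_back(offset(idx, dst_ne));
        advance(idx, carry_ne);
    }
    return out;
}
```

With a 2x3 source and a 3x2 destination (6 elements each), carrying against the destination dims visits offsets 0 through 5 in order. Carrying against the source dims visits 0, 1, 3, 4, 6, 7 in this model, so two writes land outside the 6-element destination. When the source and destination shapes match, both carries behave identically, so the difference only shows up when the shapes differ.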

Apologies if I’m missing context. If this is intentional, feel free to close.

@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Aug 28, 2025
@ggerganov
Member

Yes, looks like a bug. Can you try to add a test-backend-ops test that demonstrates the problem and gets fixed with this change?
