Skip to content

Commit e3fec17

Browse files
ikawrakowIwan Kawrakow
andauthored
Fix DeepSeek FA (#382)
Co-authored-by: Iwan Kawrakow <[email protected]>
1 parent f7c9a0f commit e3fec17

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

ggml/src/ggml-cuda/fattn.cu

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -521,6 +521,7 @@ void ggml_cuda_flash_attn_ext(ggml_backend_cuda_context & ctx, ggml_tensor * dst
521521
// K and V head sizes.
522522
if (K->ne[0] != V->ne[0]) {
523523
ggml_cuda_flash_attn_ext_wmma_f16(ctx, dst);
524+
return;
524525
}
525526

526527
ggml_cuda_flash_attn_ext_mma_f16(ctx, dst);

0 commit comments

Comments
 (0)