Fix CUDA DeepSeek FlashMLA-3 with quantized KV cache #400

Merged
ikawrakow merged 1 commit into main from ik/cuda_fix_quantized_flash_mla3
May 9, 2025

Commits

Commits on May 9, 2025