Skip to content

CUDA: muh faster prompt processing for MoE models and small u-batch sizes#728

Merged
ikawrakow merged 23 commits intomainfrom
ik/add_mmq_id
Aug 26, 2025
Merged

CUDA: muh faster prompt processing for MoE models and small u-batch sizes#728
ikawrakow merged 23 commits intomainfrom
ik/add_mmq_id

Commits

Commits on Aug 24, 2025

Commits on Aug 25, 2025

Commits on Aug 26, 2025