Skip to content

Pull requests: ikawrakow/ik_llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

CUDA: FA optimization for models using SWA
#752 opened Sep 2, 2025 by ikawrakow Loading…
Offload only activated experts to the GPU
#698 opened Aug 16, 2025 by ikawrakow Loading…
[DRAFT] Function call updates
#670 opened Aug 2, 2025 by iSevenDays Loading…
2 of 4 tasks
Add GitHub data: backup and convertion scripts + backup update
#653 opened Jul 26, 2025 by ThomasBaruzier Loading…
2 of 4 tasks
Quantization tweaks
#624 opened Jul 17, 2025 by ikawrakow Loading…
Another minor readme update
#592 opened Jul 8, 2025 by saood06 Loading…
Make sure MMVQ is supported before using it
#487 opened Jun 3, 2025 by ikawrakow Loading…
Remove GGML_IQK_MUL_MAT option
#457 opened May 25, 2025 by ikawrakow Loading…
mmap backed KV cache
#290 opened Mar 25, 2025 by saood06 Draft
2 of 4 tasks
Feat/lock free server
#236 opened Feb 27, 2025 by orca-zhang Draft
2 of 4 tasks
Some minor quant strategies tweaks
#117 opened Nov 22, 2024 by Nexesenex Loading…
2 of 4 tasks
AVX2/Zen4 horizontal sums
#57 opened Sep 17, 2024 by ikawrakow Loading…
Binary KQ mask
#28 opened Aug 28, 2024 by ikawrakow Draft
ProTip! no:milestone will show everything without a milestone.