Pull requests: ggml-org/llama.cpp

WIP: ggml-cuda: Add bf16 cuda support to fattn (Flash Attention)
Labels: examples, ggml, Nvidia GPU, python
#15261 opened Aug 12, 2025 by eous

musa: fix build warnings
Labels: ggml, Nvidia GPU
#15258 opened Aug 12, 2025 by yeahdongcn

vulkan: fuse adds
Labels: ggml, testing, Vulkan
#15252 opened Aug 11, 2025 by jeffbolznv

ci : Enable pre-built cuda releases on ubuntu (#5106)
Labels: devops
#15249 opened Aug 11, 2025 by michaelgiba

vulkan: perf_logger improvements
Labels: ggml, Vulkan
#15246 opened Aug 11, 2025 by jeffbolznv

Fix HIP warp synchronization function conflicts for ROCm 7.0+
Labels: ggml, Nvidia GPU
#15241 opened Aug 11, 2025 by slojosic-amd

server: implement GLM-style MTP
Labels: examples, hot, server
#15225 opened Aug 11, 2025 by F1LM1 (Draft)

Adding Resume for curl downloads
Labels: testing
#15217 opened Aug 10, 2025 by taf2

ci : add copilot-setup-steps.yml
Labels: devops
#15214 opened Aug 10, 2025 by CISC

introduce how to build with Vulkan on Raspbian OS
Labels: documentation
#15206 opened Aug 10, 2025 by MaoJianwei

webui: prettify styling
Labels: examples, server
#15201 opened Aug 9, 2025 by olegshulyakov
11 tasks done

ggml-rpc: chunk send()/recv() to avoid EINVAL for very large tensors over RPC (macOS & others)
Labels: ggml
#15188 opened Aug 9, 2025 by Tak-RS

common : add GLM-4.5 tool calling support
#15186 opened Aug 8, 2025 by dhandhalyabhavik

ggml: add conv3d op
Labels: ggml, testing
#15182 opened Aug 8, 2025 by rmatif

gpt-oss: implement harmony parsing
Labels: examples, hot, server, testing
#15181 opened Aug 8, 2025 by aldehir

server : enable -td and -tbd parameters
Labels: examples, server
#15172 opened Aug 8, 2025 by CISC

MoE Expert manipulation args
Labels: demo
#15165 opened Aug 8, 2025 by kooshi

tool-call: Qwen3 Coder chat format support
Labels: testing
#15162 opened Aug 8, 2025 by ochafik (Draft)