Skip to content

Actions: ggml-org/llama.cpp

Actions

Build on RISCV Linux Machine by Cloud-V

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
2,612 workflow run results
2,612 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Refactor llama-model.cpp
Build on RISCV Linux Machine by Cloud-V #2980: Pull request #16252 synchronize by pwilkin
CUDA: Fix bug in topk-moe for gpt-oss
Build on RISCV Linux Machine by Cloud-V #2978: Pull request #16821 synchronize by am17an
CUDA: Fix bug in topk-moe for gpt-oss
Build on RISCV Linux Machine by Cloud-V #2975: Pull request #16821 opened by am17an
server : remove n_past
Build on RISCV Linux Machine by Cloud-V #2970: Pull request #16818 opened by ggerganov
cuda : Add conv2d Implicit GEMM
Build on RISCV Linux Machine by Cloud-V #2968: Pull request #15805 synchronize by bssrdf
cuda : Add conv2d Implicit GEMM
Build on RISCV Linux Machine by Cloud-V #2967: Pull request #15805 reopened by bssrdf
llama-cli: add support for reasoning
Build on RISCV Linux Machine by Cloud-V #2965: Pull request #16603 synchronize by bandoti
metal : initial Metal4 tensor API support
Build on RISCV Linux Machine by Cloud-V #2961: Pull request #16634 synchronize by ggerganov
server : support unified cache across slots
Build on RISCV Linux Machine by Cloud-V #2960: Pull request #16736 synchronize by ggerganov
memory : remove KV cache size padding
Build on RISCV Linux Machine by Cloud-V #2959: Pull request #16812 synchronize by ggerganov
llama: Fused QKV multiplication
Build on RISCV Linux Machine by Cloud-V #2955: Pull request #16813 synchronize by am17an
server : support unified cache across slots
Build on RISCV Linux Machine by Cloud-V #2952: Pull request #16736 synchronize by ggerganov