Actions: ggml-org/llama.cpp

Showing runs from all workflows
43,094 workflow run results

CUDA: add bf16 and f32 support to cublas_mul_mat_batched
Pull Request Labeler #12744: Pull request #14361 synchronize by am17an
15m 12s
llama : add high-throughput mode
Pull Request Labeler #12743: Pull request #14363 opened by ggerganov
4m 0s
llama : expose C API to get layer device type
Pull Request Labeler #12739: Pull request #14358 synchronize by okaris
1m 13s
llama : expose C API to get layer device type
Pull Request Labeler #12738: Pull request #14358 synchronize by okaris
29s