Workflow runs · ggml-org/llama.cpp · GitHub

Actions

All workflows
Workflows
- Build Actions Cache Build Actions Cache
- Build on Linux using cross-compiler Build on Linux using cross-compiler
- Build on RISCV Linux Machine by Cloud-V Build on RISCV Linux Machine by Cloud-V
- Build relocatable cmake package Build relocatable cmake package
- Check Pre-Tokenizer Hashes Check Pre-Tokenizer Hashes
- Check vendor Check vendor
- CI CI
- CI (AMD) CI (AMD)
- Close inactive issues Close inactive issues
- Copilot code review Copilot code review
Management
- Caches

All workflows

Actions

Loading...
Loading

Showing runs from all workflows

43,094 workflow run results

43,094 workflow run results

CUDA: add bf16 and f32 support to cublas_mul_mat_batched Pull Request Labeler #12744: Pull request #14361 synchronize by am17an

15m 12s

15m 12s

llama : add high-throughput mode Server #15487: Pull request #14363 opened by ggerganov

20m 25s gg/llama-high-throughput

gg/llama-high-throughput

20m 25s

llama : add high-throughput mode CI #24091: Pull request #14363 opened by ggerganov

23m 17s gg/llama-high-throughput

gg/llama-high-throughput

23m 17s

llama : add high-throughput mode Pull Request Labeler #12743: Pull request #14363 opened by ggerganov

4m 0s

4m 0s

server : fix assistant prefilling when content is an array Server #15486: Pull request #14360 synchronize by CISC

11m 6s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

11m 6s

server : fix assistant prefilling when content is an array CI #24090: Pull request #14360 synchronize by CISC

55m 22s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

55m 22s

server : fix assistant prefilling when content is an array EditorConfig Checker #27054: Pull request #14360 synchronize by CISC

17s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

17s

server : fix assistant prefilling when content is an array flake8 Lint #18506: Pull request #14360 synchronize by CISC

18s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

18s

server : fix assistant prefilling when content is an array Python Type-Check #3086: Pull request #14360 synchronize by CISC

2m 3s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

2m 3s

server : fix assistant prefilling when content is an array Pull Request Labeler #12742: Pull request #14360 synchronize by CISC

11s

11s

add tests Python Type-Check #3085: Commit 2aac8e8 pushed by CISC

1m 34s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

1m 34s

cmake : use LLAMA_BUILD_NUMBER when defining LLAMA_INSTALL_VERSION CI #24089: Pull request #14362 opened by mbaudier

1h 10m 19s mbaudier:use-llama-build-number-for-version

mbaudier:use-llama-build-number-for-version

1h 10m 19s

cmake : use LLAMA_BUILD_NUMBER when defining LLAMA_INSTALL_VERSION EditorConfig Checker #27053: Pull request #14362 opened by mbaudier

56s mbaudier:use-llama-build-number-for-version

mbaudier:use-llama-build-number-for-version

56s

cmake : use LLAMA_BUILD_NUMBER when defining LLAMA_INSTALL_VERSION Server #15485: Pull request #14362 opened by mbaudier

20m 40s mbaudier:use-llama-build-number-for-version

mbaudier:use-llama-build-number-for-version

20m 40s

cmake : use LLAMA_BUILD_NUMBER when defining LLAMA_INSTALL_VERSION Pull Request Labeler #12741: Pull request #14362 opened by mbaudier

4m 41s

4m 41s

server : fix assistant prefilling when content is an array CI #24088: Pull request #14360 synchronize by CISC

52m 36s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

52m 36s

server : fix assistant prefilling when content is an array EditorConfig Checker #27052: Pull request #14360 synchronize by CISC

3m 41s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

3m 41s

server : fix assistant prefilling when content is an array Server #15484: Pull request #14360 synchronize by CISC

12m 0s cisc/assistant-prefilling-content-array

cisc/assistant-prefilling-content-array

12m 0s

server : fix assistant prefilling when content is an array Pull Request Labeler #12740: Pull request #14360 synchronize by CISC

12s

12s

llama : expose C API to get layer device type Pull Request Labeler #12739: Pull request #14358 synchronize by okaris

1m 13s

1m 13s

llama : expose C API to get layer device type Pull Request Labeler #12738: Pull request #14358 synchronize by okaris

29s

29s

CUDA: add bf16 and f32 support to cublas_mul_mat_batched CI #24085: Pull request #14361 opened by am17an

44m 27s am17an:add_bp16_fp32_to_cublas_batched

am17an:add_bp16_fp32_to_cublas_batched

44m 27s

CUDA: add bf16 and f32 support to cublas_mul_mat_batched Server #15481: Pull request #14361 opened by am17an

10m 58s am17an:add_bp16_fp32_to_cublas_batched

am17an:add_bp16_fp32_to_cublas_batched

10m 58s

CUDA: add bf16 and f32 support to cublas_mul_mat_batched EditorConfig Checker #27049: Pull request #14361 opened by am17an

16s am17an:add_bp16_fp32_to_cublas_batched

am17an:add_bp16_fp32_to_cublas_batched

16s

CUDA: add bf16 and f32 support to cublas_mul_mat_batched Pull Request Labeler #12737: Pull request #14361 opened by am17an

13s

13s