
Conversation

@jan-service-account

Updates dev branch with latest release (b5973) from ggml-org/llama.cpp

csabakecskemeti and others added 10 commits July 22, 2025 19:29
* Set weight format to NZ for 310P

* Remove quant weight format to NZ

* Clean up code

* Fix

* Make the conditions for converting weights to NZ format consistent

* Clean up code
…org#14675)

* Update llama-memory-recurrent.cpp

Handle saving/loading null layers in recurrent memory

* Fixed styling issues and updated comments

* Fix styling issue

Co-authored-by: Sigbjørn Skjæret <[email protected]>

---------

Co-authored-by: Sigbjørn Skjæret <[email protected]>
* CUDA: fix quantized KV cache + multiple sequences

* Update ggml/src/ggml-cuda/fattn-common.cuh

Co-authored-by: Georgi Gerganov <[email protected]>

---------

Co-authored-by: Georgi Gerganov <[email protected]>
jan-service-account merged commit fa1602d into dev on July 24, 2025
13 checks passed
jan-service-account deleted the update-dev-from-master-2025-07-24-00-12 branch on July 24, 2025 at 00:25


10 participants