Sync master with upstream release b6276 #218

jan-service-account · 2025-08-26T00:11:48Z

Updates dev branch with latest release (b6276) from ggml-org/llama.cpp

This commit removes the content from the Makefile and updates the current deprecation message to information that `make` has been replaced by CMake instead. The message when `make` is invoked will now be the following: ```console $ make Makefile:6: *** Build system changed: The Makefile build has been replaced by CMake. For build instructions see: https://github.com/ggml-org/llama.cpp/blob/master/docs/build.md . Stop. ``` The motivation for this is that many, if not all targets fail to build now, after changes to the system, and `make` has also been deprected for some time now.

Signed-off-by: noemotiovon <[email protected]>

* support interns1-mini * fix comment * update

ggml-ci

Signed-off-by: Weizhao Ouyang <[email protected]>

…5562) * batched-bench : fix unified KV cache handling + pp timing * cont : run dummy token only with split KV cache

…ml-org#15557) * model-conversion: add model card template for embeddings [no ci] This commit adds a separate model card template (model repository README.md template) for embedding models. The motivation for this is that there server command for the embedding model is a little different and some addition information can be useful in the model card for embedding models which might not be directly relevant for causal models. * squash! model-conversion: add model card template for embeddings [no ci] Fix pyright lint error. * remove --pooling override and clarify embd_normalize usage

…5564) This commit explicitly sets the pooling type to 'none' in the logits.cpp to support models that have a pooling type specified. The motivation for this is that some models may have a pooling type set in the model file (.gguf file) and for this specific case where we only want to extract logits, we need to ensure that no pooling is used to so that we are comparing raw logits and not pooled embeddings.

* CUDA: MoE helper in device code, better tile sizes * reduce superfluous CUDA blocks

This avoids backend-dependent behavior for argmax that leads to intermittent failures.

…gml-org#15565)

danbev and others added 13 commits August 26, 2025 08:44

CANN: ROPE cache sin/cos repeat (ggml-org#15501)

8d3ca00

Signed-off-by: noemotiovon <[email protected]>

convert : support interns1-mini (ggml-org#15412)

9f8ee91

* support interns1-mini * fix comment * update

metal : add FA kernels for HS=40 (ggml-org#15559)

906fa36

ggml-ci

convert : update Ernie 4.5 dense architecture name (ggml-org#15555)

9369e7b

Signed-off-by: Weizhao Ouyang <[email protected]>

batched-bench : fix unified KV cache handling + pp timing (ggml-org#1…

1289ec2

…5562) * batched-bench : fix unified KV cache handling + pp timing * cont : run dummy token only with split KV cache

CUDA: MoE helper in device code, better tile sizes (ggml-org#15525)

b774aa3

* CUDA: MoE helper in device code, better tile sizes * reduce superfluous CUDA blocks

metal: fix regression when no metal devices are present (ggml-org#15531)

60f66bf

tests: Generate unique input values for count_equal (ggml-org#15487)

e53771b

This avoids backend-dependent behavior for argmax that leads to intermittent failures.

vulkan: fix min subgroup 16 condition for mmid subgroup optimization (g…

9e92649

…gml-org#15565)

opencl: fix support ops condition for rms_norm (ggml-org#15560)

4bd0e50

Minh141120 force-pushed the update-dev-from-master-2025-08-26-00-11 branch from f7207b0 to 4bd0e50 Compare August 26, 2025 01:46

Minh141120 closed this Aug 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sync master with upstream release b6276 #218

Sync master with upstream release b6276 #218

Uh oh!

jan-service-account commented Aug 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants

Sync master with upstream release b6276 #218

Sync master with upstream release b6276 #218

Uh oh!

Conversation

jan-service-account commented Aug 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants