Skip to content

Conversation

@jan-service-account
Copy link

Updates dev branch with latest release (b6558) from ggml-org/llama.cpp

haiyuewa and others added 8 commits September 22, 2025 19:57
)

Disable 'performance-enum-size' checking:

Enum 'llama_token_type' uses a larger base type ('unsigned int', size: 4 bytes)
than necessary for its value set, consider using 'std::uint8_t' (1 byte) as the
base type to reduce its size.
…n) (ggml-org#16177)

This is a configuration of the hparams in the GraniteHybrid architecture
that devolves to the Granite (or GraniteMoe) architecture (ie Granite 3.x).
It may be used for some models in the Granite 4 family with the
GraniteHybrid architecture acting as a superset arch. Rather than support
it directly in the c++ graph, we simply coerce the architecture flag back
to the correct "granite" or "granitemoe" architecture.

Branch: gabe-l-hart/GraniteNonHybridConversion

Signed-off-by: Gabe Goodhart <[email protected]>

Co-authored-by: Sigbjørn Skjæret <[email protected]>
* devops: add s390x dockerfile

Signed-off-by: Aaron Teo <[email protected]>

* devops: add missing ninja

Signed-off-by: Aaron Teo <[email protected]>

* devops: move s390x docker into cpu docker

Signed-off-by: Aaron Teo <[email protected]>

* devops: rework s390x docker

Signed-off-by: Aaron Teo <[email protected]>

* devops: copy more tools

Signed-off-by: Aaron Teo <[email protected]>

* devops: add server build step

Signed-off-by: Aaron Teo <[email protected]>

* devops: remove apt clean steps as distroless misses it

Signed-off-by: Aaron Teo <[email protected]>

* devops: remove apt commands from distroless

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix shared libs in distroless

Signed-off-by: Aaron Teo <[email protected]>

* devops: use correct libs path

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix shared libs

Signed-off-by: Aaron Teo <[email protected]>

* devops: add collector stage

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix missing stage ref

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix permission issue

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix unknown model loading failures

Signed-off-by: Aaron Teo <[email protected]>

* devops: attempt at fixing model loading failure

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix missing ggml shared object

failure to load model

Signed-off-by: Aaron Teo <[email protected]>

* devops: remove move shared objects

Signed-off-by: Aaron Teo <[email protected]>

* devops: move libggml-cpu and blas into bin

Signed-off-by: Aaron Teo <[email protected]>

* devops: finalise hardened server stage

Signed-off-by: Aaron Teo <[email protected]>

* devops: add cli target

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix typos

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix missing shared libraries in base

Signed-off-by: Aaron Teo <[email protected]>

* devops: update debian target

Signed-off-by: Aaron Teo <[email protected]>

* devops: formalise llama.cpp loc

Signed-off-by: Aaron Teo <[email protected]>

* Revert "devops: formalise llama.cpp loc"

This reverts commit 0a7664a.

Signed-off-by: Aaron Teo <[email protected]>

* devops: formalise llama.cpp loc

Signed-off-by: Aaron Teo <[email protected]>
(cherry picked from commit 0a7664a)
Signed-off-by: Aaron Teo <[email protected]>

* devops: attempt at fixing missing dir

Signed-off-by: Aaron Teo <[email protected]>

* devops: attempt at making it cache the build

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix copying process

Signed-off-by: Aaron Teo <[email protected]>

* devops: make build dir an argument

Signed-off-by: Aaron Teo <[email protected]>

* Revert "devops: make build dir an argument"

This reverts commit 4386989.

Signed-off-by: Aaron Teo <[email protected]>

* devops: add build stage for gguf-py

Signed-off-by: Aaron Teo <[email protected]>

* devops: move gguf-py installation into build stage

Signed-off-by: Aaron Teo <[email protected]>

* devops: break system packages?

Signed-off-by: Aaron Teo <[email protected]>

* devops: add rust compiler installer

Signed-off-by: Aaron Teo <[email protected]>

* devops: fix rustc not found

Signed-off-by: Aaron Teo <[email protected]>

* devops: remove cache mount to allow rustc to persist

Signed-off-by: Aaron Teo <[email protected]>

* devops: move rustc installation to another layer

Signed-off-by: Aaron Teo <[email protected]>

* devops: move gguf-py installation to full stage, fix copying

Signed-off-by: Aaron Teo <[email protected]>

* devops: remove rustc installation in build

Signed-off-by: Aaron Teo <[email protected]>

* devops: disable full target for now

Signed-off-by: Aaron Teo <[email protected]>

* devops: attempting static build

Signed-off-by: Aaron Teo <[email protected]>

* devops: merge s390x dockerfile into cpu for now

Signed-off-by: Aaron Teo <[email protected]>

* devops: switch to gcc image for build step

Signed-off-by: Aaron Teo <[email protected]>

* devops: remove build essentials

Signed-off-by: Aaron Teo <[email protected]>

* devops: install openblas into base target

Signed-off-by: Aaron Teo <[email protected]>

* devops: go back to s390x dockerfile

Signed-off-by: Aaron Teo <[email protected]>

* devops: remove libggml and libblas

Signed-off-by: Aaron Teo <[email protected]>

* devops: add full target

Signed-off-by: Aaron Teo <[email protected]>

* devops: add break system packages

Signed-off-by: Aaron Teo <[email protected]>

* devops: add libjpeg

Signed-off-by: Aaron Teo <[email protected]>

* devops: add missing cmake dep

Signed-off-by: Aaron Teo <[email protected]>

* devops: finalise docker images for s390x

Signed-off-by: Aaron Teo <[email protected]>

* devops: add custom openblas patch

Signed-off-by: Aaron Teo <[email protected]>

* devops: use libopenblas-dev instead of libopenblas-openmp-dev

Signed-off-by: Aaron Teo <[email protected]>

* devops: add s390x docker build

Signed-off-by: Aaron Teo <[email protected]>

---------

Signed-off-by: Aaron Teo <[email protected]>
…g#16190)

This commit adds examples/model-conversion/ to the CODEOWNERS file and
assigns myself (@danbev) as the code owner for this directory.
* zdnn: initial matmul refactor

Signed-off-by: Aaron Teo <[email protected]>

* ggml-zdnn: rm static from funcs

Signed-off-by: Aaron Teo <[email protected]>

* ggml-zdnn: update ggml-zdnn.h

Signed-off-by: Aaron Teo <[email protected]>

* ggml-zdnn: change header files to hpp

Signed-off-by: Aaron Teo <[email protected]>

* ggml-zdnn: switch to common.hpp

Signed-off-by: Aaron Teo <[email protected]>

* ggml-zdnn: move mulmat forward around

Signed-off-by: Aaron Teo <[email protected]>

* ggml-zdnn: rm inline from utils

Signed-off-by: Aaron Teo <[email protected]>

* ggml-zdnn: code cleanup

Signed-off-by: Aaron Teo <[email protected]>

* docs: add zDNN docs

Signed-off-by: Aaron Teo <[email protected]>

---------

Signed-off-by: Aaron Teo <[email protected]>
…l-org#15928)

* fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl

* change initialization to true
@jan-service-account jan-service-account merged commit 921ab0b into dev Sep 24, 2025
3 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-09-24-00-33 branch September 24, 2025 00:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

8 participants