Sync master with upstream release b5548 #110

jan-service-account · 2025-05-31T00:08:50Z

Updates dev branch with latest release (b5548) from ggml-org/llama.cpp

* add distilbert * small fixes * add note for LLM_ARCH_DISTIL_BERT * Use MODEL_ARCH.BERT for DistilBert --------- Co-authored-by: dinhhuy <[email protected]>

…-org#13847) * convert : allow partial update to the chkhsh pre-tokenizer list * code style * update tokenizer out * rm inp/out files for models not having gguf * fixed hash for glm * skip nomic-bert-moe test * Update convert_hf_to_gguf_update.py * fix minerva-7b hash * rm redundant import

* sync : vendor ggml-ci * cont : fix httplib version ggml-ci * cont : fix lint * cont : fix lint * vendor : move to common folder /vendor ggml-ci * cont : fix lint * cont : move httplib to /vendor + use json_fwd.hpp ggml-ci * cont : fix server build ggml-ci * cont : add missing headers ggml-ci * cont : header clean-up ggml-ci

* SYCL: Add mrope kernel * feat: Optimize rope operations with vectorization Uses `sycl::vec` to load and store two elements at a time, significantly improving performance in `rope_norm`, `rope_neox`, and `rope_multi`. This reduces the number of memory accesses and leverages SIMD instructions for faster execution. * Use ceil_div

…3927) ggml-ci

…ml-org#13922)

zkh2016 and others added 10 commits May 30, 2025 10:31

llama : use llm_build_granite for minicpm (ggml-org#13911)

2c90da4

llama : add support for DistilBert (ggml-org#13907)

291f2b6

* add distilbert * small fixes * add note for LLM_ARCH_DISTIL_BERT * Use MODEL_ARCH.BERT for DistilBert --------- Co-authored-by: dinhhuy <[email protected]>

convert : fix rwkv bos/eos token (ggml-org#13844)

db38704

cuda : prevent using split buffers with 3d/4d matrices (ggml-org#13919)

df0c0c7

parallel : increase the variability of the prompt lengths (ggml-org#1…

dd665cc

…3927) ggml-ci

sched : avoid changing cur_copy when a graph is already allocated (gg…

b47ab7b

…ml-org#13922)

CUDA: fix typo in FlashAttention code (ggml-org#13926)

e562eec

jan-service-account merged commit 1021f2f into dev May 31, 2025
15 checks passed

jan-service-account deleted the update-dev-from-master-2025-05-31-00-08 branch May 31, 2025 00:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sync master with upstream release b5548 #110

Sync master with upstream release b5548 #110

Uh oh!

jan-service-account commented May 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Sync master with upstream release b5548 #110

Sync master with upstream release b5548 #110

Uh oh!

Conversation

jan-service-account commented May 31, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants