Sync master with upstream release b6115 #198

jan-service-account · 2025-08-08T00:13:14Z

Updates dev branch with latest release (b6115) from ggml-org/llama.cpp

This commit addresses an issue with the convert_hf_to_gguf script which is currently failing with: ```console AttributeError: module 'torch' has no attribute 'uint64' ``` This occurred because safetensors expects torch.uint64 to be available in the public API, but PyTorch 2.2.x only provides limited support for unsigned types beyond uint8 it seems. The torch.uint64 dtype exists but is not exposed in the standard torch namespace (see pytorch/pytorch#58734). PyTorch 2.4.0 properly exposes torch.uint64 in the public API, resolving the compatibility issue with safetensors. This also required torchvision to updated to =0.19.0 for compatibility. Refs: https://huggingface.co/spaces/ggml-org/gguf-my-repo/discussions/186#68938de803e47d990aa087fb Refs: pytorch/pytorch#58734

* CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16

…-org#15094) Any available libraries are found and loaded dynamically at runtime.

…age metrics (ggml-org#15103)

* support internvl * support interns1 * resolve comments * put interns1 in tensor mapping * resolve comment * move tokenizer changes to sub class

* convert : support non-mxfp4 HF model * rm redundant check * disable debug check

danbev and others added 9 commits August 7, 2025 05:31

scripts: fix crash when --tool is not set (ggml-org#15133)

20638e4

CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (ggml-org#15131)

1d72c84

* CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16

ggml: Skip backend library linking code when GGML_BACKEND_DL=ON (ggml…

9a96389

…-org#15094) Any available libraries are found and loaded dynamically at runtime.

HIP: add cmake option to enable compiler output of kernel resource us…

7ad67ba

…age metrics (ggml-org#15103)

llama : Support intern-s1 (ggml-org#14875)

99acbc9

* support internvl * support interns1 * resolve comments * put interns1 in tensor mapping * resolve comment * move tokenizer changes to sub class

vulkan: Add env var to disable host visible vidmem (ggml-org#15109)

a0552c8

vulkan: support fattn sinks (ggml-org#15126)

c4f5356

convert : support non-mxfp4 HF model (ggml-org#15153)

50aa938

* convert : support non-mxfp4 HF model * rm redundant check * disable debug check

jan-service-account merged commit f68cb3c into dev Aug 8, 2025
17 checks passed

jan-service-account deleted the update-dev-from-master-2025-08-08-00-13 branch August 8, 2025 00:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sync master with upstream release b6115 #198

Sync master with upstream release b6115 #198

Uh oh!

jan-service-account commented Aug 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants

Sync master with upstream release b6115 #198

Sync master with upstream release b6115 #198

Uh oh!

Conversation

jan-service-account commented Aug 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

9 participants