Commit fe6b72b
committed
Merge remote-tracking branch 'origin/master' into gabe-l-hart/thinking-model-disabled-agent-prefill
* origin/master: (84 commits)
CUDA: fastdiv, launch bounds for mmvq + q8_1 quant (ggml-org#15802)
tests : add --list-ops and --show-coverage options (ggml-org#15745)
gguf: gguf_writer refactor (ggml-org#15691)
kv-cache : fix SWA checks + disable cacheless iSWA (ggml-org#15811)
model-conversion : add --embeddings flag to modelcard.template [no ci] (ggml-org#15801)
chat : fixed crash when Hermes 2 <tool_call> had a newline before it (ggml-org#15639)
chat : nemotron thinking & toolcalling support (ggml-org#15676)
scripts : add Jinja tester PySide6 simple app (ggml-org#15756)
llama : add support for EmbeddingGemma 300m (ggml-org#15798)
metal : Add template specialization for mul_mm_id w/ ne20 == 10 (ggml-org#15799)
llama : set n_outputs to 1 to avoid 0 outputs mean-pooling (ggml-org#15791)
CANN: Refactor ND to NZ workspace to be per-device (ggml-org#15763)
server: add exceed_context_size_error type (ggml-org#15780)
Document the new max GPU layers default in help (ggml-org#15771)
ggml: add ops for WAN video model (cuda && cpu) (ggml-org#15669)
CANN: Fix precision issue on 310I DUO multi-devices (ggml-org#15784)
opencl: add hs=40 to FA (ggml-org#15758)
CANN: fix acl_rstd allocation size in ggml_cann_rms_norm (ggml-org#15760)
vulkan: fix mmv subgroup16 selection (ggml-org#15775)
vulkan: don't use std::string in load_shaders, to improve compile time (ggml-org#15724)
...File tree
142 files changed
+7178
-1767
lines changed- ci
- common
- docs
- backend
- examples
- convert-llama2c-to-ggml
- diffusion
- eval-callback
- model-conversion
- scripts
- causal
- embedding
- utils
- speculative
- ggml
- include
- src
- ggml-cann
- ggml-cpu
- arch/riscv
- kleidiai
- ggml-cuda
- ggml-metal
- ggml-opencl
- kernels
- ggml-sycl
- ggml-vulkan
- vulkan-shaders
- ggml-webgpu
- gguf-py/gguf
- include
- models/templates
- scripts
- jinja
- src
- tests
- tools
- batched-bench
- llama-bench
- server
- tests
- unit
- tts
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
142 files changed
+7178
-1767
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
25 | | - | |
| 25 | + | |
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
137 | 137 | | |
138 | 138 | | |
139 | 139 | | |
| 140 | + | |
140 | 141 | | |
141 | 142 | | |
142 | 143 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
386 | 386 | | |
387 | 387 | | |
388 | 388 | | |
389 | | - | |
390 | | - | |
391 | | - | |
392 | | - | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
393 | 393 | | |
394 | 394 | | |
395 | 395 | | |
| |||
520 | 520 | | |
521 | 521 | | |
522 | 522 | | |
523 | | - | |
524 | | - | |
| 523 | + | |
| 524 | + | |
525 | 525 | | |
526 | 526 | | |
527 | 527 | | |
| |||
651 | 651 | | |
652 | 652 | | |
653 | 653 | | |
654 | | - | |
655 | | - | |
656 | | - | |
657 | | - | |
| 654 | + | |
| 655 | + | |
| 656 | + | |
| 657 | + | |
658 | 658 | | |
659 | 659 | | |
660 | 660 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1545 | 1545 | | |
1546 | 1546 | | |
1547 | 1547 | | |
1548 | | - | |
1549 | | - | |
1550 | | - | |
1551 | | - | |
| 1548 | + | |
| 1549 | + | |
| 1550 | + | |
| 1551 | + | |
| 1552 | + | |
| 1553 | + | |
| 1554 | + | |
| 1555 | + | |
| 1556 | + | |
| 1557 | + | |
| 1558 | + | |
| 1559 | + | |
1552 | 1560 | | |
1553 | 1561 | | |
1554 | 1562 | | |
| |||
2458 | 2466 | | |
2459 | 2467 | | |
2460 | 2468 | | |
2461 | | - | |
| 2469 | + | |
2462 | 2470 | | |
2463 | 2471 | | |
2464 | 2472 | | |
| |||
2555 | 2563 | | |
2556 | 2564 | | |
2557 | 2565 | | |
2558 | | - | |
| 2566 | + | |
2559 | 2567 | | |
2560 | 2568 | | |
2561 | 2569 | | |
2562 | 2570 | | |
2563 | 2571 | | |
2564 | 2572 | | |
2565 | 2573 | | |
2566 | | - | |
| 2574 | + | |
2567 | 2575 | | |
2568 | 2576 | | |
2569 | 2577 | | |
| |||
2954 | 2962 | | |
2955 | 2963 | | |
2956 | 2964 | | |
2957 | | - | |
2958 | | - | |
2959 | | - | |
2960 | | - | |
2961 | | - | |
2962 | | - | |
2963 | | - | |
2964 | 2965 | | |
2965 | 2966 | | |
2966 | 2967 | | |
2967 | 2968 | | |
2968 | 2969 | | |
2969 | 2970 | | |
2970 | 2971 | | |
| 2972 | + | |
| 2973 | + | |
| 2974 | + | |
| 2975 | + | |
| 2976 | + | |
| 2977 | + | |
| 2978 | + | |
2971 | 2979 | | |
2972 | 2980 | | |
2973 | 2981 | | |
| |||
3459 | 3467 | | |
3460 | 3468 | | |
3461 | 3469 | | |
3462 | | - | |
3463 | | - | |
3464 | 3470 | | |
3465 | 3471 | | |
3466 | 3472 | | |
| |||
3475 | 3481 | | |
3476 | 3482 | | |
3477 | 3483 | | |
3478 | | - | |
3479 | | - | |
3480 | 3484 | | |
3481 | 3485 | | |
3482 | 3486 | | |
| |||
3491 | 3495 | | |
3492 | 3496 | | |
3493 | 3497 | | |
3494 | | - | |
3495 | | - | |
3496 | 3498 | | |
3497 | 3499 | | |
3498 | 3500 | | |
| |||
3508 | 3510 | | |
3509 | 3511 | | |
3510 | 3512 | | |
3511 | | - | |
3512 | 3513 | | |
3513 | | - | |
3514 | | - | |
3515 | 3514 | | |
3516 | 3515 | | |
3517 | 3516 | | |
| |||
3527 | 3526 | | |
3528 | 3527 | | |
3529 | 3528 | | |
3530 | | - | |
3531 | 3529 | | |
3532 | | - | |
3533 | | - | |
3534 | 3530 | | |
3535 | 3531 | | |
3536 | 3532 | | |
| |||
3545 | 3541 | | |
3546 | 3542 | | |
3547 | 3543 | | |
3548 | | - | |
3549 | | - | |
3550 | 3544 | | |
3551 | 3545 | | |
3552 | 3546 | | |
| |||
0 commit comments