Commit e8831e0
latest from upstream (#4)
* cuda : remove legacy copy-op pointer indirection code (ggml-org#16485)
* remove legacy copy-op pointer indirection code
* further removal of copy-op indirection code
* renamed check_node_graph_compatibility_and_refresh_copy_ops function
* CUDA: add fp kernel for larger batch size MoE (ggml-org#16512)
* CUDA: kernel for larger batch sizes for MoE
* WIP
* WIP
* WIP
* WIP
* WIP
* WIP
* fixup
* tests
* Move mmq_ids_helper to mmid
* cleanup
* Remove redundant checks
* CUDA: use fastdiv + ggml_cuda_mad for mmvf (ggml-org#16557)
* CUDA: use fastdiv + ggml_cuda_mad for mmvf
* use bf16 directly + fix formatting
* Add exception for HIP code
* CUDA: enable FA for FP32 KV cache (ggml-org#16546)
* vulkan: Improve build time for MSVC (ggml-org#16545)
Enable CMP0147 so custom build steps (invoking vulkan-shader-gen) are run in parallel.
Enable /MP so source files are compiled in parallel.
* vulkan: Support FA with K/V in F32 (ggml-org#16543)
* CUDA + openCL: fix bug in accessing rms_norm->src while doing fusion (ggml-org#16577)
* vulkan: Add ACC_TYPE_VEC2 implementation (ggml-org#16203)
Signed-off-by: Stefan Savic <[email protected]>
Co-authored-by: Stefan Savic <[email protected]>
* metal : avoid using Metal's gpuAddress property (ggml-org#16576)
* metal : avoid using Metal's gpuAddress property
* metal : fix rope kernels buffer check
---------
Signed-off-by: Stefan Savic <[email protected]>
Co-authored-by: Anav Prasad <[email protected]>
Co-authored-by: Aman Gupta <[email protected]>
Co-authored-by: Johannes Gäßler <[email protected]>
Co-authored-by: Jeff Bolz <[email protected]>
Co-authored-by: SavicStefan <[email protected]>
Co-authored-by: Stefan Savic <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>1 parent 36955c3 commit e8831e0
File tree
24 files changed
+761
-496
lines changed- ggml/src
- ggml-cuda
- ggml-metal
- ggml-opencl
- ggml-vulkan
- vulkan-shaders
- tests
24 files changed
+761
-496
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
944 | 944 | | |
945 | 945 | | |
946 | 946 | | |
947 | | - | |
948 | | - | |
949 | | - | |
950 | | - | |
951 | | - | |
952 | | - | |
953 | | - | |
954 | 947 | | |
955 | 948 | | |
956 | 949 | | |
| |||
Large diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2 | 2 | | |
3 | 3 | | |
4 | 4 | | |
5 | | - | |
| 5 | + | |
6 | 6 | | |
7 | 7 | | |
8 | | - | |
9 | | - | |
10 | | - | |
11 | | - | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
516 | 516 | | |
517 | 517 | | |
518 | 518 | | |
519 | | - | |
520 | | - | |
| 519 | + | |
| 520 | + | |
521 | 521 | | |
522 | 522 | | |
523 | 523 | | |
| |||
526 | 526 | | |
527 | 527 | | |
528 | 528 | | |
529 | | - | |
530 | | - | |
531 | | - | |
532 | | - | |
533 | | - | |
534 | 529 | | |
535 | 530 | | |
536 | 531 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
116 | 116 | | |
117 | 117 | | |
118 | 118 | | |
119 | | - | |
120 | | - | |
121 | | - | |
122 | | - | |
123 | | - | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
124 | 128 | | |
125 | 129 | | |
126 | 130 | | |
| |||
247 | 251 | | |
248 | 252 | | |
249 | 253 | | |
| 254 | + | |
250 | 255 | | |
251 | 256 | | |
252 | 257 | | |
| |||
272 | 277 | | |
273 | 278 | | |
274 | 279 | | |
275 | | - | |
| 280 | + | |
276 | 281 | | |
277 | 282 | | |
278 | 283 | | |
| |||
305 | 310 | | |
306 | 311 | | |
307 | 312 | | |
308 | | - | |
| 313 | + | |
309 | 314 | | |
310 | 315 | | |
311 | 316 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2633 | 2633 | | |
2634 | 2634 | | |
2635 | 2635 | | |
2636 | | - | |
| 2636 | + | |
2637 | 2637 | | |
2638 | 2638 | | |
2639 | 2639 | | |
2640 | | - | |
2641 | 2640 | | |
2642 | 2641 | | |
2643 | 2642 | | |
| |||
2688 | 2687 | | |
2689 | 2688 | | |
2690 | 2689 | | |
2691 | | - | |
2692 | | - | |
2693 | | - | |
2694 | | - | |
2695 | | - | |
2696 | | - | |
2697 | | - | |
2698 | | - | |
2699 | | - | |
2700 | | - | |
2701 | | - | |
2702 | | - | |
2703 | | - | |
2704 | | - | |
2705 | | - | |
2706 | | - | |
2707 | 2690 | | |
2708 | 2691 | | |
2709 | 2692 | | |
2710 | 2693 | | |
2711 | 2694 | | |
2712 | | - | |
2713 | | - | |
2714 | | - | |
2715 | | - | |
2716 | | - | |
2717 | | - | |
2718 | 2695 | | |
2719 | 2696 | | |
2720 | 2697 | | |
| |||
2733 | 2710 | | |
2734 | 2711 | | |
2735 | 2712 | | |
2736 | | - | |
2737 | 2713 | | |
2738 | 2714 | | |
2739 | 2715 | | |
| |||
2754 | 2730 | | |
2755 | 2731 | | |
2756 | 2732 | | |
2757 | | - | |
2758 | 2733 | | |
2759 | 2734 | | |
2760 | 2735 | | |
| |||
2901 | 2876 | | |
2902 | 2877 | | |
2903 | 2878 | | |
2904 | | - | |
| 2879 | + | |
2905 | 2880 | | |
2906 | 2881 | | |
2907 | 2882 | | |
| |||
3120 | 3095 | | |
3121 | 3096 | | |
3122 | 3097 | | |
3123 | | - | |
| 3098 | + | |
3124 | 3099 | | |
3125 | 3100 | | |
3126 | 3101 | | |
| |||
3147 | 3122 | | |
3148 | 3123 | | |
3149 | 3124 | | |
3150 | | - | |
3151 | | - | |
3152 | | - | |
3153 | | - | |
3154 | 3125 | | |
3155 | 3126 | | |
3156 | 3127 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
| 3 | + | |
| 4 | + | |
3 | 5 | | |
4 | 6 | | |
5 | 7 | | |
| |||
37 | 39 | | |
38 | 40 | | |
39 | 41 | | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
40 | 48 | | |
41 | 49 | | |
42 | 50 | | |
| |||
54 | 62 | | |
55 | 63 | | |
56 | 64 | | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
57 | 92 | | |
58 | 93 | | |
59 | 94 | | |
60 | 95 | | |
61 | 96 | | |
62 | 97 | | |
63 | 98 | | |
64 | | - | |
| 99 | + | |
65 | 100 | | |
66 | 101 | | |
67 | 102 | | |
68 | 103 | | |
69 | 104 | | |
70 | 105 | | |
71 | 106 | | |
72 | | - | |
| 107 | + | |
73 | 108 | | |
74 | 109 | | |
75 | 110 | | |
76 | 111 | | |
77 | 112 | | |
78 | 113 | | |
79 | 114 | | |
80 | | - | |
| 115 | + | |
81 | 116 | | |
82 | 117 | | |
83 | 118 | | |
| |||
98 | 133 | | |
99 | 134 | | |
100 | 135 | | |
101 | | - | |
| 136 | + | |
102 | 137 | | |
103 | | - | |
104 | | - | |
| 138 | + | |
105 | 139 | | |
106 | 140 | | |
107 | 141 | | |
| |||
0 commit comments