Commit 851553e
cuda: add SET operation support (ggml-org#16804)
* feat(cuda): add GGML_OP_SET support
Implement CUDA kernel for SET operation with f32 support.
All tests passing (14598/14598).
* cuda(set): add I32 support; keep F32
* refactor(cuda): use ggml_cuda_cpy to unify SET operator logic and remove code duplication
* Update ggml/src/ggml-cuda/ggml-cuda.cu
Co-authored-by: Sigbjørn Skjæret <[email protected]>
* Update ggml/src/ggml-cuda/set.cu
Co-authored-by: Sigbjørn Skjæret <[email protected]>
---------
Co-authored-by: Sigbjørn Skjæret <[email protected]>1 parent 85a7d86 commit 851553e
3 files changed
+57
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
50 | 50 | | |
51 | 51 | | |
52 | 52 | | |
| 53 | + | |
53 | 54 | | |
54 | 55 | | |
55 | 56 | | |
| |||
2416 | 2417 | | |
2417 | 2418 | | |
2418 | 2419 | | |
| 2420 | + | |
| 2421 | + | |
| 2422 | + | |
2419 | 2423 | | |
2420 | 2424 | | |
2421 | 2425 | | |
| |||
3842 | 3846 | | |
3843 | 3847 | | |
3844 | 3848 | | |
| 3849 | + | |
| 3850 | + | |
| 3851 | + | |
| 3852 | + | |
| 3853 | + | |
| 3854 | + | |
| 3855 | + | |
3845 | 3856 | | |
3846 | 3857 | | |
3847 | 3858 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
0 commit comments