Commit bbac6a2
ggml: fix cuda kernel launch configuration for k_compute_batched_ptrs to support large batch (ggml-org#16744)
* fix k_compute_batched_ptrs
* add backend ops test
* Update ggml/src/ggml-cuda/ggml-cuda.cu
Co-authored-by: Johannes Gäßler <[email protected]>
* reduce the batch size
---------
Co-authored-by: Johannes Gäßler <[email protected]>1 parent 73a48c9 commit bbac6a2
2 files changed
+12
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1957 | 1957 | | |
1958 | 1958 | | |
1959 | 1959 | | |
1960 | | - | |
1961 | | - | |
| 1960 | + | |
| 1961 | + | |
| 1962 | + | |
| 1963 | + | |
| 1964 | + | |
| 1965 | + | |
| 1966 | + | |
| 1967 | + | |
| 1968 | + | |
1962 | 1969 | | |
1963 | 1970 | | |
1964 | 1971 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
6697 | 6697 | | |
6698 | 6698 | | |
6699 | 6699 | | |
| 6700 | + | |
| 6701 | + | |
| 6702 | + | |
6700 | 6703 | | |
6701 | 6704 | | |
6702 | 6705 | | |
| |||
0 commit comments