Commit 3845706
[fix](ggml-cuda): ensure min 1 block per SM
Some kernel configurations can produce zero occupancy on certain
GPUs (example: RX 6700XT). This adds a safeguard to ensure at least
one block is launched, preventing a floating point exception.
Co-authored-by: Attila Dusnoki <[email protected]>

1 parent ababae7 commit 3845706
1 file changed: +1 -0 lines changed

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 895 | 895 | |
| 896 | 896 | |
| 897 | 897 | |
| | 898 | + |
| 898 | 899 | |
| 899 | 900 | |
| 900 | 901 | |