Skip to content

Commit 7ff3b06

Browse files
author
Bodhi Hu
committed
update
1 parent 117f7dd commit 7ff3b06

File tree

2 files changed

+3
-4
lines changed

2 files changed

+3
-4
lines changed

docs/build.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -205,7 +205,8 @@ This provides GPU acceleration using the MUSA cores of your Moore Threads MTT GP
205205
cmake -B build -DGGML_MUSA=ON
206206
cmake --build build --config Release
207207
```
208-
- For static build:
208+
209+
For static build:
209210

210211
```bash
211212
cmake -B build -DGGML_MUSA=ON \

ggml/src/ggml-cuda/ggml-cuda.cu

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -262,9 +262,7 @@ static ggml_cuda_device_info ggml_cuda_init() {
262262
id, prop.name, prop.gcnArchName, info.devices[id].cc & 0xffff,
263263
device_vmm ? "yes" : "no", prop.warpSize);
264264
#elif defined(GGML_USE_MUSA)
265-
// TODO: MUSA will reserve some shared mem, and 24B should be enough,
266-
// we can remove the **24** in the future when MUSA no longer reserves shared mem.
267-
info.devices[id].smpbo = prop.sharedMemPerBlockOptin - 24;
265+
info.devices[id].smpbo = prop.sharedMemPerBlockOptin;
268266
info.devices[id].cc = 100*prop.major + 10*prop.minor;
269267
#else
270268
info.devices[id].smpbo = prop.sharedMemPerBlockOptin;

0 commit comments

Comments
 (0)