- - By default, NNPA is disabled by default. To enable it:
+ - By default, NNPA is enabled when available. To disable it (not recommended):
```bash
cmake -S . -B build \
-DCMAKE_BUILD_TYPE=Release \
-DGGML_BLAS=ON \
-DGGML_BLAS_VENDOR=OpenBLAS \
- -DGGML_NNPA=ON
+ -DGGML_NNPA=OFF
cmake --build build --config Release -j $(nproc)
```
@@ -166,7 +166,7 @@ Only available in IBM z15/LinuxONE 3 or later system with the `-DGGML_VXE=ON` (t
### 2. NNPA Vector Intrinsics Acceleration
- Only available in IBM z16/LinuxONE 4 or later system with the `-DGGML_NNPA=ON` (turned off by default) compile flag. No hardware acceleration is possible with llama.cpp with older systems, such as IBM z15/arch13. In such systems, the APIs can still run but will use a scalar implementation.
+ Only available in IBM z16/LinuxONE 4 or later system with the `-DGGML_NNPA=ON` (turned on when available) compile flag. No hardware acceleration is possible with llama.cpp with older systems, such as IBM z15/arch13. In such systems, the APIs can still run but will use a scalar implementation.
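
Because acceleration depends on the hardware generation, it can help to check what the running system reports before choosing a value for `-DGGML_NNPA`. The following is a minimal sketch, not part of this change: it assumes a Linux on IBM Z system whose `/proc/cpuinfo` has a `features` line that lists `nnpa` as a separate word when the facility is present. The function name `has_nnpa` and its optional file argument are hypothetical helpers introduced for illustration.

```bash
#!/usr/bin/env bash

# Return success if the CPU feature list advertises NNPA.
# Assumption: the kernel exposes the facility as the word "nnpa" on the
# "features" line of /proc/cpuinfo. The optional argument (hypothetical)
# lets you point the check at a saved copy of cpuinfo instead.
has_nnpa() {
    local cpuinfo="${1:-/proc/cpuinfo}"
    grep -E '^features' "$cpuinfo" 2>/dev/null | grep -qw 'nnpa'
}

if has_nnpa; then
    echo "NNPA reported: hardware acceleration should be usable"
else
    echo "NNPA not reported: expect the scalar implementation"
fi
```

On systems where the flag is absent, the build is still expected to run and fall back to the scalar path described above.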
### 3. zDNN Accelerator (WIP)
@@ -230,10 +230,6 @@ IBM VXE/VXE2 SIMD acceleration depends on the BLAS implementation. It is strongl
Answer: We are aware of this as detailed in [this issue](https://github.com/ggml-org/llama.cpp/issues/14877). Please either try reducing the number of threads, or disable the compile option using `-DGGML_NNPA=OFF`.
-
## Getting Help on IBM Z & LinuxONE
1. **Bugs, Feature Requests**
@@ -292,4 +288,4 @@ IBM VXE/VXE2 SIMD acceleration depends on the BLAS implementation. It is strongl
- 🚫 - acceleration unavailable, will still run using scalar implementation
- ❓ - acceleration unknown, please contribute if you can test it yourself