File tree Expand file tree Collapse file tree 2 files changed +4
-4
lines changed
content/learning-paths/laptops-and-desktops/dgx_spark_llamacpp Expand file tree Collapse file tree 2 files changed +4
-4
lines changed Original file line number Diff line number Diff line change @@ -156,9 +156,9 @@ ALL ALL 2.52 8.37 0.00 0.00 100.00 26566
156156```
157157
158158Here is an interpretation of the values:
159- - NEON (≈ 7–15 %) : Active SIMD integer and floating-point operations
160- - FPARMv8 : Scalar FP operations such as activation and normalization
161- - SVE/SVE2 = 0 : The kernel does not issue SVE instructions
159+ - NEON: 7–15% for SIMD integer and floating-point operations
160+ - FPARMv8: 2-5% for scalar FP operations such as activation and normalization
161+ - SVE/SVE2: 0%, the kernel does not issue SVE instructions
162162
163163This confirms that the Grace CPU performs quantized inference primarily using NEON.
164164
Original file line number Diff line number Diff line change 11---
2- title : Unlock quantized LLM performance on Arm-based NVIDIA DGX Spark using Armv9 SIMD instructions
2+ title : Unlock quantized LLM performance on Arm-based NVIDIA DGX Spark
33
44minutes_to_complete : 60
55
You can’t perform that action at this time.
0 commit comments