Commit 7b34a67
[SYSTEMDS-3896] Leverage SIMD Vector API for Counting NNZ
This patch leverages the new Vector API for the core primitive of
counting the number of non-zeros (which is still single-threaded
because it is usually done for chunks as part of multi-threaded tasks).
For single-threaded computeNnz on an 8GB dense matrix after JIT
compilation, this patch improved performance from 1100ms to 850ms.

1 parent ca8d209 commit 7b34a67
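
The diff body is not shown here, so the following is only a minimal sketch of how a non-zero count could be written with the Vector API, not the code from this commit. The class name `NnzCount` and the method `countNnz` are hypothetical. It assumes a JDK that ships the incubating `jdk.incubator.vector` module, which must be enabled with `--add-modules jdk.incubator.vector` at both compile time and run time.

```java
import jdk.incubator.vector.DoubleVector;
import jdk.incubator.vector.VectorOperators;
import jdk.incubator.vector.VectorSpecies;

// Hypothetical helper for illustration; not the SystemDS implementation.
public class NnzCount {
    // Widest double vector shape supported by the current hardware.
    private static final VectorSpecies<Double> SPECIES = DoubleVector.SPECIES_PREFERRED;

    /** Counts the non-zero values in a[off, off+len). */
    public static long countNnz(double[] a, int off, int len) {
        final int end = off + len;
        // Largest prefix of the range that is a multiple of the vector length.
        final int upper = off + SPECIES.loopBound(len);
        long nnz = 0;
        int i = off;
        // Vectorized main loop: compare a full vector against zero
        // and count the lanes where the value differs from zero.
        for (; i < upper; i += SPECIES.length()) {
            DoubleVector v = DoubleVector.fromArray(SPECIES, a, i);
            nnz += v.compare(VectorOperators.NE, 0.0).trueCount();
        }
        // Scalar tail for the remaining elements.
        for (; i < end; i++)
            nnz += (a[i] != 0) ? 1 : 0;
        return nnz;
    }

    public static void main(String[] args) {
        double[] a = {0, 1.5, 0, -2, 0, 0, 3, 4, 0, 5};
        System.out.println(countNnz(a, 0, a.length));
    }
}
```

The offset and length parameters mirror the chunk-wise usage the commit message describes: each task of a multi-threaded operation would call the single-threaded primitive on its own range and the partial counts would be summed afterwards.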
1 file changed, 12 additions and 17 deletions, in src/main/java/org/apache/sysds/runtime/util
Diff hunks (line content not shown):

| Hunk | Original file lines | New file lines | Lines added | Lines removed |
|---|---|---|---|---|
| 1 | 58-66 | 58-72 | 6 | 0 |
| 2 | 876-900 | 882-895 | 6 | 17 |