Skip to content

Commit dc522b0

Browse files
authored
Update README.md
ND4J's reduced sum is actually slower than Compute.scala on large arrays
1 parent 7405139 commit dc522b0

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

README.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -340,7 +340,6 @@ Some information can be found in the benchmark result:
340340
* Apparently, Compute.scala supports both NVIDIA GPU and AMD GPU, while ND4J does not support AMD GPU.
341341
* Compute.scala is faster than ND4J on large arrays or complex expressions.
342342
* ND4J is faster than Compute.scala when performing one simple primary operation on very small arrays.
343-
* ND4J's reduced sum is faster than Compute.scala.
344343
* ND4J's `permute` and `broadcast` are extremely slow, causing very low score in the convolution benchmark (unlike this benchmark, Deeplearning4j's convolution operation internally uses some undocumented variant of `permute` and `broadcast` in ND4J, which are not extremely slow).
345344

346345
## Future work
@@ -351,4 +350,4 @@ Now this project is only a minimum viable product. Many important features are s
351350
* Add more OpenCL math functions ([#101](https://github.com/ThoughtWorksInc/Compute.scala/issues/101)).
352351
* Further optimization of performance ([#62, #103](https://github.com/ThoughtWorksInc/Compute.scala/labels/performance)).
353352

354-
Contribution is welcome. Check [good first issues](https://github.com/ThoughtWorksInc/Compute.scala/labels/good%20first%20issue) to start hacking.
353+
Contribution is welcome. Check [good first issues](https://github.com/ThoughtWorksInc/Compute.scala/labels/good%20first%20issue) to start hacking.

0 commit comments

Comments
 (0)