I'm really happy with the Ginkgo solver, thank you very much. Performance is good on both CPU and GPU (RTX 5080). I was curious about FP32 performance, so I ran Ginkgo with both double and float. The results and iteration counts differ slightly, so I assume my implementation is right. Interestingly, I see almost no performance difference between the two on either CPU (Apple Silicon) or CUDA (RTX 5080). I'm happy with the performance I have; I'm just curious whether this behavior is expected or whether I'm doing something wrong.
Replies: 1 comment 11 replies
Which performance numbers are you comparing: CPU FP32 vs. CPU FP64, or CPU FP32 vs. CUDA FP64? Or some other variant?
To use float, you need to use `gko::preconditioner::Jacobi<float>`, the solver `gko::solver::Cg<float>`, and also the criterion, for example `ResidualNorm<float>`.
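To illustrate the point about keeping one value type throughout, here is a minimal sketch in plain C++. It is a hand-rolled stand-in, not Ginkgo code: `jacobi_cg`, `apply_laplacian`, `dot`, and `CgResult` are hypothetical names, and the 1D Laplacian test matrix is an assumption chosen only to keep the example small. The single template parameter `ValueType` plays the role of the `<float>` or `<double>` argument and reaches the matrix product, the Jacobi preconditioner, the CG iteration, and the residual-norm stopping test alike.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a templated solver stack; NOT Ginkgo code.
template <typename ValueType>
struct CgResult {
    int iterations;
    ValueType residual_norm;
};

// y = A * x for the 1D Laplacian stencil [-1, 2, -1] (assumed test matrix).
template <typename ValueType>
void apply_laplacian(const std::vector<ValueType>& x, std::vector<ValueType>& y)
{
    const std::size_t n = x.size();
    for (std::size_t i = 0; i < n; ++i) {
        ValueType v = ValueType(2) * x[i];
        if (i > 0) v -= x[i - 1];
        if (i + 1 < n) v -= x[i + 1];
        y[i] = v;
    }
}

template <typename ValueType>
ValueType dot(const std::vector<ValueType>& a, const std::vector<ValueType>& b)
{
    ValueType s = ValueType(0);
    for (std::size_t i = 0; i < a.size(); ++i) s += a[i] * b[i];
    return s;
}

// Jacobi-preconditioned CG for A x = b with b = ones and x0 = 0.
// Stops when ||r|| <= reduction * ||b|| or after max_iters iterations.
template <typename ValueType>
CgResult<ValueType> jacobi_cg(int n, ValueType reduction, int max_iters)
{
    std::vector<ValueType> x(n, ValueType(0)), b(n, ValueType(1));
    std::vector<ValueType> r(b), z(n), p(n), q(n);
    const ValueType inv_diag = ValueType(1) / ValueType(2);  // 1 / diag(A)
    const ValueType b_norm = std::sqrt(dot(b, b));

    for (int i = 0; i < n; ++i) z[i] = inv_diag * r[i];  // z = M^-1 r
    p = z;
    ValueType rz = dot(r, z);
    ValueType r_norm = b_norm;
    int it = 0;

    while (it < max_iters && r_norm > reduction * b_norm) {
        apply_laplacian(p, q);  // q = A p
        const ValueType alpha = rz / dot(p, q);
        for (int i = 0; i < n; ++i) {
            x[i] += alpha * p[i];
            r[i] -= alpha * q[i];
        }
        for (int i = 0; i < n; ++i) z[i] = inv_diag * r[i];
        const ValueType rz_new = dot(r, z);
        const ValueType beta = rz_new / rz;
        for (int i = 0; i < n; ++i) p[i] = z[i] + beta * p[i];
        rz = rz_new;
        r_norm = std::sqrt(dot(r, r));
        ++it;
    }
    return {it, r_norm};
}
```

Calling `jacobi_cg<float>(64, 1e-4f, 1000)` and `jacobi_cg<double>(64, 1e-4, 1000)` runs the same algorithm in the two precisions. If any one piece were left in `double` while the rest used `float`, the run would no longer be a pure FP32 one, which is why the preconditioner, solver, and criterion types are all listed above.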