Commit e74aab8
Update on "[ExecuTorch][Llama] Use Eigen blas for custom sdpa"
OpenBLAS's implementation is not thread safe, so it produces incorrect output
when called from within parallel_for. This has been documented in a few
places, such as OpenMathLib/OpenBLAS#1441 and
OpenMathLib/OpenBLAS#2543.
I tried a few options to disable OpenBLAS's multithreading, but none of them
seemed to work.
It is possible that upstream OpenBLAS has fixed this. I will validate that by
pulling in the latest version, but in the meantime I am using eigen_blas to unblock.
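One way to check whether a given BLAS build is affected is to run the same gemm calls serially and concurrently and compare the results. The following is only a minimal sketch of that kind of check, not the test used for this change. `gemm_under_test` and `count_mismatched_tasks` are hypothetical names, the naive triple loop is a stand-in for the real BLAS gemm call, and plain `std::thread` workers stand in for parallel_for.

```cpp
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical stand-in for the BLAS gemm under test:
// C (m x n) = A (m x k) * B (k x n), all row-major.
// To check a real BLAS build, replace the body with the library call.
void gemm_under_test(const float* a, const float* b, float* c,
                     std::size_t m, std::size_t k, std::size_t n) {
  for (std::size_t i = 0; i < m; ++i) {
    for (std::size_t j = 0; j < n; ++j) {
      float acc = 0.0f;
      for (std::size_t p = 0; p < k; ++p) {
        acc += a[i * k + p] * b[p * n + j];
      }
      c[i * n + j] = acc;
    }
  }
}

// Runs num_tasks independent (dim x dim) gemms twice: once serially and once
// with one thread per task. Returns how many tasks produced a different
// result when run concurrently. A thread-safe gemm should return 0.
std::size_t count_mismatched_tasks(std::size_t num_tasks, std::size_t dim) {
  assert(num_tasks > 0 && dim > 0);
  const std::size_t elems = dim * dim;
  std::vector<std::vector<float>> a(num_tasks), b(num_tasks);
  std::vector<std::vector<float>> serial(num_tasks), concurrent(num_tasks);

  // Small integer-valued inputs keep the float arithmetic exact, so the
  // serial and concurrent outputs can be compared bit for bit.
  for (std::size_t t = 0; t < num_tasks; ++t) {
    a[t].resize(elems);
    b[t].resize(elems);
    serial[t].assign(elems, 0.0f);
    concurrent[t].assign(elems, 0.0f);
    for (std::size_t i = 0; i < elems; ++i) {
      a[t][i] = static_cast<float>((t + i) % 7);
      b[t][i] = static_cast<float>((t + 2 * i) % 5);
    }
  }

  // Reference pass: one task at a time.
  for (std::size_t t = 0; t < num_tasks; ++t) {
    gemm_under_test(a[t].data(), b[t].data(), serial[t].data(), dim, dim, dim);
  }

  // Concurrent pass: each task writes only to its own output buffer.
  std::vector<std::thread> workers;
  workers.reserve(num_tasks);
  for (std::size_t t = 0; t < num_tasks; ++t) {
    workers.emplace_back([&, t] {
      gemm_under_test(a[t].data(), b[t].data(), concurrent[t].data(),
                      dim, dim, dim);
    });
  }
  for (std::thread& w : workers) {
    w.join();
  }

  std::size_t mismatches = 0;
  for (std::size_t t = 0; t < num_tasks; ++t) {
    if (serial[t] != concurrent[t]) {
      ++mismatches;
    }
  }
  return mismatches;
}
```

With the naive stand-in, the tasks share no state, so the concurrent and serial results should agree. A nonzero count after swapping in a real BLAS call would point to the kind of thread-safety problem described above.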
Differential Revision: [D64398816](https://our.internmc.facebook.com/intern/diff/D64398816/)
[ghstack-poisoned]
1 parent 8c0113d commit e74aab8