|
1 | 1 | OpenBLAS ChangeLog
|
| 2 | +==================================================================== |
| 3 | +Version 0.3.18 |
| 4 | + 02-Oct-2021 |
| 5 | + |
| 6 | +general: |
| 7 | + - when the build-time number of preconfigured threads is exceeded |
| 8 | + at runtime (typically by an external program calling BLAS functions |
| 9 | + from a larger number of threads in parallel), OpenBLAS will now |
| 10 | + allocate an auxiliary control structure for up to 512 additional |
| 11 | + threads instead of aborting |
| 12 | + - added support for Loongson's LoongArch64 cpu architecture |
| 13 | + - fixed building OpenBLAS with CMAKE and -DBUILD_BFLOAT16=ON |
| 14 | + - added support for building OpenBLAS as a CMAKE subproject |
| 15 | + - added support for building for Windows/ARM64 targets with clang |
| 16 | + - improved support for building with the IBM xlf compiler |
| 17 | + - imported Reference-LAPACK PR 625 (out-of-bounds reads in ?LARRV) |
| 18 | + - imported Reference-LAPACK PR 597 for testsuite compatibility with |
| 19 | + LLVM's libomp |
| 20 | + |
| 21 | +x86_64: |
| 22 | + - added SkylakeX S/DGEMM kernels for small problem sizes (M*N*K<=1000000) |
| 23 | + - added optimized SBGEMM for Intel Cooper Lake |
| 24 | + - reinstated the performance patch for AVX512 SGEMV_T with a proper fix |
| 25 | + - added a workaround for a gcc11 tree-vectorizer bug that caused spurious |
| 26 | + failures in the test programs for complex BLAS3 when compiling at -O3 |
| 27 | + (the default for cmake "release" builds) |
| 28 | + - added support for runtime cpu count detection under Haiku OS |
| 29 | + - worked around a long-standing miscompilation issue of the Haswell DGEMV_T |
| 30 | + kernel with gcc that could produce NaN output in some corner cases |
| 31 | + |
| 32 | +POWER: |
| 33 | + - improved performance of DASUM on POWER10 |
| 34 | + |
| 35 | +ARMV8: |
| 36 | + - fixed crashes (use of reserved register x18) on Apple M1 under OSX |
| 37 | + - fixed building with gcc releases earlier than 5.1 |
| 38 | + |
| 39 | +MIPS: |
| 40 | + - fixed building under BSD |
| 41 | + |
| 42 | +MIPS64: |
| 43 | + - fixed building under BSD |
| 44 | + |
2 | 45 | ====================================================================
|
3 | 46 | Version 0.3.17
|
4 | 47 | 15-Jul-2021
|
|
0 commit comments