Commit 5a468ae

Update Changelog for 0.3.18 (#3388)
* Update Changelog for 0.3.18
1 parent f0e8560 commit 5a468ae

1 file changed (+43 -0 lines changed)

Changelog.txt

Lines changed: 43 additions & 0 deletions
@@ -1,4 +1,47 @@
 OpenBLAS ChangeLog
+====================================================================
+Version 0.3.18
+02-Oct-2021
+
+general:
+- when the build-time number of preconfigured threads is exceeded
+  at runtime (typically by an external program calling BLAS functions
+  from a larger number of threads in parallel), OpenBLAS will now
+  allocate an auxiliary control structure for up to 512 additional
+  threads instead of aborting
+- added support for Loongson's LoongArch64 cpu architecture
+- fixed building OpenBLAS with CMAKE and -DBUILD_BFLOAT16=ON
+- added support for building OpenBLAS as a CMAKE subproject
+- added support for building for Windows/ARM64 targets with clang
+- improved support for building with the IBM xlf compiler
+- imported Reference-LAPACK PR 625 (out-of-bounds reads in ?LARRV)
+- imported Reference-LAPACK PR 597 for testsuite compatibility with
+  LLVM's libomp
+
+x86_64:
+- added SkylakeX S/DGEMM kernels for small problem sizes (M*N*K<=1000000)
+- added optimized SBGEMM for Intel Cooper Lake
+- reinstated the performance patch for AVX512 SGEMV_T with a proper fix
+- added a workaround for a gcc11 tree-vectorizer bug that caused spurious
+  failures in the test programs for complex BLAS3 when compiling at -O3
+  (the default for cmake "release" builds)
+- added support for runtime cpu count detection under Haiku OS
+- worked around a long-standing miscompilation issue of the Haswell DGEMV_T
+  kernel with gcc that could produce NaN output in some corner cases
+
+POWER:
+- improved performance of DASUM on POWER10
+
+ARMV8:
+- fixed crashes (use of reserved register x18) on Apple M1 under OSX
+- fixed building with gcc releases earlier than 5.1
+
+MIPS:
+- fixed building under BSD
+
+MIPS64:
+- fixed building under BSD
+
 ====================================================================
 Version 0.3.17
 15-Jul-2021
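
The x86_64 entry for 0.3.18 quotes a size criterion of M*N*K<=1000000 for the new small-matrix SkylakeX S/DGEMM kernels. The following is a minimal sketch of how such a check could be written without integer overflow. The names `SMALL_GEMM_LIMIT` and `is_small_gemm` are hypothetical and chosen for illustration only; this is not OpenBLAS's actual dispatch code, and the changelog does not say what other conditions the library applies.

```c
#include <stdbool.h>
#include <stdint.h>

/* Threshold taken from the changelog entry (M*N*K <= 1000000).
 * The macro name is an assumption made for this sketch. */
#define SMALL_GEMM_LIMIT 1000000

/* Hypothetical helper: returns true when m*n*k <= SMALL_GEMM_LIMIT.
 * Division is used instead of multiplication so that very large
 * dimensions cannot overflow the intermediate product. */
bool is_small_gemm(int64_t m, int64_t n, int64_t k)
{
    if (m <= 0 || n <= 0 || k <= 0)
        return false;                      /* empty or invalid problem */
    if (m > SMALL_GEMM_LIMIT / n)
        return false;                      /* m*n alone exceeds the limit */
    return k <= SMALL_GEMM_LIMIT / (m * n);
}
```

Under these assumptions, a 100x100x100 problem sits exactly on the limit and passes the check, while 101x100x100 does not.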
