Commit 90e6c66

Merge pull request #2397 from martin-frbg/038changes
Update Changelog with changes from 0.3.8
2 parents 47c1bf7 + 32d9733 commit 90e6c66

1 file changed: +55 -0 lines changed

Changelog.txt

Lines changed: 55 additions & 0 deletions
@@ -1,4 +1,59 @@
 OpenBLAS ChangeLog
+====================================================================
+Version 0.3.8
+9-Feb-2020
+
+common:
+  * LAPACK has been updated to 3.9.0 (plus patches up to
+    January 2nd, 2020)
+  * CMAKE support has been improved in several areas including
+    cross-compilation
+  * a thread race condition in the GEMM3M kernels was resolved
+  * the "generic" (plain C) gemm beta kernel used by many targets
+    has been sped up
+  * an optimized version of the LAPACK trtrs functions has been added
+  * an incompatibility between the LAPACK tests and the OpenBLAS
+    implementation of XERBLA was resolved, removing the numerous
+    warnings about wrong error exits in the former
+  * support for NetBSD has been added
+  * support for compilation with g95 and non-GNU versions of ld
+    has been improved
+  * support for compilation with (upcoming) gcc 10 has been added
+
+POWER:
+  * worked around miscompilation of several POWER8 and POWER9
+    kernels by older versions of gcc
+  * added support for big-endian POWER8 and for compilation on AIX
+  * corrected bugs in the big-endian support for PPC440 and PPC970
+  * DYNAMIC_ARCH support is now available in CMAKE builds as well
+
+ARMV8:
+  * performance of DGEMM_BETA and SGEMM_NCOPY has been improved
+  * compilation for 32bit works again
+  * performance of the RPCC function has been improved
+  * improved performance on small systems
+  * DYNAMIC_ARCH support is now available in CMAKE builds as well
+  * cross-compilation from OSX to IOS was simplified
+
+x86_64:
+  * a new AVX512 DGEMM kernel was added and the AVX512 SGEMM kernel
+    was significantly improved
+  * optimized AVX512 kernels for CGEMM and ZGEMM have been added
+  * AVX2 kernels for STRMM, SGEMM, and CGEMM have been significantly
+    sped up and optimized CGEMM3M and ZGEMM3M kernels have been added
+  * added support for QEMU virtual cpus
+  * a compilation problem with PGI and SUN compilers was fixed
+  * Intel "Goldmont plus" is now autodetected
+  * a potential crash on program exit on MS Windows has been fixed
+
+x86:
+  * an unwanted case sensitivity in the implementation of LSAME
+    on older 32bit AMD cpus was fixed
+
+zarch:
+  * Z15 is now supported as Z14
+  * DYNAMIC_ARCH is now available on ZARCH as well
+
 ====================================================================
 Version 0.3.7
 11-Aug 2019
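
For context on the x86 entry above: LSAME is the LAPACK auxiliary routine that compares two option characters without regard to case, so that 'n' and 'N' select the same operation. The following is a minimal Python sketch of that contract only, not the OpenBLAS implementation; the function name `lsame` and the handling of empty strings are choices made here for illustration.

```python
def lsame(ca: str, cb: str) -> bool:
    """Sketch of the LSAME contract: True if the first characters of
    ca and cb are the same letter, ignoring case.

    Returning False for empty input is an assumption of this sketch.
    """
    if not ca or not cb:
        return False
    # Only the first character takes part in the comparison.
    return ca[0].upper() == cb[0].upper()


if __name__ == "__main__":
    # Option characters such as TRANS are expected to match in either case.
    for trans in ("N", "n", "T", "t"):
        print(trans, lsame(trans, "N"), lsame(trans, "T"))
```

A case-sensitive comparison would reject lowercase option characters such as 'n', which is the kind of behavior the changelog entry describes as unwanted.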
