|
1 | 1 | OpenBLAS ChangeLog
|
| 2 | +==================================================================== |
| 3 | +Version 0.3.8 |
| 4 | + 9-Feb-2020 |
| 5 | + |
| 6 | +common: |
| 7 | +` * LAPACK has been updated to 3.9.0 (plus patches up to |
| 8 | + January 2nd, 2020) |
| 9 | + * CMAKE support has been improved in several areas including |
| 10 | + cross-compilation |
| 11 | + * a thread race condition in the GEMM3M kernels was resolved |
| 12 | + * the "generic" (plain C) gemm beta kernel used by many targets |
| 13 | + has been sped up |
| 14 | + * an optimized version of the LAPACK trtrs functions has been added |
| 15 | + * an incompatibilty between the LAPACK tests and the OpenBLAS |
| 16 | + implementation of XERBLA was resolved, removing the numerous |
| 17 | + warnings about wrong error exits in the former |
| 18 | + * support for NetBSD has been added |
| 19 | + * support for compilation with g95 and non-GNU versions of ld |
| 20 | + has been improved |
| 21 | + * support for compilation with (upcoming) gcc 10 has been added |
| 22 | + |
| 23 | +POWER: |
| 24 | + * worked around miscompilation of several POWER8 and POWER9 |
| 25 | + kernels by older versions of gcc |
| 26 | + * added support for big-endian POWER8 and for compilation on AIX |
| 27 | + * corrected bugs in the big-endian support for PPC440 and PPC970 |
| 28 | + * DYNAMIC_ARCH support is now available in CMAKE builds as well |
| 29 | + |
| 30 | +ARMV8: |
| 31 | + * performance of DGEMM_BETA and SGEMM_NCOPY has been improved |
| 32 | + * compilation for 32bit works again |
| 33 | + * performance of the RPCC function has been improved |
| 34 | + * improved performance on small systems |
| 35 | + * DYNAMIC_ARCH support is now available in CMAKE builds as well |
| 36 | + * cross-compilation from OSX to IOS was simplified |
| 37 | + |
| 38 | +x86_64: |
| 39 | + * a new AVX512 DGEMM kernel was added and the AVX512 SGEMM kernel |
| 40 | + was significantly improved |
| 41 | + * optimized AVX512 kernels for CGEMM and ZGEMM have been added |
| 42 | + * AVX2 kernels for STRMM, SGEMM, and CGEMM have been significantly |
| 43 | + sped up and optimized CGEMM3M and ZGEMM3M kernels have been added |
| 44 | + * added support for QEMU virtual cpus |
| 45 | + * a compilation problem with PGI and SUN compilers was fixed |
| 46 | + * Intel "Goldmont plus" is now autodetected |
| 47 | + * a potential crash on program exit on MS Windows has been fixed |
| 48 | + |
| 49 | +x86: |
| 50 | + * an unwanted case sensitivity in the implementation of LSAME |
| 51 | + on older 32bit AMD cpus was fixed |
| 52 | + |
| 53 | +zarch: |
| 54 | + * Z15 is now supported as Z14 |
| 55 | + * DYNAMIC_ARCH is now available on ZARCH as well |
| 56 | + |
2 | 57 | ====================================================================
|
3 | 58 | Version 0.3.7
|
4 | 59 | 11-Aug 2019
|
|
0 commit comments