11OpenBLAS ChangeLog
2+ ====================================================================
3+ Version 0.3.12
4+ 24-Oct-2020
5+
6+ common:
7+ * Fixed missibg LAPACK functions (inadvertently dropped during
8+ the build system restructuring)
9+ * Fixed argument conversion macro in LAPACKE_zgesvdq (LAPACK #458)
10+
11+ POWER:
12+ * Added optimized SCOPY/CCOPY kernels for POWER10
13+ * Increased and unified the default size of the GEMM BUFFER
14+ * Fixed building for POWER1ß in DYNAMIC_ARCH mode
15+ * POWER10 compatibility test now checks binutils version as well
16+ * Cleaned up compiler warnings
17+
18+ x86_64:
19+ * corrected compiler version checks for AVX2 compatibility
20+ * added compiler option -mavx2 for building with flang
21+ * fixed direct SGEMM pathway for small matrix sizes (broken by
22+ the code refactoring in 0.3.11)
23+ * fixed unhandled partial register clobbers in several kernels
24+ for AXPY,DOT,GEMV_N and GEMV_T flagged by gcc10 tree-vectorizer
25+
26+ ARMV8:
27+ * improved Apple Vortex support to include cross-compiling
28+
229====================================================================
330Version 0.3.11
431 17-Oct-2020
532
6- common:
33+ common:
734 * API change:
835 the newly added BFLOAT16 functions were renamed to use the
936 letter "B" instead of "H" to avoid potential confusion with
@@ -28,7 +55,7 @@ Version 0.3.11
2855 * Makefile builds no longer misread NO_CBLAS=0 or NO_LAPACK=0 as
2956 enabling these options
3057 * Fixed detection of gfortran when invoked through an mpi wrapper
31- * Improve thread reinitialization performance with OpenMP xafter a fork
58+ * Improve thread reinitialization performance with OpenMP after a fork
3259 * Added support for building only the subset of the library required
3360 for a particular precision by specifying BUILD_SINGLE, BUILD_DOUBLE
3461 * Optional function name prefixes and suffixes are now correctly
6693 * Fixed cpu detection on BSD-like systems
6794 * Fixed compilation in -std=C18 mode
6895
69-
7096IBM Z:
7197 * Added support for compiling with the clang compiler
7298 * Improved GEMM performance on Z14
0 commit comments