@@ -59,42 +59,42 @@ Version 0.3.7
59
59
11-Aug 2019
60
60
61
61
common:
62
- * having the gmake special variables TARGET_ARCH or TARGET_MACH
63
- defined no longer causes build failures in ctest or utest
64
- * defining NO_AFFINITY or USE_TLS to 0 in gmake builds no longer
65
- has the same effect as setting them to 1
66
- * a new test program was added to allow checking the library for
67
- thread safety
68
- * a new option USE_LOCKING was added to ensure thread safety when
69
- OpenBLAS itself is built without multithreading but will be
70
- called from multiple threads.
71
- * a build failure on Linux with glibc versions earlier than 2.5
72
- was fixed
73
- * a runtime error with CPU enumeration (and NO_AFFINITY not set)
74
- on glibc 2.6 was fixed
75
- * NO_AFFINITY was added to the CMAKE options (and defaults to being
76
- active on Linux, as in the gmake builds)
62
+ * having the gmake special variables TARGET_ARCH or TARGET_MACH
63
+ defined no longer causes build failures in ctest or utest
64
+ * defining NO_AFFINITY or USE_TLS to 0 in gmake builds no longer
65
+ has the same effect as setting them to 1
66
+ * a new test program was added to allow checking the library for
67
+ thread safety
68
+ * a new option USE_LOCKING was added to ensure thread safety when
69
+ OpenBLAS itself is built without multithreading but will be
70
+ called from multiple threads.
71
+ * a build failure on Linux with glibc versions earlier than 2.5
72
+ was fixed
73
+ * a runtime error with CPU enumeration (and NO_AFFINITY not set)
74
+ on glibc 2.6 was fixed
75
+ * NO_AFFINITY was added to the CMAKE options (and defaults to being
76
+ active on Linux, as in the gmake builds)
77
77
78
78
x86_64:
79
- * the build-time logic for detection of AVX512 availability in
80
- the processor and compiler was fixed
81
- * gmake builds on OSX now set the internal name of the library to
82
- libopenblas.0.dylib (consistent with CMAKE)
83
- * the Haswell DGEMM kernel received a significant speedup through
84
- improved prefetch and load instructions
85
- * performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly
86
- increased by avoiding vpermpd instructions
87
- * the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled
88
- to fix remaining errors in DGEMM, DSYMM and DTRMM
89
-
90
- ## POWER:
91
- * added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970
92
- * added optimized kernels for POWER9 SGEMM and STRMM
93
-
94
- ## ARMV7:
95
- * fixed the softfp implementations of xAMAX and IxAMAX
96
- * removed the predefined -march= flags on both ARMV5 and ARMV6 as
97
- they were appropriate for only a subset of platforms
79
+ * the build-time logic for detection of AVX512 availability in
80
+ the processor and compiler was fixed
81
+ * gmake builds on OSX now set the internal name of the library to
82
+ libopenblas.0.dylib (consistent with CMAKE)
83
+ * the Haswell DGEMM kernel received a significant speedup through
84
+ improved prefetch and load instructions
85
+ * performance of DGEMM, DTRMM, DTRSM and ZDOT on Zen/Zen2 was markedly
86
+ increased by avoiding vpermpd instructions
87
+ * the SKYLAKEX (AVX512) DGEMM helper functions have now been disabled
88
+ to fix remaining errors in DGEMM, DSYMM and DTRMM
89
+
90
+ POWER:
91
+ * added support for building on FreeBSD/powerpc64 and FreeBSD/ppc970
92
+ * added optimized kernels for POWER9 SGEMM and STRMM
93
+
94
+ ARMV7:
95
+ * fixed the softfp implementations of xAMAX and IxAMAX
96
+ * removed the predefined -march= flags on both ARMV5 and ARMV6 as
97
+ they were appropriate for only a subset of platforms
98
98
99
99
====================================================================
100
100
Version 0.3.6
0 commit comments