| 1 | 1 | OpenBLAS ChangeLog |
| 2 | +==================================================================== |
| 3 | +Version 0.3.13 |
| 4 | + 12-Dec-2020 |
| 5 | + |
| 6 | + common: |
| 7 | + * Added a generic bfloat16 SBGEMV kernel |
| 8 | + * Fixed a potentially severe memory leak after fork in OpenMP builds |
| 9 | + that was introduced in 0.3.12 |
| 10 | + * Added detection of the Fujitsu Fortran compiler |
| 11 | + * Added detection of the (e)gfortran compiler on OpenBSD |
| 12 | + * Added support for overriding the default name of the library independently |
| 13 | + from symbol suffixing in the gmake builds (already supported in cmake) |
| 14 | + |
| 15 | +RISCV: |
| 16 | + * Added a RISC-V port optimized for C910V |
| 17 | + |
| 18 | +POWER: |
| 19 | + * Added optimized POWER10 kernels for SAXPY, CAXPY, SDOT, DDOT and DGEMV_N |
| 20 | + * Improved DGEMM performance on POWER10 |
| 21 | + * Improved STRSM and DTRSM performance on POWER9 and POWER10 |
| 22 | + * Fixed segmentation faults in DYNAMIC_ARCH builds |
| 23 | + * Fixed compilation with the PGI compiler |
| 24 | + |
| 25 | +x86: |
| 26 | + * Fixed compilation of kernels that require SSE2 intrinsics since 0.3.12 |
| 27 | + |
| 28 | +x86_64: |
| 29 | + * Added an optimized bfloat16 SBGEMV kernel for SkylakeX and Cooperlake |
| 30 | + * Improved the performance of SASUM and DASUM kernels through parallelization |
| 31 | + * Improved the performance of SROT and DROT kernels |
| 32 | + * Improved the performance of multithreaded xSYRK |
| 33 | + * Fixed OpenMP builds that use the LLVM Clang compiler together with GNU gfortran |
| 34 | + (where linking of both the LLVM libomp and GNU libgomp could lead to lockups or |
| 35 | + wrong results) |
| 36 | + * Fixed miscompilations by old gcc 4.6 |
| 37 | + * Fixed misdetection of AVX2 capability in some Sandybridge CPUs |
| 38 | + * Fixed lockups in builds combining DYNAMIC_ARCH with TARGET=GENERIC on OpenBSD |
| 39 | + |
| 40 | +ARM64: |
| 41 | + * Fixed segmentation faults in DYNAMIC_ARCH builds |
| 42 | + |
| 43 | +MIPS: |
| 44 | + * Improved kernels for Loongson 3R3 ("3A") and 3R4 ("3B") models, including MSA |
| 45 | + * Fixed bugs in the MSA kernels for CGEMM, CTRMM, CGEMV and ZGEMV |
| 46 | + * Added handling of zero increments in the MSA kernels for SSWAP and DSWAP |
| 47 | + * Added DYNAMIC_ARCH support for MIPS64 (currently Loongson3R3/3R4 only) |
| 48 | + |
| 49 | +SPARC: |
| 50 | + * Fixed building 32 and 64 bit SPARC kernels with the SolarisStudio compilers |
| 51 | + |
| 2 | 52 | ==================================================================== |
| 3 | 53 | Version 0.3.12 |
| 4 | 54 | 24-Oct-2020 |