You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- clBLAS's Gemm implementation has been comprehensively overhauled to use AutoGemm. AutoGemm is a suite of python scripts which generate optimized kernels and kernel selection logic, for all precisions, transposes, tile sizes and so on.
30
-
- CMake is configured to use AutoGemm for clBLAS so the build and usage experience of Gemm remains unchanged (only performance and maintainability has been improved). Kernel sources are generated at build time (not runtime) and can be configured within CMake to be pre-compiled at build time.
31
-
- clBLAS users with unique Gemm requirements can customize AutoGemm to their needs (such as non-default tile sizes for very small or very skinny matrices); see [AutoGemm](http://github.com/clMathLibraries/clBLAS/wiki/AutoGemm) documentation for details.
26
+
## clBLAS update notes 01/2017
32
27
28
+
- v2.12 is a bugfix release as a rollup of all fixes in /develop branch
29
+
- Thanks to @pavanky, @iotamudelta, @shahsan10, @psyhtest, @haahh, @hughperkins, @tfauck
30
+
@abhiShandy, @IvanVergiliev, @zougloub, @mgates3 for contributions to clBLAS v2.12
31
+
- Summary of fixes available to read on the releases tab
33
32
34
33
## clBLAS library user documentation
35
34
@@ -202,7 +201,7 @@ The simple example below shows how to use clBLAS to compute an OpenCL accelerate
202
201
- Netlib CBLAS (recommended)
203
202
Ubuntu: install by "apt-get install libblas-dev"
204
203
Windows: download & install lapack-3.6.0 which comes with CBLAS
0 commit comments