Skip to content

Commit b567cd4

Browse files
author
Kent Knox
committed
Update README with release notes
1 parent a71aa63 commit b567cd4

File tree

1 file changed

+7
-8
lines changed

1 file changed

+7
-8
lines changed

README.md

Lines changed: 7 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ This repository houses the code for the OpenCL™ BLAS portion of clMath.
1010
The complete set of BLAS level 1, 2 & 3 routines is implemented. Please
1111
see Netlib BLAS for the list of supported routines. In addition to GPU
1212
devices, the library also supports running on CPU devices to facilitate
13-
debugging and multicore programming. APPML 1.10 is the most current
13+
debugging and multicore programming. APPML 1.12 is the most current
1414
generally available pre-packaged binary version of the library available
1515
for download for both Linux and Windows platforms.
1616

@@ -23,13 +23,12 @@ library does generate and enqueue optimized OpenCL kernels, relieving
2323
the user from the task of writing, optimizing and maintaining kernel
2424
code themselves.
2525

26-
## clBLAS update notes 09/2015
27-
28-
- Introducing [AutoGemm](http://github.com/clMathLibraries/clBLAS/wiki/AutoGemm)
29-
- clBLAS's Gemm implementation has been comprehensively overhauled to use AutoGemm. AutoGemm is a suite of python scripts which generate optimized kernels and kernel selection logic, for all precisions, transposes, tile sizes and so on.
30-
- CMake is configured to use AutoGemm for clBLAS so the build and usage experience of Gemm remains unchanged (only performance and maintainability has been improved). Kernel sources are generated at build time (not runtime) and can be configured within CMake to be pre-compiled at build time.
31-
- clBLAS users with unique Gemm requirements can customize AutoGemm to their needs (such as non-default tile sizes for very small or very skinny matrices); see [AutoGemm](http://github.com/clMathLibraries/clBLAS/wiki/AutoGemm) documentation for details.
26+
## clBLAS update notes 01/2017
3227

28+
- v2.12 is a bugfix release as a rollup of all fixes in /develop branch
29+
- Thanks to @pavanky, @iotamudelta, @shahsan10, @psyhtest, @haahh, @hughperkins, @tfauck
30+
@abhiShandy, @IvanVergiliev, @zougloub, @mgates3 for contributions to clBLAS v2.12
31+
- Summary of fixes available to read on the releases tab
3332

3433
## clBLAS library user documentation
3534

@@ -202,7 +201,7 @@ The simple example below shows how to use clBLAS to compute an OpenCL accelerate
202201
- Netlib CBLAS (recommended)
203202
Ubuntu: install by "apt-get install libblas-dev"
204203
Windows: download & install lapack-3.6.0 which comes with CBLAS
205-
- or ACML on windows/linux; Accelerate on Mac OSX
204+
- or ACML on windows/linux; Accelerate on Mac OSX
206205
207206
### Performance infrastructure
208207
* Python

0 commit comments

Comments
 (0)