Skip to content

Commit 37aeff5

Browse files
author
Timmy
committed
enable sgemm column major TN case to take advantage of faster sgemm NN kernel by doing transpose separately
1 parent 6ab7fa9 commit 37aeff5

File tree

5 files changed

+536
-4
lines changed

5 files changed

+536
-4
lines changed

src/library/blas/include/blas_funcs.h

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -78,6 +78,7 @@ typedef enum BlasFunctionID {
7878
CLBLAS_iAMAX,
7979
CLBLAS_NRM2,
8080
CLBLAS_ASUM,
81+
CLBLAS_TRANSPOSE,
8182

8283
/* ! Must be the last */
8384
BLAS_FUNCTIONS_NUMBER

0 commit comments

Comments
 (0)