Commit 86aaf3c

Add fast sine and cosine definitions in common.h for CUDA templates (#865)
Parent: 48c9a35. Commit: 86aaf3c.

1 file changed: +2 -0 lines changed

src/tl_templates/cuda/common.h

Lines changed: 2 additions & 0 deletions
@@ -18,6 +18,8 @@ using int4_t = int4;
 #define hexp cutlass::fast_exp
 #define hlog cutlass::fast_log
 #define hsqrt cutlass::fast_sqrt
+#define hsin cutlass::fast_sin
+#define hcos cutlass::fast_cos
 #define htanh cutlass::fast_tanh
 #define hpow powf
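
The two added macros follow the same aliasing pattern as the existing `hexp`, `hlog`, and `hsqrt` lines. As a rough illustration of how such aliases might be used, here is a minimal host-side C++ sketch. It does not include CUTLASS: the `fastmath_stub` namespace is a hypothetical stand-in that simply forwards to `std::sin` and `std::cos`, and the `rotate` helper is invented for this example. In the real header the macros point at `cutlass::fast_sin` and `cutlass::fast_cos`, whose precision and device behavior may differ from this stub.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical stand-in for the CUTLASS fast-math helpers; NOT the real library.
// It only assumes float overloads named fast_sin / fast_cos exist.
namespace fastmath_stub {
inline float fast_sin(float x) { return std::sin(x); }
inline float fast_cos(float x) { return std::cos(x); }
}  // namespace fastmath_stub

// Same aliasing pattern as the two added lines, pointed at the stub instead.
#define hsin fastmath_stub::fast_sin
#define hcos fastmath_stub::fast_cos

// Hypothetical helper: rotate the point (x, y) by angle theta (radians)
// using the aliases, as generated template code might do.
inline void rotate(float theta, float &x, float &y) {
  const float s = hsin(theta);
  const float c = hcos(theta);
  const float nx = c * x - s * y;
  const float ny = s * x + c * y;
  x = nx;
  y = ny;
}
```

Because the aliases are plain preprocessor macros, `hsin(theta)` expands textually to `fastmath_stub::fast_sin(theta)` here, so overload resolution happens on the target function rather than on the macro name.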
