Skip to content

Commit 5ee8846

Browse files
committed
Add static switch utility and utility functions for tensor operations
- Introduced `static_switch.h` with BOOL_SWITCH, EVENK_SWITCH, SOFTCAP_SWITCH, FP16_SWITCH, and HEADDIM_SWITCH macros for compile-time conditional execution. - Added `utils.h` containing various utility functions for tensor operations, including relu, max, sum, and GEMM implementations. - Implemented specialized relu functions for half and bfloat16 types using inline PTX assembly for performance optimization. - Enhanced tensor layout conversion functions to support different configurations for GEMM operations. - Included support for asynchronous copy operations and softmax calculations within the FLASH_NAMESPACE.
1 parent 7e63d9c commit 5ee8846

File tree

88 files changed

+0
-0
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

88 files changed

+0
-0
lines changed
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.
File renamed without changes.

0 commit comments

Comments
 (0)