triton-kernels

i write kernels when bored and publish them here.

some are efficient, some are not (as native torch utilizes inline PTX in CUDA environments)

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
attention_mechanism		attention_mechanism
basic_operations		basic_operations
fused_kernels		fused_kernels
LICENSE		LICENSE
README.md		README.md

Provide feedback