Skip to content

ltoniazzi/cuda-examples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cuda Examples

Collection of example kernels:

  • Tiled MatMul: A simple implementation of tiled multiplication
  • 1D Softmax: different implementations of 1D softmax with some profiling
  • Flash Atetntion: Implementation of fused matmul and softmax and then flash attention.
  • Reduce: Simple implementations of the sum/reduce kernel.

Setup

make setup

Tested on:

  • NVIDIA A10G
  • CUDA Version: 12.6

About

Collection of examples of kernels

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published