Skip to content

RonithManikonda/attention-efficiency

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Attention Efficiency Simulator (WIP)

Early-stage C scaffold for experimenting with attention implementations and efficiency trade-offs (KV-cache, IO, runtime). The project will grow into a small simulator/benchmark suite comparing MHA, GQA/MQA, MLA, and FlashAttention-style tiling in C with stable-softmax correctness tests.

Quick start

cmake -S . -B build
cmake --build build

This currently builds stub binaries (attn_bench, attn_tests) as placeholders for the upcoming implementations.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published