[FA]: Optimize FlashAttention for N_CTX <= 512 #2600

Merged
quintinwang5 merged 1 commit into main from quintin/perf_n_ctx_512
Nov 1, 2024

Commits

Commits on Oct 31, 2024