Skip to content

Commit 9799c3f

Browse files
authored
[webgpu] Enable FlashAttention for GQA (microsoft#23761)
### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->
1 parent 50c835d commit 9799c3f

File tree

3 files changed

+116
-84
lines changed

3 files changed

+116
-84
lines changed

0 commit comments

Comments
 (0)