Commit 9799c3f
[webgpu] Enable FlashAttention for GQA (microsoft#23761)
### Description
<!-- Describe your changes. -->
### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->
1 parent 50c835d commit 9799c3f
3 files changed: +116 -84 lines changed
- onnxruntime/contrib_ops/webgpu/bert