Commit 81a04ca
[webgpu] Fix the wrong fallback in Attention (#26608)
Attention input handling updates:
* Corrected the input index for `past` from `input[5]` to `input[4]`
in the fallback logic, so the code matches the actual input
order.
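The effect of the index change can be illustrated with a minimal sketch. This is not the code from the commit: the helper names, the `present` vector standing in for "input i exists on the node", and the rule "fall back to the CPU when `past` is supplied" are all assumptions made for illustration. The name given to slot 5 is likewise an assumption about the Attention input order.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Input slots of the Attention op. Slot 4 (`past`) comes from the commit
// message; the other names are assumptions used only for this sketch.
enum AttentionInput : std::size_t {
  kInput = 0,
  kWeights = 1,
  kBias = 2,
  kMaskIndex = 3,
  kPast = 4,           // slot read after the fix
  kAttentionBias = 5,  // slot read before the fix (name assumed)
};

// Hypothetical helper: true when the optional input at `index` is supplied.
inline bool HasInput(const std::vector<bool>& present, std::size_t index) {
  return index < present.size() && present[index];
}

// Assumed fallback rule after the fix: decline the node when `past` exists.
inline bool ShouldFallBackToCpu(const std::vector<bool>& present) {
  return HasInput(present, kPast);
}

// The same rule with the old, wrong index, kept for comparison.
inline bool ShouldFallBackToCpuOld(const std::vector<bool>& present) {
  return HasInput(present, kAttentionBias);
}

// Small self-check: a node that supplies slot 5 but not `past`.
inline void CheckSketch() {
  const std::vector<bool> present = {true, true, true, false, false, true};
  assert(ShouldFallBackToCpuOld(present));  // old index: wrongly falls back
  assert(!ShouldFallBackToCpu(present));    // fixed index: stays on the GPU
}
```

Under these assumptions, a node that supplies slot 5 but no `past` was declined by the old check and is accepted by the corrected one.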
With this change, the Attention ops in phi-4-mm-vision.onnx can run on
the GPU instead of the CPU.

1 parent e6023b0
1 file changed under onnxruntime/core/providers/webgpu (2 additions, 2 deletions)
[Diff content not recovered: lines 881 and 883 were each replaced; context lines 878-880, 882, and 884-886 are unchanged.]