Commit 066f34b
ssjia
Update base for Update on "[ET-VK][AOT] Enable exporting Q8 Quantized Linear + Convolution"
As title. Introduce fusion patterns to enable fusing quantized convolution and linear graph patterns into a custom op.
## Changes
Introduce the concept of using custom pattern detection functions to detect graph patterns rather than solely relying on SubgraphMatcher. The issue with SubgraphMatcher is that a large number of graph patterns may need to be exported to obtain variants for different combinations of decompositions/quantization workflows. Having a custom detection function improves maintainability.
Implement detection + replacement functions for quantized linear and quantized conv2d.
Differential Revision: [D81323425](https://our.internmc.facebook.com/intern/diff/D81323425/)
[ghstack-poisoned]1 parent c680357 commit 066f34b
File tree
1 file changed
+4
-0
lines changed- backends/vulkan/runtime/graph/ops/impl
1 file changed
+4
-0
lines changedLines changed: 4 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
478 | 478 | | |
479 | 479 | | |
480 | 480 | | |
| 481 | + | |
| 482 | + | |
481 | 483 | | |
482 | 484 | | |
483 | 485 | | |
| |||
552 | 554 | | |
553 | 555 | | |
554 | 556 | | |
| 557 | + | |
| 558 | + | |
555 | 559 | | |
556 | 560 | | |
557 | 561 | | |
| |||
0 commit comments