Commit b2c1bc4
committed
Update on "[ET-VK] Removed shared memory usage and simplied conv2d dw op shader to improve performance."
This diff removes shared memory usage in `conv2d_dw_output_tile.glsl` shader to improve performance.
Makes sum a one dimensional array, and moves bias application before storing texel.
Differential Revision: [D75499165](https://our.internmc.facebook.com/intern/diff/D75499165/)
[ghstack-poisoned]File tree
109 files changed
+3712
-3329
lines changed- .ci/scripts/wheel
- .github/workflows
- backends
- apple/coreml
- partition
- runtime/util
- test
- arm
- quantizer
- test/ops
- cadence/aot
- tests
- qualcomm
- _passes
- builders
- partition
- quantizer
- observers
- runtime
- backends
- tests
- transforms
- vulkan/runtime/graph/ops
- glsl
- impl
- xnnpack
- operators
- partition
- config
- runtime
- serialization
- test/ops
- docs/source
- examples
- apple/coreml/scripts
- demo-apps/apple_ios/LLaMA/docs/delegates
- models
- llama/tests
- qualcomm
- oss_scripts
- llama
- runner
- moshi
- exir/program
- extension
- apple/ExecuTorch
- Exported
- __tests__
- benchmark/apple/Benchmark/Benchmark.xcodeproj
- data_loader
- llm/export
- runtime/core/portable_type/c10/c10
- scripts
- third-party
- tools/cmake/preset
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
109 files changed
+3712
-3329
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
18 | 22 | | |
19 | 23 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
20 | | - | |
| 20 | + | |
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
386 | 386 | | |
387 | 387 | | |
388 | 388 | | |
389 | | - | |
390 | | - | |
| 389 | + | |
391 | 390 | | |
392 | 391 | | |
393 | 392 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
156 | 156 | | |
157 | 157 | | |
158 | 158 | | |
| 159 | + | |
159 | 160 | | |
160 | 161 | | |
161 | 162 | | |
| |||
173 | 174 | | |
174 | 175 | | |
175 | 176 | | |
176 | | - | |
177 | | - | |
| 177 | + | |
178 | 178 | | |
179 | 179 | | |
180 | 180 | | |
| |||
314 | 314 | | |
315 | 315 | | |
316 | 316 | | |
317 | | - | |
318 | | - | |
| 317 | + | |
319 | 318 | | |
320 | 319 | | |
321 | 320 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
386 | 386 | | |
387 | 387 | | |
388 | 388 | | |
389 | | - | |
390 | | - | |
391 | 389 | | |
392 | 390 | | |
393 | | - | |
394 | | - | |
395 | 391 | | |
396 | 392 | | |
397 | 393 | | |
398 | 394 | | |
399 | 395 | | |
400 | 396 | | |
401 | | - | |
402 | 397 | | |
403 | 398 | | |
404 | 399 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
565 | 565 | | |
566 | 566 | | |
567 | 567 | | |
568 | | - | |
569 | | - | |
570 | 568 | | |
571 | 569 | | |
572 | 570 | | |
| |||
602 | 600 | | |
603 | 601 | | |
604 | 602 | | |
605 | | - | |
| 603 | + | |
606 | 604 | | |
607 | 605 | | |
608 | 606 | | |
| |||
701 | 699 | | |
702 | 700 | | |
703 | 701 | | |
704 | | - | |
| 702 | + | |
705 | 703 | | |
706 | 704 | | |
707 | 705 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
19 | 19 | | |
20 | 20 | | |
21 | 21 | | |
22 | | - | |
23 | | - | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
24 | 26 | | |
25 | 27 | | |
26 | 28 | | |
| |||
37 | 39 | | |
38 | 40 | | |
39 | 41 | | |
40 | | - | |
41 | | - | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
42 | 52 | | |
43 | | - | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
44 | 61 | | |
45 | 62 | | |
46 | 63 | | |
47 | 64 | | |
48 | | - | |
49 | | - | |
50 | | - | |
51 | | - | |
52 | | - | |
53 | | - | |
54 | | - | |
55 | | - | |
56 | | - | |
57 | | - | |
58 | | - | |
| 65 | + | |
59 | 66 | | |
60 | 67 | | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
61 | 72 | | |
62 | 73 | | |
63 | 74 | | |
64 | 75 | | |
65 | 76 | | |
66 | 77 | | |
67 | | - | |
68 | | - | |
| 78 | + | |
| 79 | + | |
69 | 80 | | |
70 | | - | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
71 | 87 | | |
72 | 88 | | |
73 | 89 | | |
74 | 90 | | |
75 | 91 | | |
76 | 92 | | |
77 | | - | |
78 | | - | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
79 | 98 | | |
80 | | - | |
81 | | - | |
82 | | - | |
| 99 | + | |
83 | 100 | | |
84 | 101 | | |
85 | 102 | | |
| |||
88 | 105 | | |
89 | 106 | | |
90 | 107 | | |
91 | | - | |
92 | | - | |
| 108 | + | |
| 109 | + | |
93 | 110 | | |
94 | 111 | | |
95 | 112 | | |
96 | | - | |
| 113 | + | |
97 | 114 | | |
98 | 115 | | |
99 | | - | |
100 | 116 | | |
101 | 117 | | |
102 | | - | |
| 118 | + | |
103 | 119 | | |
104 | 120 | | |
105 | 121 | | |
| |||
0 commit comments