Commit dfcdc27
authored
[Backend] Refactor MMAv5 lowering and put mma_scaled in an if (triton-lang#6478)
This PR refactors the lowering of MMAv5 to share more of the code
between tc_gen5_mma and tc_gen5_mma_scaled, while also applying the same
optimization that tc_gen5_mma has that places the `tcgen05.mma`
instructions in an if block. This was previously checked to improve
performance.1 parent aac457e commit dfcdc27
File tree
4 files changed
+302
-259
lines changed- test/Conversion
- third_party/nvidia/lib/TritonNVIDIAGPUToLLVM/DotOpToLLVM
4 files changed
+302
-259
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
197 | 197 | | |
198 | 198 | | |
199 | 199 | | |
200 | | - | |
201 | | - | |
202 | | - | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
203 | 206 | | |
204 | 207 | | |
205 | 208 | | |
| |||
Lines changed: 5 additions & 5 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
31 | | - | |
| 31 | + | |
32 | 32 | | |
33 | 33 | | |
34 | 34 | | |
| |||
44 | 44 | | |
45 | 45 | | |
46 | 46 | | |
47 | | - | |
| 47 | + | |
48 | 48 | | |
49 | 49 | | |
50 | | - | |
| 50 | + | |
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
| |||
74 | 74 | | |
75 | 75 | | |
76 | 76 | | |
77 | | - | |
| 77 | + | |
78 | 78 | | |
79 | 79 | | |
80 | | - | |
| 80 | + | |
81 | 81 | | |
82 | 82 | | |
83 | 83 | | |
| |||
0 commit comments