Commit 6e1dafa
[release/3.4] "[BACKEND] Workaround for ptxas bug in matrix descriptor arithmetic (triton-lang#7197)" (triton-lang#7389)
The previous code sequence was hitting a bug in ptxas that caused
Emiting this new code sequence should be cheaper and saves us from
hitting the ptxas bug.
Co-authored-by: Thomas Raoux <[email protected]>1 parent ae84826 commit 6e1dafa
File tree
1 file changed
+4
-8
lines changed- third_party/nvidia/lib/TritonNVIDIAGPUToLLVM/DotOpToLLVM
1 file changed
+4
-8
lines changedLines changed: 4 additions & 8 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
151 | 151 | | |
152 | 152 | | |
153 | 153 | | |
154 | | - | |
155 | | - | |
156 | | - | |
157 | | - | |
158 | | - | |
159 | | - | |
160 | | - | |
161 | | - | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
162 | 158 | | |
163 | 159 | | |
164 | 160 | | |
| |||
0 commit comments