Commit b285609
[AMD] Always pipeline small loads on RDNA (triton-lang#8063)
On RDNA, we always pipeline through registers and can only check
completion of loads in the order they were dispatched through
s_wait_loadcnt. If we have small loads that are not pipelined, this can
force a wait on pipelined loads as well, negating the benefits of
pipelining.
Co-authored-by: Paul Trojahn <[email protected]>1 parent 5c5ab9f commit b285609
File tree
1 file changed
+2
-1
lines changed- third_party/amd/lib/TritonAMDGPUTransforms
1 file changed
+2
-1
lines changedLines changed: 2 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
555 | 555 | | |
556 | 556 | | |
557 | 557 | | |
558 | | - | |
| 558 | + | |
| 559 | + | |
559 | 560 | | |
560 | 561 | | |
561 | 562 | | |
| |||
0 commit comments