Commit 187ea27
[AMD]Enable Transposed Mfma Layout For Fp8 (#7301)
The transposed mfma layout for fp8/bf8 was previously disabled because of a
regression. However, a recent report shows that the non-transposed mfma
layout generates extra `convert_layout` ops before `split`, which
consume extra LDS.
Experiments show that the regression is gone, so it is fine to re-enable
the transposed layout.

1 parent e71689d, commit 187ea27
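To illustrate why the layout choice matters for `split`, here is a minimal toy model in Python. It is a sketch under stated assumptions, not code from this commit: it assumes a single 32x32 output tile spread over 64 lanes, where each lane holds groups of 4 consecutive elements along one axis, and it assumes `split` needs each adjacent pair along the last dimension to sit in the same lane. The names `owner_lane` and `pairs_stay_in_one_lane` are hypothetical.

```python
from collections import Counter


def owner_lane(row, col, transposed):
    """Lane holding element (row, col) of a 32x32 tile in this toy model."""
    # Assumption: along the "register" axis a lane holds groups of 4
    # consecutive elements; along the other axis it owns a single index.
    # Non-transposed: registers run down a column (rows are the register axis).
    # Transposed: registers run along a row (columns are the register axis).
    reg_axis, lane_axis = (col, row) if transposed else (row, col)
    return 32 * ((reg_axis // 4) % 2) + lane_axis


def pairs_stay_in_one_lane(transposed, size=32):
    """True if every pair (col, col + 1), col even, lives in a single lane."""
    return all(
        owner_lane(r, c, transposed) == owner_lane(r, c + 1, transposed)
        for r in range(size)
        for c in range(0, size, 2)
    )


def elements_per_lane(transposed, size=32):
    """Count how many tile elements each lane owns."""
    return Counter(
        owner_lane(r, c, transposed) for r in range(size) for c in range(size)
    )


if __name__ == "__main__":
    for transposed in (False, True):
        name = "transposed" if transposed else "non-transposed"
        print(name, "pairs in one lane:", pairs_stay_in_one_lane(transposed))
```

In this model the transposed layout keeps each adjacent pair along the last dimension inside one lane, while the non-transposed layout spreads the pair across two lanes. That is one plausible reading of why a `convert_layout` (and the LDS it uses) is needed before `split` in the non-transposed case.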
File tree
1 file changed, 2 additions and 7 deletions
- third_party/amd/lib/TritonAMDGPUTransforms
Diff summary (line contents were not captured):

| Original file lines | New file lines | Change |
|---|---|---|
| 451-453 | 451-453 | context |
| 454-455 | 454 | 2 lines removed, 1 line added |
| 456 | 455 | context |
| 457-460 | | 4 lines removed |
| 461-463 | 456-458 | context |
| 464 | 459 | 1 line removed, 1 line added |
| 465-467 | 460-462 | context |