You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[SWP] attempt to remove a workaround for a triton llvm codegen bug (#4774)
Triton LLVM codegen has a bug where local_loads from #shared to #mma
layout can lead to invalid code if the loaded shape is smaller than the
mma tile. Remove the workaround.
See triton-lang/triton#3561.
Verified that with test case: https://pastebin.com/xxP3cFmy (test.mlir),
running
triton-opt test.mlir -tritongpu-pipeline=num-stages=3
--convert-scf-to-cf --allocate-shared-memory
--convert-triton-gpu-to-llvm
has no issue.
Unit test case added in triton-lang/triton#4798
also shows no issue.
0 commit comments