This PR enables local_load (non-transposed) and local_store lowering to
use the common code in MemoryOpToLLVM for the AMD backend. The only
things left in AMD's MemoryOpToLLVM are the two transposed versions of
local_load.
The following lit tests are fixed for the new path:
- [x] Conversion/amd/async-ops-alias-scopes.mlir. The alias_scope info
is dropped in the new path
- fixed by 1986b4c869bbfbdebb6b7ac71a8a01aaeaaec3e4
- [x] Conversion/amd/ds_transpose.mlir. local_load is lowered to
`predicated_load` without `--convert-builtin-func-to-llvm`
- fixed by 4221b43f58a83ca1b8b50976504a3bc744a764cc
- [x] Conversion/amd/compute-base-ptr.mlir. Note that the legacy AMD
path fails to lower local_load, so it is handled by the common path, and
the new path always uses i8 to compute addresses.
- fixed by e382c2ff6a420dbe725830e5fc64a99cc815f301
- [x] Conversion/amd/tritongpu_to_llvm.mlir. Failure due to
padded_shared layout
- ~~fixed by 0c263fbbea377b23b8a652319027313757afeae6~~ fixed by
cf09a13c3066f21589a1744d8cdb7ec5c6f463b3. The offset calculation needs
to be in bytes.
- [x] Conversion/amd/tritongpu_wmma_dot_to_llvm.mlir.
- fixed by f9386deb1268676c581e38ba693280f2f
- [x] TritonGPU/amd/amd-instruction-sched.mlir. OpAnnotation is dropped
in the new path
- removed by triton-lang/triton#7395
- [x] Conversion/amd/in_thread_transpose.mlir. `{alignment}` info is
dropped
- fixed by
triton-lang/triton@b98d497
- [x] test-allocation.mlir
- fixed by 6c93fd41e83cacf1e74a9485a55b85c7b0daa0f9
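
To illustrate the two notes above about i8 address computation and byte offsets: when the base pointer is treated as a pointer to i8, an element index must be scaled by the element size in bytes before it is added. The sketch below is a minimal illustration only. The helper name `byte_offset` and its parameters are hypothetical and are not part of Triton or this PR.

```python
def byte_offset(elem_index: int, elem_bits: int) -> int:
    """Convert an element index into a byte offset.

    Hypothetical helper: models the scaling needed when addresses are
    computed against an i8-typed base pointer instead of a pointer to
    the element type.
    """
    if elem_bits % 8 != 0:
        # Sub-byte element types would need separate handling;
        # this sketch does not cover them.
        raise ValueError("element width must be a whole number of bytes")
    return elem_index * (elem_bits // 8)


# Element 3 of an f16 buffer: 3 * 2 bytes.
f16_offset = byte_offset(3, 16)
# Element 5 of an f32 buffer: 5 * 4 bytes.
f32_offset = byte_offset(5, 32)
```

With an element-typed pointer the index can be used directly. With an i8 pointer, using the unscaled index would address the wrong location for any element wider than one byte.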
---------
Co-authored-by: Alexander Weinrauch <[email protected]>
Co-authored-by: Nicola Zaghen <[email protected]>