Skip to content

Commit 7e502a6

Browse files
durga4githubgokulbmishra
authored andcommitted
[MLIR][NVVM] Add support for shared::cta destination (llvm#168056)
This patch adds support for shared::cta as destination space in the TMA non-tensor copy Op (from global to shared::cta). * Appropriate verifier checks are added. * Unit tests are added to verify the lowering. The related intrinsic changes were merged through PR llvm#167508. Signed-off-by: Durgadoss R <[email protected]>
1 parent 752e519 commit 7e502a6

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -249,6 +249,7 @@ LogicalResult CpAsyncBulkGlobalToSharedClusterOp::verify() {
249249
return success();
250250
}
251251

252+
<<<<<<< HEAD
252253
<<<<<<< HEAD
253254
<<<<<<< HEAD
254255
static LogicalResult verifyMBarrierArriveLikeOp(Operation *op, Value addr,
@@ -288,6 +289,8 @@ LogicalResult MBarrierCompleteTxOp::verify() {
288289
>>>>>>> 95aa70cf209a ([MLIR][NVVM] Add support for shared::cta destination (#168056))
289290
=======
290291
>>>>>>> 95aa70cf209a ([MLIR][NVVM] Add support for shared::cta destination (#168056))
292+
=======
293+
>>>>>>> 95aa70cf209a ([MLIR][NVVM] Add support for shared::cta destination (#168056))
291294
LogicalResult ConvertFloatToTF32Op::verify() {
292295
using RndMode = NVVM::FPRoundingMode;
293296
switch (getRnd()) {

0 commit comments

Comments
 (0)