Commit 3d0b78e
committed
[Tests-only][NVGPU] Fix nvgpu_arrive syntax in matmulBuilder.py
This patch updates the syntax for nvgpu_arrive Op
in matmulBuilder.py. This fixes the compilation
error for this test.
For the warp-specialized matmul_kernel implementation,
removing the WaitGroupSyncOp (after the mma-main-loop)
fixes the hang observed.
With these two fixes, the test compiles and
executes successfully on an sm90a machine.
Signed-off-by: Durgadoss R <[email protected]>1 parent 9d88543 commit 3d0b78e
File tree
1 file changed
+2
-9
lines changed- mlir/test/Integration/GPU/CUDA/sm90/python/tools
1 file changed
+2
-9
lines changedLines changed: 2 additions & 9 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
568 | 568 | | |
569 | 569 | | |
570 | 570 | | |
571 | | - | |
572 | | - | |
573 | | - | |
| 571 | + | |
574 | 572 | | |
575 | 573 | | |
576 | 574 | | |
| |||
589 | 587 | | |
590 | 588 | | |
591 | 589 | | |
592 | | - | |
593 | | - | |
594 | | - | |
595 | 590 | | |
596 | 591 | | |
597 | | - | |
598 | | - | |
599 | | - | |
| 592 | + | |
600 | 593 | | |
601 | 594 | | |
602 | 595 | | |
| |||
0 commit comments