Commit da23473
[6/N][TLX-2cta] Codegen for remote barrier arrive (#647)
Summary:
When the barrier object is in cluster SMEM space, we should lower to arrive instruction with `shared::cluster` state space.
The input bar typically is a result of a mapa op which was landed earlier in the stack.
```
% make test-lit
ninja -C /data/users/pchen7e4/triton/build/cmake.linux-x86_64-cpython-3.11 check-triton-lit-tests
ninja: Entering directory `/data/users/pchen7e4/triton/build/cmake.linux-x86_64-cpython-3.11'
[0/1] Running the triton regression tests
Testing Time: 9.11s
Total Discovered Tests: 208
Passed : 207 (99.52%)
Expectedly Failed: 1 (0.48%)
% third_party/tlx/run_all.sh
Hello! (Facebook-only)
Need to build triton in this script? {y|n}n
Run all LITs? {y|n}n
Run core Triton python unit tests? {y|n}n
Run all TLX unit tests? {y|n}y
Running TLX Unit Tests
=========================================================================================== test session starts ============================================================================================
platform linux -- Python 3.11.13, pytest-8.3.4, pluggy-1.5.0
rootdir: /data/users/pchen7e4/triton
configfile: pyproject.toml
plugins: xdist-3.7.0, forked-1.6.0, typeguard-4.3.0
collected 109 items
python/test/unit/language/test_tlx.py ...sssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssss.............s................. [100%]
===================================================================================== 33 passed, 76 skipped in 50.92s ======================================================================================
Run TLX tutorial kernels (correctness|performance|no)? {c|p|n}
c
Verifying correctness of TLX tutorial kernels
=========================================================================================== test session starts ============================================================================================
platform linux -- Python 3.11.13, pytest-8.3.4, pluggy-1.5.0
rootdir: /data/users/pchen7e4/triton
configfile: pyproject.toml
plugins: xdist-3.7.0, forked-1.6.0, typeguard-4.3.0
collected 17 items
third_party/tlx/tutorials/amd-gemm-pipelined.py s [ 5%]
third_party/tlx/tutorials/blackwell-fa-ws-persistent_test.py . [ 11%]
third_party/tlx/tutorials/blackwell-fa-ws-pipelined-persistent_test.py . [ 17%]
third_party/tlx/tutorials/blackwell-fa-ws-pipelined_test.py . [ 23%]
third_party/tlx/tutorials/blackwell-fa-ws_test.py . [ 29%]
third_party/tlx/tutorials/blackwell-gemm-clc.py . [ 35%]
third_party/tlx/tutorials/blackwell-gemm-pipelined.py . [ 41%]
third_party/tlx/tutorials/blackwell-gemm-ws.py . [ 47%]
third_party/tlx/tutorials/blackwell-grouped-gemm.py . [ 52%]
third_party/tlx/tutorials/hopper-fa-ws-pipelined-pingpong_test.py s [ 58%]
third_party/tlx/tutorials/hopper-fa-ws-pipelined_test.py s [ 64%]
third_party/tlx/tutorials/hopper-fa-ws_test.py s [ 70%]
third_party/tlx/tutorials/hopper-gemm-pipelined_test.py s [ 76%]
third_party/tlx/tutorials/hopper-gemm-ws_test.py s [ 82%]
third_party/tlx/tutorials/hopper-persistent-gemm-ws-cooperative.py s [ 88%]
third_party/tlx/tutorials/hopper-persistent-gemm-ws-pingpong.py s [ 94%]
third_party/tlx/tutorials/vector-add2.py . [100%]
...
=========================================================================== 9 passed, 8 skipped, 4 warnings in 126.92s (0:02:06) ===============
```
Pull Request resolved: #647
Reviewed By: htyu
Differential Revision: D86467280
Pulled By: pchen7e2
fbshipit-source-id: 8795aeb097d8fc284bd8fba84de994d88167adf71 parent 6f77f69 commit da23473
File tree
2 files changed
+21
-1
lines changed- test/Conversion
- third_party/nvidia/lib/TritonNVIDIAGPUToLLVM
2 files changed
+21
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
79 | 79 | | |
80 | 80 | | |
81 | 81 | | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
82 | 89 | | |
83 | 90 | | |
84 | 91 | | |
| |||
Lines changed: 14 additions & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
| 36 | + | |
| 37 | + | |
36 | 38 | | |
37 | 39 | | |
38 | 40 | | |
| |||
234 | 236 | | |
235 | 237 | | |
236 | 238 | | |
| 239 | + | |
| 240 | + | |
| 241 | + | |
| 242 | + | |
| 243 | + | |
| 244 | + | |
237 | 245 | | |
238 | 246 | | |
239 | | - | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
| 250 | + | |
| 251 | + | |
| 252 | + | |
240 | 253 | | |
241 | 254 | | |
242 | 255 | | |
| |||
0 commit comments