Commit d4d2123
committed
Remove rotation semantic in gpu.shufflw up/down
There is no such semantic in SPIRV OpGroupNonUniformShuffleUp and
OpGroupNonUniformShuffleDown. In addition, there is no such semantic in
NVVM shfl intrinsics.
Refer to NVVM IR spec
https://docs.nvidia.com/cuda/archive/12.2.1/nvvm-ir-spec/index.html#data-movement
"If the computed source lane index j is in range, the returned i32 value
will be the value of %a from lane j; otherwise, it will be the the value
of %a from the current thread. If the thread corresponding to lane j is
inactive, then the returned i32 value is undefined."1 parent cfe6c2d commit d4d2123
File tree
3 files changed
+38
-21
lines changed- mlir
- include/mlir/Dialect/GPU/IR
- lib/Conversion/GPUToSPIRV
- test/Conversion/GPUToSPIRV
3 files changed
+38
-21
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1332 | 1332 | | |
1333 | 1333 | | |
1334 | 1334 | | |
1335 | | - | |
| 1335 | + | |
| 1336 | + | |
1336 | 1337 | | |
1337 | 1338 | | |
1338 | 1339 | | |
| |||
1341 | 1342 | | |
1342 | 1343 | | |
1343 | 1344 | | |
1344 | | - | |
| 1345 | + | |
| 1346 | + | |
1345 | 1347 | | |
1346 | 1348 | | |
1347 | 1349 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
430 | 430 | | |
431 | 431 | | |
432 | 432 | | |
433 | | - | |
434 | | - | |
435 | 433 | | |
436 | | - | |
| 434 | + | |
437 | 435 | | |
438 | | - | |
| 436 | + | |
439 | 437 | | |
440 | 438 | | |
441 | 439 | | |
| |||
453 | 451 | | |
454 | 452 | | |
455 | 453 | | |
456 | | - | |
457 | | - | |
458 | | - | |
| 454 | + | |
| 455 | + | |
459 | 456 | | |
460 | | - | |
461 | | - | |
462 | | - | |
463 | | - | |
464 | | - | |
465 | | - | |
| 457 | + | |
| 458 | + | |
| 459 | + | |
466 | 460 | | |
467 | 461 | | |
468 | | - | |
469 | 462 | | |
470 | 463 | | |
471 | 464 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
29 | 52 | | |
30 | 53 | | |
31 | 54 | | |
| |||
54 | 77 | | |
55 | 78 | | |
56 | 79 | | |
57 | | - | |
| 80 | + | |
58 | 81 | | |
59 | 82 | | |
60 | 83 | | |
| |||
70 | 93 | | |
71 | 94 | | |
72 | 95 | | |
73 | | - | |
| 96 | + | |
74 | 97 | | |
75 | 98 | | |
76 | 99 | | |
| |||
82 | 105 | | |
83 | 106 | | |
84 | 107 | | |
85 | | - | |
| 108 | + | |
86 | 109 | | |
87 | 110 | | |
88 | 111 | | |
| |||
98 | 121 | | |
99 | 122 | | |
100 | 123 | | |
101 | | - | |
102 | | - | |
| 124 | + | |
103 | 125 | | |
104 | 126 | | |
105 | 127 | | |
| |||
0 commit comments