Commit ab34c3a
authored
[AMD] Support multi-cta and multicast for TDM operations (#8790)
Adds support for multi-cta TDM load and stores and sets the multicast
mask based on the `CGALayout`. Similar to `tt.load` and
`ttg.async_copy_global_to_local`, multicast is automatically enabled if
the `CGALayout` contains broadcasting bases.1 parent 9c2cefd commit ab34c3a
File tree
7 files changed
+254
-33
lines changed- lib/Conversion/TritonGPUToLLVM
- test/Conversion/amd
- third_party/amd
- lib/TritonAMDGPUToLLVM
- python/test
7 files changed
+254
-33
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
28 | | - | |
| 28 | + | |
| 29 | + | |
29 | 30 | | |
30 | 31 | | |
31 | 32 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
49 | 49 | | |
50 | 50 | | |
51 | 51 | | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
Lines changed: 43 additions & 16 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
482 | 482 | | |
483 | 483 | | |
484 | 484 | | |
485 | | - | |
| 485 | + | |
| 486 | + | |
486 | 487 | | |
487 | 488 | | |
488 | 489 | | |
| |||
514 | 515 | | |
515 | 516 | | |
516 | 517 | | |
517 | | - | |
| 518 | + | |
518 | 519 | | |
519 | 520 | | |
520 | 521 | | |
| |||
909 | 910 | | |
910 | 911 | | |
911 | 912 | | |
| 913 | + | |
912 | 914 | | |
913 | 915 | | |
914 | | - | |
| 916 | + | |
915 | 917 | | |
916 | 918 | | |
917 | 919 | | |
| |||
1047 | 1049 | | |
1048 | 1050 | | |
1049 | 1051 | | |
| 1052 | + | |
1050 | 1053 | | |
1051 | 1054 | | |
1052 | | - | |
| 1055 | + | |
1053 | 1056 | | |
1054 | 1057 | | |
1055 | 1058 | | |
| |||
1123 | 1126 | | |
1124 | 1127 | | |
1125 | 1128 | | |
| 1129 | + | |
1126 | 1130 | | |
| 1131 | + | |
1127 | 1132 | | |
1128 | 1133 | | |
1129 | 1134 | | |
1130 | 1135 | | |
1131 | 1136 | | |
| 1137 | + | |
1132 | 1138 | | |
1133 | 1139 | | |
| 1140 | + | |
| 1141 | + | |
| 1142 | + | |
| 1143 | + | |
| 1144 | + | |
| 1145 | + | |
| 1146 | + | |
| 1147 | + | |
1134 | 1148 | | |
1135 | | - | |
1136 | | - | |
1137 | | - | |
1138 | | - | |
1139 | | - | |
1140 | 1149 | | |
1141 | 1150 | | |
1142 | 1151 | | |
| |||
1165 | 1174 | | |
1166 | 1175 | | |
1167 | 1176 | | |
1168 | | - | |
1169 | | - | |
1170 | | - | |
1171 | | - | |
| 1177 | + | |
| 1178 | + | |
| 1179 | + | |
| 1180 | + | |
| 1181 | + | |
| 1182 | + | |
| 1183 | + | |
| 1184 | + | |
| 1185 | + | |
| 1186 | + | |
| 1187 | + | |
1172 | 1188 | | |
1173 | 1189 | | |
1174 | 1190 | | |
| |||
1196 | 1212 | | |
1197 | 1213 | | |
1198 | 1214 | | |
| 1215 | + | |
1199 | 1216 | | |
1200 | 1217 | | |
1201 | 1218 | | |
| |||
1214 | 1231 | | |
1215 | 1232 | | |
1216 | 1233 | | |
| 1234 | + | |
| 1235 | + | |
| 1236 | + | |
| 1237 | + | |
| 1238 | + | |
| 1239 | + | |
| 1240 | + | |
| 1241 | + | |
| 1242 | + | |
1217 | 1243 | | |
1218 | | - | |
| 1244 | + | |
1219 | 1245 | | |
1220 | | - | |
1221 | | - | |
| 1246 | + | |
| 1247 | + | |
| 1248 | + | |
1222 | 1249 | | |
1223 | 1250 | | |
1224 | 1251 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
| 3 | + | |
3 | 4 | | |
4 | 5 | | |
5 | 6 | | |
| |||
365 | 366 | | |
366 | 367 | | |
367 | 368 | | |
368 | | - | |
| 369 | + | |
| 370 | + | |
369 | 371 | | |
370 | 372 | | |
371 | 373 | | |
| |||
408 | 410 | | |
409 | 411 | | |
410 | 412 | | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
| 420 | + | |
| 421 | + | |
| 422 | + | |
| 423 | + | |
| 424 | + | |
| 425 | + | |
411 | 426 | | |
412 | 427 | | |
413 | 428 | | |
| |||
453 | 468 | | |
454 | 469 | | |
455 | 470 | | |
| 471 | + | |
| 472 | + | |
456 | 473 | | |
457 | 474 | | |
458 | 475 | | |
| |||
501 | 518 | | |
502 | 519 | | |
503 | 520 | | |
504 | | - | |
| 521 | + | |
| 522 | + | |
| 523 | + | |
505 | 524 | | |
506 | 525 | | |
507 | 526 | | |
| |||
514 | 533 | | |
515 | 534 | | |
516 | 535 | | |
517 | | - | |
518 | | - | |
519 | | - | |
520 | | - | |
| 536 | + | |
| 537 | + | |
| 538 | + | |
| 539 | + | |
521 | 540 | | |
522 | 541 | | |
523 | 542 | | |
| |||
535 | 554 | | |
536 | 555 | | |
537 | 556 | | |
538 | | - | |
539 | | - | |
540 | | - | |
541 | | - | |
| 557 | + | |
| 558 | + | |
| 559 | + | |
| 560 | + | |
542 | 561 | | |
543 | 562 | | |
544 | 563 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
54 | | - | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
55 | 57 | | |
56 | 58 | | |
57 | 59 | | |
| |||
Lines changed: 6 additions & 5 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
30 | | - | |
31 | | - | |
| 30 | + | |
| 31 | + | |
32 | 32 | | |
33 | 33 | | |
34 | | - | |
| 34 | + | |
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
| |||
46 | 46 | | |
47 | 47 | | |
48 | 48 | | |
49 | | - | |
| 49 | + | |
50 | 50 | | |
| 51 | + | |
51 | 52 | | |
52 | 53 | | |
53 | 54 | | |
54 | | - | |
| 55 | + | |
55 | 56 | | |
56 | 57 | | |
57 | 58 | | |
| |||
0 commit comments