You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/ARCHITECTURE.md
+15-15Lines changed: 15 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -314,7 +314,7 @@ Example 1, a row-major coalesced layout may partition a 16x16 tensor over 2 warp
314
314
```
315
315
for
316
316
```
317
-
#triton_gpu.blocked_layout<{
317
+
#ttg.blocked_layout<{
318
318
sizePerThread = {2, 2}
319
319
threadsPerWarp = {8, 4}
320
320
warpsPerCTA = {1, 2}
@@ -341,7 +341,7 @@ Example 2, a row-major coalesced layout may partition a 32x32 tensor over 2 warp
341
341
```
342
342
for
343
343
```
344
-
#triton_gpu.blocked_layout<{
344
+
#ttg.blocked_layout<{
345
345
sizePerThread = {2, 2}
346
346
threadsPerWarp = {8, 4}
347
347
warpsPerCTA = {1, 2}
@@ -373,7 +373,7 @@ CTA [1,0] CTA [1,1]
373
373
```
374
374
for
375
375
```
376
-
#triton_gpu.blocked_layout<{
376
+
#ttg.blocked_layout<{
377
377
sizePerThread = {2, 2}
378
378
threadsPerWarp = {8, 4}
379
379
warpsPerCTA = {1, 2}
@@ -403,25 +403,25 @@ A single dot operator is likely to be mapped to multiple MMA instructions. For N
403
403
### Layout conversion
404
404
To produce the desired memory behavior described in the previous section, triton GPU introduces layouts conversion (by means of ConvertLayoutOp). An input tensor represented in a blocked layout is sliced and inserted into a shared layout, e.g.:
The main loop of the GEMM would then extract a slice (a reimplementation of tensor.extract_slice [25]) from the shared memory, converting arguments to the dot layout and producing mma layout with the dot operator, e.g.:
The main loop of the GEMM would then extract a slice (a reimplementation of tensor.extract_slice [c25]_) from the shared memory, converting arguments to the dot layout and producing mma layout with the dot operator, for example:
0 commit comments