Skip to content

Commit 2626f2f

Browse files
[PIPELINING] Fix stage for the local_load in the TMA pipelining (#5365)
`local_load` should be in the same stage that the `subview` that it is using.
1 parent 07e1cc6 commit 2626f2f

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

lib/Dialect/TritonGPU/Transforms/Pipeliner/MatmulLoopPipeline.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -272,7 +272,7 @@ createTMAAsyncCopy(scf::ForOp &forOp, tt::ExperimentalDescriptorLoadOp loadOp,
272272

273273
builder.setInsertionPointAfter(viewLoad);
274274
auto sharedLoad = builder.createWithStage<ttg::LocalLoadOp>(
275-
loc, stage, clusterId, loadOp.getType(),
275+
loc, stageForFirstUse, clusterForFirstUse, loadOp.getType(),
276276
viewLoad /*,wait->getResult(0)*/);
277277
auto result = sharedLoad->getResults();
278278
loadOp->replaceAllUsesWith(result);

0 commit comments

Comments
 (0)