Skip to content

Commit 9ce44c8

Browse files
[PIPELINE] AssignLatencies for mmav5 (#6077)
Implement and test assigning latencies for mmav5 ops (regular and scaled). This PR also features slight refactor of AssignLatencies code with putting load-related and mma-related code in separate classes for maintainability. Latency assignment for mma ops is enabled only for tests now, disabled in normal flow, as scheduling and lowering is not implemented yet. --------- Co-authored-by: Thomas Raoux <[email protected]>
1 parent cb78503 commit 9ce44c8

File tree

4 files changed

+569
-172
lines changed

4 files changed

+569
-172
lines changed

include/triton/Dialect/TritonGPU/Transforms/Schedule.h

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,7 @@ namespace gpu {
1515

1616
/// Discover operations that should become async and assign latencies to them
1717
/// based on the numStages value provided by the user.
18-
void assignLatencies(ModuleOp moduleOp, int numStages);
18+
void assignLatencies(ModuleOp moduleOp, int numStages, bool assignMMA = false);
1919

2020
/// Schedule the loops based on the latencies assigned to the operations.
2121
void scheduleLoops(ModuleOp moduleOp);

0 commit comments

Comments
 (0)