Skip to content

Commit 0879a0e

Browse files
[RemoveLayoutConversion] Increase convert layout cost
Signed-off-by: Whitney Tsang <[email protected]>
1 parent a9cd5c7 commit 0879a0e

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

third_party/intel/lib/TritonIntelGPUTransforms/RemoveLayoutConversions.cpp

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1560,7 +1560,9 @@ void LayoutRematerialization::backwardRematerialization(
15601560
// We measure costs in standardised milli-SM-cycles. The smem load
15611561
// and store each cost 8 * convertLayoutBytes, and then we double
15621562
// it to account for extra cost due to synchronisation.
1563-
int64_t convertLayoutCost = 32 * convertLayoutBytes;
1563+
// FIXME: measure cost of smem load/store and synchronisation on Intel GPUs,
1564+
// and refine this model further. (#5476)
1565+
int64_t convertLayoutCost = 32 * convertLayoutBytes * 2;
15641566
int64_t rematerialisationCost = 0;
15651567

15661568
// Evaluate single-use status for every operation in slice

0 commit comments

Comments
 (0)