Skip to content

Commit e5d0ec4

Browse files
committed
more fixes && new 'rank' function for 'IntelDPASLayout'
Signed-off-by: Anatoly Myachev <[email protected]>
1 parent 0de3a86 commit e5d0ec4

File tree

2 files changed

+5
-1
lines changed

2 files changed

+5
-1
lines changed

python/test/unit/intel/test_core.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -299,7 +299,7 @@ def test_reduce_layouts(M, N, src_layout, axis, epilogue_kind, dtype_str, add_ov
299299
warps = warps_per_cta(src_layout, [M, N])
300300
num_warps = int(np.prod(warps))
301301
blocked = BlockedLayout([1, 1], [32, THREADS_PER_WARP // 32], [4, num_warps // 4], [0, 1])
302-
one_d_layout = BlockedLayout([1], [THREADS_PER_WARP], [num_warps], [0], [1], [1], [0])
302+
one_d_layout = BlockedLayout([1], [THREADS_PER_WARP], [num_warps], [0])
303303

304304
expanded_shape = f"1x{N}" if axis == 0 else f"{M}x1"
305305
other_axis = 1 - axis

python/triton/experimental/gluon/language/intel/_layouts.py

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -84,3 +84,7 @@ def __hash__(self):
8484
self.threads_per_warp,
8585
tuple(self.cta_order),
8686
))
87+
88+
@property
89+
def rank(self):
90+
return len(self.warps_per_cta)

0 commit comments

Comments
 (0)