Commit 5f5feb0

[AccelerateMatmul] Sync from upstream
Signed-off-by: Whitney Tsang <[email protected]>
1 parent 5bab216 commit 5f5feb0

1 file changed, +4 -1 lines changed

third_party/intel/lib/TritonIntelGPUTransforms/AccelerateMatmul.cpp

Lines changed: 4 additions & 1 deletion
@@ -421,7 +421,10 @@ class DecomposeScaledBlocked : public OpRewritePattern<tt::DotScaledOp> {
     if (!scale)
       return v;

-    return rewriter.create<ttg::UpcastMXFPOp>(v.getLoc(), v, scale, elemType);
+    auto retTy = triton::gpu::UpcastMXFPOp::deduceOutputType(
+        v, elemType, Builder(v.getContext()).getBF16Type());
+    return rewriter.create<ttg::UpcastMXFPOp>(v.getLoc(), retTy, v, scale,
+                                              elemType);
   }
 };
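
The change above computes the result type explicitly, with bf16 as the output element type, instead of letting the op infer it. The following standalone sketch illustrates the kind of rule such a deduction step might apply. It is not the Triton implementation: `ElemKind`, `DeducedType`, and `deduceOutputTypeSketch` are hypothetical names, and the packing rule (E2M1 values stored two per byte along the last dimension, so the result doubles that dimension) is an assumption made for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for the scaled-dot element kinds; not Triton's enum.
enum class ElemKind { E2M1, E4M3, E5M2 };

// Hypothetical result descriptor: the deduced shape plus an element-type flag.
struct DeducedType {
  std::vector<int64_t> shape;
  bool isBF16;
};

// Assumption: E2M1 (fp4) values are packed two per byte along the last
// dimension, so unpacking doubles that dimension. The fp8 kinds keep the
// input shape. The element type is always bf16, mirroring the diff.
DeducedType deduceOutputTypeSketch(const std::vector<int64_t> &inShape,
                                   ElemKind kind) {
  assert(!inShape.empty() && "expected a ranked, non-scalar tensor shape");
  DeducedType out{inShape, /*isBF16=*/true};
  if (kind == ElemKind::E2M1)
    out.shape.back() *= 2;
  return out;
}
```

Under these assumptions, a packed 128x32 E2M1 input would yield a 128x64 bf16 result, while an E4M3 input of the same shape would stay 128x32. Which dimension is packed in the real pass depends on the operand layout, so treat the last-dimension choice here as illustrative only.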
