Skip to content

Commit 2b1a8e7

Browse files
authored
[LLVMGPU] Make FP8 VMFMA intrinsic discoverable by KernelConfig (#19022)
Making VMFMA_F32_16x16x32_F8E4M3FNUZ visible/accessible from `getVirtualIntrinsics` S.T we can eventually pick this intrinsic from KernelConfig. Right now since by default since we always choose the "first" found intrinsic, it is practically guaranteed to never pick the virtual intrinsics by heuristics. Hence we are only specifying the intrinsics through tuning/spec script. However since we are planning to add some heuristics in the future on KernelConfig to be able to select this virtual intrinsic when necessary, we are making it visible now. Signed-off-by: Stanley Winata <[email protected]>
1 parent 1444755 commit 2b1a8e7

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

compiler/src/iree/compiler/Codegen/Dialect/GPU/IR/IREEGPUAttrs.cpp

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -655,6 +655,8 @@ SmallVector<MMAIntrinsic> MMAAttr::getVirtualIntrinsics() const {
655655
return {MMAIntrinsic::VMFMA_F32_16x16x32_F16};
656656
case MMAIntrinsic::MFMA_F32_32x32x8_F16:
657657
return {MMAIntrinsic::VMFMA_F32_32x32x16_F16};
658+
case MMAIntrinsic::MFMA_F32_16x16x32_F8E4M3FNUZ:
659+
return {MMAIntrinsic::VMFMA_F32_16x16x32_F8E4M3FNUZ};
658660
default:
659661
return {};
660662
}

0 commit comments

Comments
 (0)