
Conversation

@Yu-Zhewen
Contributor

Summary

This PR generalizes MLIR ukernel generation by introducing a Python preprocessing approach to reduce code duplication and improve maintainability.

Motivation

Before: There are currently 7 hand-written MLIR ukernels, totaling ~3k lines of repetitive, boilerplate-heavy code:

  • iree_uk_amdgpu_dt_matmul_f16.mlir
  • iree_uk_amdgpu_dt_matmul_f8E4M3FNUZ.mlir
  • iree_uk_amdgpu_dt_scaled_matmul_f4E2M1FN.mlir
  • iree_uk_amdgpu_matmul_bf16.mlir
  • iree_uk_amdgpu_matmul_f16.mlir
  • iree_uk_amdgpu_matmul_f8E4M3FN.mlir
  • iree_uk_amdgpu_matmul_f8E4M3FNUZ.mlir

Each new ukernel variant requires manually duplicating an existing file and changing a few parameters (element types, intrinsics, unroll configurations, etc.).

After: This PR introduces a template-based generation system in which only a small set of template files (.mlir.in, ~1k lines at present) needs to be maintained; the concrete ukernels are generated from them.

See TEMPLATE_GUIDE.md for all generation commands.
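
For illustration, a minimal sketch of what such a preprocessing step could look like: a .mlir.in template containing placeholders is expanded into a concrete .mlir ukernel by substituting per-variant parameters. The {{NAME}} placeholder syntax, file names, and parameter names below are hypothetical and only convey the idea; the actual template format and commands are documented in TEMPLATE_GUIDE.md.

```python
#!/usr/bin/env python3
# Hypothetical sketch of .mlir.in template expansion.
# The {{PLACEHOLDER}} syntax and parameter names are illustrative only,
# not the scheme used by this PR; see TEMPLATE_GUIDE.md for the real commands.
import re
from pathlib import Path


def expand_template(template_path: str, params: dict[str, str], out_path: str) -> None:
    """Replace {{KEY}} placeholders in a .mlir.in template and write the .mlir file."""
    text = Path(template_path).read_text()

    def substitute(match: re.Match) -> str:
        key = match.group(1)
        if key not in params:
            raise KeyError(f"unresolved placeholder {{{{{key}}}}} in {template_path}")
        return params[key]

    Path(out_path).write_text(re.sub(r"\{\{(\w+)\}\}", substitute, text))


if __name__ == "__main__":
    # Example: generate an f16 matmul variant from a shared template
    # (file and parameter names are made up for this sketch).
    expand_template(
        "iree_uk_amdgpu_matmul.mlir.in",
        {"ELEM_TYPE": "f16", "INTRINSIC": "MFMA_F32_16x16x16_F16", "UNROLL_M": "8"},
        "iree_uk_amdgpu_matmul_f16.mlir",
    )
```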

Plan

Assisted-by: Cursor AI

Abhishek-Varma and others added 2 commits January 24, 2026 13:03
This commit ports the gfx942 ukernel to gfx950 for data tiling.

Signed-off-by: Abhishek Varma <[email protected]>
Signed-off-by: Yu-Zhewen <[email protected]>
