
[AMD][FpSan] Fix dot emulation for MFMA/WMMA encodings #9655

Open
kelesvol wants to merge 1 commit into triton-lang:main from kelesvol:amd-fpsan


kelesvol (Contributor) commented Mar 5, 2026

Use optimized blocked layouts for the emulation tiles instead of the original dot encodings, because the original MFMA/WMMA encodings may impose minimum shape requirements that the small tiles cannot satisfy. Also add barriers around the emulation loops so that scratch writes are visible across warps.
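For readers unfamiliar with the barrier pattern, here is a minimal sketch in plain Python. It is not the code in this PR: Python threads stand in for warps, a nested list stands in for the scratch buffer, and `tiled_dot`, `tile`, and `num_warps` are hypothetical names chosen for illustration. It only shows why a barrier is needed between the loop that writes partial sums to shared scratch and the loop that reads them back.

```python
import threading


def tiled_dot(a, b, tile=2, num_warps=4):
    """Emulate C = A @ B with small K-tiles; each thread stands in for a warp."""
    m, k, n = len(a), len(b), len(b[0])
    # Shared scratch: one partial-sum matrix per (hypothetical) warp.
    scratch = [[[0.0] * n for _ in range(m)] for _ in range(num_warps)]
    out = [[0.0] * n for _ in range(m)]
    barrier = threading.Barrier(num_warps)

    def warp(wid):
        # Each warp accumulates the K-tiles assigned to it round-robin.
        for k0 in range(wid * tile, k, tile * num_warps):
            for i in range(m):
                for j in range(n):
                    for kk in range(k0, min(k0 + tile, k)):
                        scratch[wid][i][j] += a[i][kk] * b[kk][j]
        # Barrier: all scratch writes must finish before any warp reads them.
        barrier.wait()
        # Each warp reduces a disjoint set of rows across every warp's scratch.
        for i in range(wid, m, num_warps):
            for j in range(n):
                out[i][j] = sum(scratch[w][i][j] for w in range(num_warps))
        # Barrier: keep scratch intact until every warp has finished reading.
        barrier.wait()

    threads = [threading.Thread(target=warp, args=(w,)) for w in range(num_warps)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return out
```

Without the first `barrier.wait()`, a warp could reduce rows whose partial sums other warps have not yet written, which is the kind of cross-warp ordering problem the barriers in the description are meant to prevent.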

kelesvol (Contributor, Author) commented Mar 5, 2026

cc @antiagainst @FrederickVu

antiagainst (Member) commented

@pawelszczerbuk can you take a look?
