@bjacob (Contributor) commented on Jun 18, 2025

This implements the fix suggested by @heshuju in #4108.

It fixes an issue that was blocking the LLVM integrate in IREE: https://github.com/iree-org/iree/actions/runs/15626565095/job/44156258415?pr=21092

Signed-off-by: Benoit Jacob <[email protected]>
@rsuderman (Contributor) commented
We should change this at the reduce op variants level instead.

"torch.aten._scaled_dot_product_flash_attention_for_cpu",

Instead of matching torch.aten._scaled_dot_product_flash_attention_for_cpu, match aten._scaled_dot_product_flash_attention_for_cpu, or preferably match both; I am guessing the ingestion can generate both forms.

The test above fails because it assumes a torch prefix on the operator name.
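A minimal sketch of the prefix-insensitive matching suggested above, assuming a hypothetical helper (matchesAtenOp is illustrative, not an existing torch-mlir API): strip an optional torch. prefix before comparing, so both spellings the ingestion might generate are accepted.

```cpp
#include <iostream>
#include <string_view>

// Hypothetical helper: returns true if `opName` names the given ATen
// operator, accepting both the "torch."-prefixed and unprefixed spellings.
static bool matchesAtenOp(std::string_view opName, std::string_view atenName) {
  constexpr std::string_view kPrefix = "torch.";
  if (opName.substr(0, kPrefix.size()) == kPrefix)
    opName.remove_prefix(kPrefix.size());
  return opName == atenName;
}

int main() {
  constexpr std::string_view target =
      "aten._scaled_dot_product_flash_attention_for_cpu";
  // Both spellings should match against the unprefixed name.
  std::cout << matchesAtenOp(
                   "torch.aten._scaled_dot_product_flash_attention_for_cpu",
                   target)
            << "\n"; // 1
  std::cout << matchesAtenOp(target, target) << "\n"; // 1
}
```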
