[ET-VK] Move rotary embedding custom op to be handled via graph pass instead of source transform #13450

SS-JIA · 2025-08-15T05:28:26Z

Stack from ghstack (oldest at bottom):

(to be filled)

Motivation

Be able to test Vulkan lowering via optimum-executorch.

Context

Currently, ET-VK implements rotary embeddings via a custom op. This op is currently inserted into Transformer models by replacing Rotary Embedding modules with a custom module that executes the custom op via a source transform.

The source transform approach makes it cumbersome to lower LLMs to Vulkan, since it requires the export logic to apply the source transform before calling torch.export(). This in turn makes it difficult to integrate Vulkan lowering into optimum-executorch, which tries to use a common export + lowering logic for all lowering paths.

As an alternative, leverage SubgraphMatcher to detect fusable patterns and fuse the rotary embedding graph pattern into the custom op as part of the Vulkan delegate's graph passes. This removes the requirement to apply a custom source transform just for Vulkan.

Changes

Introduce the backends/vulkan/patterns folder to store fusable graph patterns
Introduce a fusable graph pattern for rotary positional embeddings
Update partitioner logic to automatically include nodes that are part of a fusable graph pattern
Introduce a pass to fuse known patterns into custom ops / custom op sequence

Differential Revision: D80293301

…instead of source transform ## Motivation Be able to test Vulkan lowering via optimum-executorch. ## Context Currently, ET-VK implements rotary embeddings via a custom op. This op is currently inserted into Transformer models by replacing Rotary Embedding modules with a custom module that executes the custom op via a source transform. The source transform approach makes it cumbersome to lower LLMs to Vulkan, since it requires the export logic to apply the source transform before calling `torch.export()`. This in turn makes it difficult to integrate Vulkan lowering into optimum-executorch, which tries to use a common export + lowering logic for all lowering paths. As an alternative, leverage `SubgraphMatcher` to detect fusable patterns and fuse the rotary embedding graph pattern into the custom op as part of the Vulkan delegate's graph passes. This removes the requirement to apply a custom source transform just for Vulkan. ## Changes * Introduce the `backends/vulkan/patterns` folder to store fusable graph patterns * Introduce a fusable graph pattern for rotary positional embeddings * Update partitioner logic to automatically include nodes that are part of a fusable graph pattern * Introduce a pass to fuse known patterns into custom ops / custom op sequence Differential Revision: [D80293301](https://our.internmc.facebook.com/intern/diff/D80293301/) [ghstack-poisoned]

pytorch-bot · 2025-08-15T05:28:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13450

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

ROCm CI/CD workflows failing due to : download from https://api.github.com/repos/pytorch/pytorch timed out.

❌ 2 New Failures

As of commit 83b317e with merge base ecb639a ():

NEW FAILURES - The following jobs have failed:

Propose to merge ghstack orig PRs to main / Try to create a PR with ghstack /orig branch (gh)
Process completed with exit code 1.
pull / test-llava-runner-linux / linux-job (gh)
RuntimeError: Command docker exec -t 87e5fe65cba8cf305930d25f45d19ac3e156dfd1cceff61c9c7866292570d8d3 /exec failed with exit code 139

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-08-15T05:29:02Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

SS-JIA requested review from jackzhxng and lucylq as code owners August 15, 2025 05:28

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 15, 2025

SS-JIA closed this Aug 29, 2025

SS-JIA had a problem deploying to cherry-pick-bot August 29, 2025 16:51 — with GitHub Actions Failure

SS-JIA deleted the gh/SS-JIA/286/head branch October 15, 2025 18:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ET-VK] Move rotary embedding custom op to be handled via graph pass instead of source transform #13450

[ET-VK] Move rotary embedding custom op to be handled via graph pass instead of source transform #13450

Uh oh!

SS-JIA commented Aug 15, 2025

Uh oh!

pytorch-bot bot commented Aug 15, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Aug 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[ET-VK] Move rotary embedding custom op to be handled via graph pass instead of source transform #13450

[ET-VK] Move rotary embedding custom op to be handled via graph pass instead of source transform #13450

Uh oh!

Conversation

SS-JIA commented Aug 15, 2025

Motivation

Context

Changes

Uh oh!

pytorch-bot bot commented Aug 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13450

❗ 1 Active SEVs

❌ 2 New Failures

Uh oh!

github-actions bot commented Aug 15, 2025

This PR needs a release notes: label

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

pytorch-bot bot commented Aug 15, 2025 •

edited

Loading

This PR needs a `release notes:` label