Add xnnpack pass to propagate custom meta field to q/dq nodes #14864
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14864
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV. If your PR is affected, please view it below.
❌ 3 New Failures, 3 Unrelated Failures as of commit db612db with merge base d00279d.
BROKEN TRUNK: the following jobs failed but were already failing on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from fa97c1b to bc8f182 (Compare)
Force-pushed from bc8f182 to 9ef58fe (Compare)
    return self.converted_graph.forward(*inputs)

class Quantize_(Stage):
    """
@GregoryComer let me know if I should move this into the test class, instead of the test harness, or rename.
FWIW, I like it here.
Force-pushed from 9ef58fe to f7515a9 (Compare)
Thanks for adding the tester stage! I don't really have a better idea on the name, and it's not part of the public API surface, so I'm good with these changes.
exec = tester.get_artifact()
program_buffer = exec.buffer
self.assertEqual(len(exec._tensor_data), 1)
data_buffer = bytes(exec._tensor_data.pop("model"))
Do we want to assert the size of it? Just to make sure it is indeed the quantized weight tensor. Also I would like (somehow) to validate that we also didn't put this in the blob, perhaps by asserting that the blob size is < weight_size (if we have large-ish weights).
Thanks - I'll add a check on size.
Re validating that it's not in the blob, we can check that forward fails when we do not pass in the data buffer.
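The checks being discussed can be sketched in miniature. This is purely illustrative: `weight_nbytes`, `data_buffer`, and `program_buffer` below are hypothetical stand-ins, not the actual tester artifacts from this PR.

```python
# Hypothetical sketch of the two size checks discussed above.
weight_nbytes = 64 * 64 * 1           # e.g. an int8 (64, 64) quantized weight
data_buffer = b"\x01" * weight_nbytes # what the external .ptd data file would hold
program_buffer = b"\x00" * 1024       # a program blob that excludes the weight

# 1) The external data buffer should match the quantized weight's size exactly.
assert len(data_buffer) == weight_nbytes
# 2) The program blob should be smaller than the weight, which suggests the
#    weight was not also serialized into the blob.
assert len(program_buffer) < len(data_buffer)
```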
@digantdesai added a check on size, and check on accuracy.
Verified locally that we're missing the weight if we do not pass in the data buffer. The test segfaults after ~4 runs though, maybe something isn't cleaned up properly in pybindings. Left that as a todo for now.
Thanks @lucylq. I hope we have quantized LoRA working OK with XNNPACK now?
@digantdesai Yes! Thanks a lot for the help. There were a few issues with export_llama and the emitter, but I was able to generate quantized LoRA files. They work well on desktop; I still need to debug load issues on Android.
Force-pushed from 714f81a to 0c08ac7 (Compare)
Force-pushed from 0c08ac7 to db612db (Compare)
Summary
Enable quantization with program-data separation.
To select weights for separation, we tag nodes on the eager model. After quantization, q/dq nodes are generated; these do not carry the external tags that their inputs have. This PR propagates the tags to the q/dq nodes so that quantized weights are moved to the external file and can be shared.
Test plan
```
python -m unittest executorch.backends.xnnpack.test.passes.test_propagate_custom_meta_pass
```
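The core idea of the pass — copying a custom meta field from tagged inputs onto newly created q/dq nodes — can be sketched with torch.fx. This is a minimal illustration, not the actual XNNPACK pass: `quant`/`dequant` are stand-ins for the real quantize/dequantize ops, and the `"custom"` key and `"delegate_constant_tag"` value are assumed names for illustration.

```python
import torch
import torch.fx as fx

@fx.wrap
def quant(x):    # stand-in for a quantize op inserted by the quantizer
    return x

@fx.wrap
def dequant(x):  # stand-in for the matching dequantize op
    return x

def propagate_custom_meta(gm: fx.GraphModule, key: str = "custom") -> fx.GraphModule:
    # Walk nodes in topological order; each q/dq node inherits the tagged
    # meta entry from its first input if it does not already have one.
    # Chains (weight -> quant -> dequant) are handled because quant is
    # tagged before dequant is visited.
    for node in gm.graph.nodes:
        if node.op == "call_function" and node.target in (quant, dequant):
            src = node.args[0]
            if isinstance(src, fx.Node) and key in src.meta:
                node.meta.setdefault(key, src.meta[key])
    return gm

class M(torch.nn.Module):
    def forward(self, w):
        return dequant(quant(w))

gm = fx.symbolic_trace(M())
# Tag the placeholder, standing in for a weight selected for separation.
for n in gm.graph.nodes:
    if n.op == "placeholder":
        n.meta["custom"] = {"delegate_constant_tag": "model"}

propagate_custom_meta(gm)
tags = [n.meta.get("custom") for n in gm.graph.nodes if n.op == "call_function"]
print(tags)
```

After the pass, both q/dq nodes carry the same tag as the weight they consume, which is what allows downstream serialization to route the quantized weight into the external data file.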