Add TorchAO wrapper config to allow filter_fn for quantize_ #13264

abhinaykukkadapu · 2025-08-10T03:45:04Z

Changes:

Support a filter function in `quantize_` when quantizing with torchao.
Update unit tests accordingly.
Use ComposableQuantizer when there are multiple quantizers of the torchao type; pass legacy quantizers directly to prepare_pt2e.
The source transform modifies the model in place, so deep copy the model first to avoid modifying the user-provided model.
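
The filter-function and deep-copy points can be illustrated with a minimal, library-free sketch. `Layer`, `ToyModel`, `toy_quantize_`, and `apply_source_transform` are hypothetical stand-ins, not ExecuTorch or torchao APIs; the filter callback is assumed to take a module and its fully qualified name and return a bool.

```python
import copy
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class Layer:
    """Hypothetical stand-in for a model submodule."""
    kind: str
    quantized: bool = False


@dataclass
class ToyModel:
    """Hypothetical stand-in for a model: fully qualified name -> layer."""
    layers: Dict[str, Layer] = field(default_factory=dict)


# Assumed callback shape: (module, fully_qualified_name) -> bool
FilterFn = Callable[[Layer, str], bool]


def toy_quantize_(model: ToyModel, filter_fn: Optional[FilterFn] = None) -> None:
    """Mutate the model in place, touching only layers accepted by filter_fn."""
    for fqn, layer in model.layers.items():
        if filter_fn is None or filter_fn(layer, fqn):
            layer.quantized = True


def apply_source_transform(
    model: ToyModel, filter_fn: Optional[FilterFn] = None
) -> ToyModel:
    """Deep copy first so the caller's model is never modified."""
    transformed = copy.deepcopy(model)
    toy_quantize_(transformed, filter_fn)
    return transformed


user_model = ToyModel(
    {
        "encoder.linear": Layer("linear"),
        "head.linear": Layer("linear"),
        "norm": Layer("layernorm"),
    }
)


def only_encoder_linears(layer: Layer, fqn: str) -> bool:
    """Example filter: quantize linear layers under the 'encoder.' prefix only."""
    return layer.kind == "linear" and fqn.startswith("encoder.")


result = apply_source_transform(user_model, only_encoder_linears)
```

Only `encoder.linear` is marked as quantized in `result`, and `user_model` is left untouched because the transform ran on a copy.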

abhinaykukkadapu · 2025-08-10T03:45:06Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2025-08-10T03:45:07Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13264

❌ 4 New Failures, 7 Pending, 1 Unrelated Failure

As of commit 482f3d6 with merge base 0e76a97:

NEW FAILURES - The following jobs have failed:

- Build documentation / build (buck2) / Build doc (gh)
  At least one of the pre-conditions you specified did not hold
- pull / unittest / linux / linux-job (gh)
  examples/models/llama/tests/test_ring_attention.py::TestRingAttention::test_single_token_processing_quantized
- pull / unittest-arm-backend-with-no-fvp (test_pytest_ops) / linux-job (gh)
  RuntimeError: Command docker exec -t 0fef128d2b1e80a34c216e4ffd2962173404fb72fa3966be820495d2d444809c /exec failed with exit code 1
- pull / unittest-editable / linux / linux-job (gh)
  examples/models/llama/tests/test_ring_attention.py::TestRingAttention::test_single_token_processing_quantized

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

- pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh) (trunk failure)
  backends/arm/test/models/stable_diffusion/test_vae_AutoencoderKL.py::TestAutoencoderKL::test_AutoencoderKL_tosa_MI

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-08-10T03:45:43Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with `release notes:`. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

digantdesai · 2025-08-12T12:12:14Z

backends/xnnpack/test/recipes/test_xnnpack_recipes.py

+        eager_quantized_model = source_transform_output.data["forward"]
+        output = session.run_method("forward", example_inputs[0])[0]
+        expected = eager_quantized_model(*example_inputs[0])
+        self.assertTrue(torch.allclose(output, expected, atol=atol))


You might want to print more stats if this fails - see https://github.com/pytorch/executorch/blob/main/backends/test/harness/tester.py#L337

digantdesai · 2025-08-12T12:12:58Z

backends/xnnpack/test/recipes/test_xnnpack_recipes.py

-                            atol=1e-1,
-                        )
+                    self._compare_eager_quantized_model_outputs(
+                        session, example_inputs, 1e-1


atol? Why is this so high for two linears?

Suggested change

-                        session, example_inputs, 1e-1
+                        session, example_inputs, atol=1e-1

Yeah, I think 1e-2 works on my Mac; I will check whether Linux passes on CI. Nevertheless, I'm updating the tests, similar to CoreML (let me know if there is any objection), to use SQNR to compare the eager model output against the lowered model output.

The post-quantization model and the lowered model will still be compared with tolerance checks.
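
For reference, the two comparison styles discussed in this thread can be sketched in plain Python. This is an illustrative sketch, not the test code from this PR: `sqnr_db`, `all_close`, `max_abs_error`, and `check_outputs` are hypothetical helpers, the 20 dB threshold is an arbitrary assumption, and `all_close` checks absolute tolerance only (unlike `torch.allclose`, which also applies a relative tolerance).

```python
import math
from typing import Sequence


def sqnr_db(reference: Sequence[float], actual: Sequence[float]) -> float:
    """Signal-to-quantization-noise ratio in dB: 10 * log10(signal / noise)."""
    signal = sum(r * r for r in reference)
    noise = sum((r - a) ** 2 for r, a in zip(reference, actual))
    if noise == 0.0:
        return math.inf  # identical outputs
    if signal == 0.0:
        return -math.inf  # all-zero reference with a non-zero error
    return 10.0 * math.log10(signal / noise)


def max_abs_error(reference: Sequence[float], actual: Sequence[float]) -> float:
    """Largest element-wise absolute difference."""
    return max(abs(r - a) for r, a in zip(reference, actual))


def all_close(reference: Sequence[float], actual: Sequence[float], atol: float) -> bool:
    """Absolute-tolerance check only (no relative tolerance)."""
    return all(abs(r - a) <= atol for r, a in zip(reference, actual))


def check_outputs(
    reference: Sequence[float], actual: Sequence[float], min_sqnr_db: float
) -> None:
    """Fail with extra statistics when the SQNR falls below the threshold."""
    score = sqnr_db(reference, actual)
    if score < min_sqnr_db:
        raise AssertionError(
            f"SQNR {score:.2f} dB is below {min_sqnr_db} dB; "
            f"max abs error {max_abs_error(reference, actual):.4g}"
        )


# Made-up numbers standing in for eager and lowered model outputs.
eager = [1.0, 2.0, 3.0]
lowered = [1.0, 2.0, 3.05]

check_outputs(eager, lowered, min_sqnr_db=20.0)  # assumed threshold
```

SQNR scales with the magnitude of the reference signal, so it is less sensitive to the output range than a fixed `atol`; reporting the maximum absolute error on failure follows the suggestion above to print more statistics.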

metascroy · 2025-08-12T18:35:13Z

export/stages.py

+            raise ValueError("Mixed quantizer types are not supported")
+        if len(torch_ao_quantizers) > 1:
+            raise ValueError(
+                "Multiple quantizers of torch.ao.quantization.quantizer not supported"


Doesn't torchao already detect this and give an error if mixing? I thought I added that

Maybe the torchao version is different?
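
The validation under discussion can be sketched as follows. The classes are hypothetical stand-ins rather than the real quantizer types, `select_quantizer` is an illustrative name, and two behaviors are assumptions not confirmed by the diff: a single torchao quantizer is returned as-is, and an empty list raises an error. Only the two error messages are taken from the diff above.

```python
from typing import List, Sequence, Union


class LegacyQuantizer:
    """Stand-in for a torch.ao.quantization.quantizer-style (legacy) quantizer."""


class TorchAOQuantizer:
    """Stand-in for a torchao-style quantizer."""


class Composed:
    """Stand-in for a composable quantizer wrapping several torchao quantizers."""

    def __init__(self, quantizers: Sequence[TorchAOQuantizer]) -> None:
        self.quantizers: List[TorchAOQuantizer] = list(quantizers)


AnyQuantizer = Union[LegacyQuantizer, TorchAOQuantizer]


def select_quantizer(
    quantizers: Sequence[AnyQuantizer],
) -> Union[LegacyQuantizer, TorchAOQuantizer, Composed]:
    """Pick the single quantizer object to hand to the prepare step."""
    legacy = [q for q in quantizers if isinstance(q, LegacyQuantizer)]
    torchao = [q for q in quantizers if isinstance(q, TorchAOQuantizer)]

    if legacy and torchao:
        raise ValueError("Mixed quantizer types are not supported")
    if len(legacy) > 1:
        raise ValueError(
            "Multiple quantizers of torch.ao.quantization.quantizer not supported"
        )
    if legacy:
        return legacy[0]  # legacy quantizers are used directly
    if len(torchao) == 1:
        return torchao[0]  # assumption: no wrapper needed for a single quantizer
    if torchao:
        return Composed(torchao)  # multiple torchao quantizers are composed
    raise ValueError("No quantizers provided")  # assumption: empty input is an error


first, second = TorchAOQuantizer(), TorchAOQuantizer()
chosen = select_quantizer([first, second])
```

If torchao already rejects mixed quantizer types, as the comment above suggests, the first check would be redundant with a sufficiently recent torchao version.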

metascroy · 2025-08-12T20:21:17Z

Overall, looks good. Address comments before merging

…13264)" and "Add coreml quant recipes (#13265)" This reverts commit 0a7cea8 and 310a05d. It appears that #13264 broke unittest jobs and #13265 depends on it. ghstack-source-id: 6a7d30c ghstack-comment-id: 3184642863 Pull-Request: #13374

Fixing tests for stack that got reverted: #13264

Changes:

Support a filter function in `quantize_` when quantizing with torchao.
Update unit tests accordingly.
Use ComposableQuantizer when there are multiple quantizers of the torchao type; pass legacy quantizers directly to prepare_pt2e.
The source transform modifies the model in place, so deep copy the model first to avoid modifying the user-provided model.

Differential Revision: [D80206543](https://our.internmc.facebook.com/intern/diff/D80206543/)

Update

59affe6

meta-cla bot added the CLA Signed label Aug 10, 2025

abhinaykukkadapu mentioned this pull request Aug 10, 2025

Add coreml quant recipes #13265

Merged

abhinaykukkadapu requested review from digantdesai and metascroy August 10, 2025 03:56

Update

37bdc0b

abhinaykukkadapu requested a review from mcr229 as a code owner August 12, 2025 00:23

abhinaykukkadapu added 2 commits August 11, 2025 21:37

Update

4eb6a03

Update

7dab762

digantdesai reviewed Aug 12, 2025

backends/xnnpack/recipes/xnnpack_recipe_provider.py (resolved)

digantdesai reviewed Aug 12, 2025

export/stages.py (resolved)

metascroy reviewed Aug 12, 2025

export/stages.py (resolved)

metascroy approved these changes Aug 12, 2025

abhinaykukkadapu added 4 commits August 12, 2025 14:22

Update

fab2d54

Update

b5c56a2

Update

cbfe8bb

Update

482f3d6

abhinaykukkadapu merged commit 0a7cea8 into main Aug 13, 2025
97 of 104 checks passed

abhinaykukkadapu deleted the gh/abhinaykukkadapu/4/head branch August 13, 2025 01:11

swolchok mentioned this pull request Aug 13, 2025

Revert "Add TorchAO wrapper config to allow filter_fn for quantize_ (#13264)" and "Add coreml quant recipes (#13265)" #13374

Merged

abhinaykukkadapu mentioned this pull request Aug 13, 2025

[executorch] Add TorchAO wrapper config to allow filter_fn for quantize_ #13386

Merged

agrima1304 pushed a commit to agrima1304/executorch that referenced this pull request Aug 26, 2025

Add TorchAO wrapper config to allow filter_fn for quantize_ (pytorch#…

1a67f6c

…13264)
