Add coreml quant recipes #13265

abhinaykukkadapu · 2025-08-10T03:45:17Z

Adds Core ML quantization recipes, building on the FP32/FP16 recipes added in #13121.

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
INT4_WEIGHT_ONLY_PER_CHANNEL
INT4_WEIGHT_ONLY_PER_GROUP
INT8_WEIGHT_ONLY_PER_CHANNEL
INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY
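
As a reading aid, here is a minimal sketch of what the recipe names above encode: the quantization flow, the bit width, whether only weights are quantized, and the granularity. The `QuantRecipe` enum, its string values, and the `describe` helper are hypothetical and exist only for illustration; they are not the types defined in `coreml_recipe_types.py`.

```python
from enum import Enum


class QuantRecipe(Enum):
    """Hypothetical stand-in for the recipe names listed in this PR.

    The string values are made up for this sketch.
    """

    PT2E_INT8_STATIC = "pt2e_int8_static"
    PT2E_INT8_WEIGHT_ONLY = "pt2e_int8_weight_only"
    INT4_WEIGHT_ONLY_PER_CHANNEL = "int4_weight_only_per_channel"
    INT4_WEIGHT_ONLY_PER_GROUP = "int4_weight_only_per_group"
    INT8_WEIGHT_ONLY_PER_CHANNEL = "int8_weight_only_per_channel"
    INT8_WEIGHT_ONLY_PER_GROUP = "int8_weight_only_per_group"
    CODEBOOK_WEIGHT_ONLY = "codebook_weight_only"


def describe(recipe: QuantRecipe) -> dict:
    """Derive the properties that a recipe name spells out."""
    name = recipe.name

    if "INT4" in name:
        bits = 4
    elif "INT8" in name:
        bits = 8
    else:
        bits = None  # the codebook recipe does not name a bit width

    if name.endswith("PER_CHANNEL"):
        granularity = "per_channel"
    elif name.endswith("PER_GROUP"):
        granularity = "per_group"
    else:
        granularity = None  # not stated in the name

    return {
        "pt2e": name.startswith("PT2E_"),
        "bits": bits,
        "weight_only": "WEIGHT_ONLY" in name,
        "granularity": granularity,
    }


if __name__ == "__main__":
    for recipe in QuantRecipe:
        print(recipe.name, describe(recipe))
```

Only `PT2E_INT8_STATIC` lacks the `WEIGHT_ONLY` marker, which is why the sketch reports it as the one recipe that is not weight-only.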

[ghstack-poisoned]

abhinaykukkadapu · 2025-08-10T03:45:18Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2025-08-10T03:45:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13265

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 7 Pending, 2 Unrelated Failures

As of commit 3bd2fe0 with merge base 0e76a97:

NEW FAILURES - The following jobs have failed:

Build documentation / build (buck2) / Build doc (gh)
    At least one of the pre-conditions you specified did not hold
pull / unittest / linux / linux-job (gh)
    examples/models/llama/tests/test_ring_attention.py::TestRingAttention::test_single_token_processing_quantized
pull / unittest-arm-backend-with-no-fvp (test_pytest_ops) / linux-job (gh)
    RuntimeError: Command docker exec -t d69e849b47e9ddc5c4da46a0c1c09cfa73eb718e29644d2a51bdcfd1b80437f0 /exec failed with exit code 1
pull / unittest-editable / linux / linux-job (gh)
    examples/models/llama/tests/test_ring_attention.py::TestRingAttention::test_single_token_processing_quantized

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Apple / build-demo-ios / macos-job (gh) (trunk failure)
    RuntimeError: Command bash /Users/runner/work/_temp/exec_script failed with exit code 65
pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh) (trunk failure)
    backends/arm/test/models/stable_diffusion/test_vae_AutoencoderKL.py::TestAutoencoderKL::test_AutoencoderKL_tosa_MI

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 1bb73e0 ghstack-comment-id: 3172341606 Pull-Request: #13265

[ghstack-poisoned]

ghstack-source-id: 76f6fc8 ghstack-comment-id: 3172341606 Pull-Request: #13265

backends/apple/coreml/recipes/coreml_recipe_provider.py

[ghstack-poisoned]

ghstack-source-id: d147f71 ghstack-comment-id: 3172341606 Pull-Request: #13265

[ghstack-poisoned]

ghstack-source-id: cdcd86d ghstack-comment-id: 3172341606 Pull-Request: #13265

backends/apple/coreml/recipes/coreml_recipe_provider.py

backends/apple/coreml/recipes/coreml_recipe_types.py

backends/apple/coreml/test/test_coreml_recipes.py

ghstack-source-id: bf6c618 ghstack-comment-id: 3172341606 Pull-Request: #13265

[ghstack-poisoned]

ghstack-source-id: c2f2a36 ghstack-comment-id: 3172341606 Pull-Request: #13265

…13264)" and "Add coreml quant recipes (#13265)" This reverts commit 0a7cea8 and 310a05d. It appears that #13264 broke unittest jobs and #13265 depends on it. ghstack-source-id: 6a7d30c ghstack-comment-id: 3184642863 Pull-Request: #13374

Fixing tests for stack that got reverted: #13265

Adds coreml quant recipes after FP32/16 recipes added in #13121

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
INT4_WEIGHT_ONLY_PER_CHANNEL
INT4_WEIGHT_ONLY_PER_GROUP
INT8_WEIGHT_ONLY_PER_CHANNEL
INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY

Differential Revision: [D80206542](https://our.internmc.facebook.com/intern/diff/D80206542/)

[ghstack-poisoned]

Fixing tests for stack that got reverted: #13265

Adds coreml quant recipes after FP32/16 recipes added in #13121

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
INT4_WEIGHT_ONLY_PER_CHANNEL
INT4_WEIGHT_ONLY_PER_GROUP
INT8_WEIGHT_ONLY_PER_CHANNEL
INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY

Differential Revision: [D80206542](https://our.internmc.facebook.com/intern/diff/D80206542/)

ghstack-source-id: 302827577
Pull Request resolved: #13387

Pull Request resolved: #13387

Fixing tests for stack that got reverted: #13265

Adds coreml quant recipes after FP32/16 recipes added in #13121

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
INT4_WEIGHT_ONLY_PER_CHANNEL
INT4_WEIGHT_ONLY_PER_GROUP
INT8_WEIGHT_ONLY_PER_CHANNEL
INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY

ghstack-source-id: 302842396
@exported-using-ghexport

Differential Revision: [D80206542](https://our.internmc.facebook.com/intern/diff/D80206542/)

Pull Request resolved: #13387

Fixing tests for stack that got reverted: #13265

Adds coreml quant recipes after FP32/16 recipes added in #13121

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
INT4_WEIGHT_ONLY_PER_CHANNEL
INT4_WEIGHT_ONLY_PER_GROUP
INT8_WEIGHT_ONLY_PER_CHANNEL
INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY

ghstack-source-id: 302857006
@exported-using-ghexport

Differential Revision: [D80206542](https://our.internmc.facebook.com/intern/diff/D80206542/)

Pull Request resolved: #13387

Fixing tests for stack that got reverted: #13265

Adds coreml quant recipes after FP32/16 recipes added in #13121

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
INT4_WEIGHT_ONLY_PER_CHANNEL
INT4_WEIGHT_ONLY_PER_GROUP
INT8_WEIGHT_ONLY_PER_CHANNEL
INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY

ghstack-source-id: 302870102
@exported-using-ghexport

Differential Revision: [D80206542](https://our.internmc.facebook.com/intern/diff/D80206542/)

Pull Request resolved: #13387

Fixing tests for stack that got reverted: #13265

Adds coreml quant recipes after FP32/16 recipes added in #13121

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
TORCHAO_INT4_WEIGHT_ONLY_PER_CHANNEL
TORCHAO_INT4_WEIGHT_ONLY_PER_GROUP
TORCHAO_INT8_WEIGHT_ONLY_PER_CHANNEL
TORCHAO_INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY

ghstack-source-id: 303044428
@exported-using-ghexport

Differential Revision: [D80206542](https://our.internmc.facebook.com/intern/diff/D80206542/)

Pull Request resolved: #13387

Fixing tests for stack that got reverted: #13265

Adds coreml quant recipes after FP32/16 recipes added in #13121

Recipes added:

PT2E_INT8_STATIC
PT2E_INT8_WEIGHT_ONLY
TORCHAO_INT4_WEIGHT_ONLY_PER_CHANNEL
TORCHAO_INT4_WEIGHT_ONLY_PER_GROUP
TORCHAO_INT8_WEIGHT_ONLY_PER_CHANNEL
TORCHAO_INT8_WEIGHT_ONLY_PER_GROUP
CODEBOOK_WEIGHT_ONLY

ghstack-source-id: 303126085
@exported-using-ghexport

Differential Revision: [D80206542](https://our.internmc.facebook.com/intern/diff/D80206542/)

abhinaykukkadapu added 2 commits August 9, 2025 20:44

Update

59affe6

[ghstack-poisoned]

Update

ebc30b2

[ghstack-poisoned]

abhinaykukkadapu requested review from cccclai, digantdesai, mcr229 and shoumikhin as code owners August 10, 2025 03:45

meta-cla bot added the CLA Signed label Aug 10, 2025

abhinaykukkadapu added a commit that referenced this pull request Aug 10, 2025

Add coreml quant recipes

bf7413d

ghstack-source-id: 1bb73e0 ghstack-comment-id: 3172341606 Pull-Request: #13265

abhinaykukkadapu mentioned this pull request Aug 10, 2025

Add TorchAO wrapper config to allow filter_fn for quantize_ #13264

Merged

Update

1207a46

[ghstack-poisoned]

abhinaykukkadapu added a commit that referenced this pull request Aug 10, 2025

Add coreml quant recipes

801d78a

ghstack-source-id: 76f6fc8 ghstack-comment-id: 3172341606 Pull-Request: #13265

abhinaykukkadapu requested review from kimishpatel and metascroy and removed request for cccclai, mcr229 and shoumikhin August 10, 2025 03:52

abhinaykukkadapu commented Aug 10, 2025

backends/apple/coreml/recipes/coreml_recipe_provider.py Outdated, resolved

abhinaykukkadapu added a commit that referenced this pull request Aug 10, 2025

Add coreml quant recipes

c0c32b6

ghstack-source-id: 76f6fc8 ghstack-comment-id: 3172341606 Pull-Request: #13265

Update

1137ee9

[ghstack-poisoned]

abhinaykukkadapu added a commit that referenced this pull request Aug 10, 2025

Add coreml quant recipes

7f79b99

ghstack-source-id: d147f71 ghstack-comment-id: 3172341606 Pull-Request: #13265

Update

0e422bf

[ghstack-poisoned]

abhinaykukkadapu added a commit that referenced this pull request Aug 11, 2025

Add coreml quant recipes

8882d4a

ghstack-source-id: cdcd86d ghstack-comment-id: 3172341606 Pull-Request: #13265