[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage #12200


Merged

facebook-github-bot merged 7 commits into gh/ahmtox/28/base from gh/ahmtox/28/head

Jul 14, 2025

Contributor

ahmtox commented Jul 3, 2025 •


Stack from ghstack (oldest at bottom):

Context

Certain quantization operators require their scales and zero points to be stored as buffers. Because the existing op_registry cannot specify the memory or storage layout of individual input parameters, we instead declare buffer as the optimal storage type for these operators, so that a conversion pass is added to ensure the inputs are also buffers.

Changes

This moves the quantized_decomposed operators into their own registration and specifies buffer as the preferred storage type.

Differential Revision: D77746131
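
The idea in Context and Changes can be sketched with a toy registry. This is a minimal, self-contained illustration and not the actual op_registry code: `StorageType`, `OpFeatures`, `REGISTRY`, `update_features`, `register_quantization_ops`, and `needs_storage_conversion` are all hypothetical names, and the operator names are plain strings chosen for illustration. It assumes a conversion pass decides what to do by comparing an input's current storage with the operator's declared optimal storage.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict, Iterable, Optional


class StorageType(Enum):
    """Hypothetical stand-in for the backend's storage type enum."""
    BUFFER = "buffer"
    TEXTURE_3D = "texture_3d"


@dataclass
class OpFeatures:
    """Hypothetical per-operator feature record."""
    buffer_impl: bool = False
    optimal_storage: Optional[StorageType] = None  # None means no preference


# Toy registry mapping operator name -> features (assumption, not the real one).
REGISTRY: Dict[str, OpFeatures] = {}


def update_features(op_names: Iterable[str]) -> Callable:
    """Register every op in op_names using the decorated function."""
    def decorator(fn: Callable[[OpFeatures], OpFeatures]):
        for name in op_names:
            # Each op gets its own fresh OpFeatures instance.
            REGISTRY[name] = fn(OpFeatures())
        return fn
    return decorator


# The quantization ops get their own registration and prefer buffer storage.
@update_features(
    [
        "quantized_decomposed.quantize_per_tensor.default",
        "quantized_decomposed.dequantize_per_tensor.default",
        "quantized_decomposed.choose_qparams.tensor",
    ]
)
def register_quantization_ops(features: OpFeatures) -> OpFeatures:
    features.buffer_impl = True
    features.optimal_storage = StorageType.BUFFER
    return features


def needs_storage_conversion(op_name: str, input_storage: StorageType) -> bool:
    """Return True if an input must be converted to the op's optimal storage."""
    preferred = REGISTRY[op_name].optimal_storage
    return preferred is not None and input_storage is not preferred
```

In this sketch, a pass walking the graph would call `needs_storage_conversion` for each input of a registered operator and insert a conversion only when it returns `True`, which is how a per-operator preference can stand in for per-input layout constraints.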


[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage

734e1f8


ahmtox requested a review from SS-JIA as a code owner

July 3, 2025 18:17

pytorch-bot bot commented Jul 3, 2025 •


🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12200

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 9cea81b with merge base 1540659:

NEW FAILURE - The following job has failed:

pull / test-eval_llama-mmlu-linux / linux-job (gh)
RuntimeError: Command docker exec -t ae48193b6ddf40450d9dffe79cccd581b39ce5ffcdc0da6607f5d18b1199d796 /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot added the CLA Signed label

This was referenced Jul 3, 2025

[ET-VK][Ops] aligning Q/DQ/CQP op inputs with ATen impl #12199

Merged

[ET-VK][ez] enabling fp64->fp32 converison for vulkan compatibility #12201

Merged

[ET-VK] lowering ExecuTorch tensor dtype for Vulkan tensor dtype to enable 64bit #12202

Open

[ET] correcting cpu ref quantize_per_channel logic to align with ATen #12203

Merged

[ET-VK][Ops] quantize_per_channel reference impl and testing #12204

Merged

[ET-VK][Ops] quantize_per_channel shaders and impl #12205

Merged

[ET-VK][Ops] dequantize_per_channel reference impl and testing #12206

Merged

[ET-VK][Ops] dequantize_per_channel shaders and impl #12207

Merged

[ET-VK][Ops] quantize_per_tensor.tensor variant #12208

Merged

[ET-VK][Ops] dequantize_per_tensor.tensor variant #12209

Merged

[ET-VK][testing] Q/DQ/CQP op comprehensive delegate dynamic quantization testing #12210

Merged

github-actions bot commented Jul 3, 2025

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Contributor

facebook-github-bot commented Jul 3, 2025

This pull request was exported from Phabricator. Differential Revision: D77746131

facebook-github-bot added the fb-exported label

trivedivivek approved these changes



Update on "[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage"

534973e


Contributor

facebook-github-bot commented Jul 7, 2025

This pull request was exported from Phabricator. Differential Revision: D77746131


Update on "[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage"

acbdfb2


ahmtox mentioned this pull request

[ET-VK][Ops] affine quantization operators registration #12369

Merged

Contributor

facebook-github-bot commented Jul 10, 2025

This pull request was exported from Phabricator. Differential Revision: D77746131


Update on "[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage"

f86ca87


Contributor

facebook-github-bot commented Jul 11, 2025

This pull request was exported from Phabricator. Differential Revision: D77746131


Update on "[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage"

552c29f


Contributor

facebook-github-bot commented Jul 11, 2025

This pull request was exported from Phabricator. Differential Revision: D77746131


Update on "[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage"


Contributor

facebook-github-bot commented Jul 11, 2025

This pull request was exported from Phabricator. Differential Revision: D77746131


Update on "[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage"

9cea81b


Contributor

facebook-github-bot commented Jul 14, 2025

This pull request was exported from Phabricator. Differential Revision: D77746131

facebook-github-bot merged commit e11834d into gh/ahmtox/28/base

98 of 100 checks passed

facebook-github-bot deleted the gh/ahmtox/28/head branch

July 14, 2025 14:57

facebook-github-bot temporarily deployed to cherry-pick-bot

July 14, 2025 14:58

— with

GitHub Actions Inactive

pytorchbot mentioned this pull request

[ET-VK][ez][Ops] registering Q/DQ/CQP ops and specifying optimal storage #12429

Merged


Labels

CLA Signed fb-exported