[ET-VK] Fast path for choose_qparams #14019

SS-JIA · 2025-09-05T18:38:03Z

Stack from ghstack (oldest at bottom):

The current implementations of choose_qparams are too slow to be practically usable.

As a temporary workaround to unblock LLM optimizations, this diff/PR introduces a fast path for computing per-channel quantization parameters for 2D matrices in the form of the choose_qparams_per_row shader.

Differential Revision: D81800024

The current implementations of `choose_qparams` are too slow to be practically usable. As a temporary workaround to unblock LLM optimizations, this diff/PR introduces a fast path for computing per-channel quantization parameters for 2D matrices in the form of the choose_qparams_per_row shader. Differential Revision: [D81800024](https://our.internmc.facebook.com/intern/diff/D81800024/) [ghstack-poisoned]

pytorch-bot · 2025-09-05T18:38:06Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14019

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 2ee5b97 with merge base 1a7441f ():

NEW FAILURE - The following job has failed:

Build documentation / build (buck2) / Build doc (gh)
At least one of the pre-conditions you specified did not hold

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-09-05T18:38:56Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

facebook-github-bot · 2025-09-05T18:39:20Z

This pull request was exported from Phabricator. Differential Revision: D81800024

The current implementations of `choose_qparams` are too slow to be practically usable. As a temporary workaround to unblock LLM optimizations, this diff/PR introduces a fast path for computing per-channel quantization parameters for 2D matrices in the form of the choose_qparams_per_row shader. Differential Revision: [D81800024](https://our.internmc.facebook.com/intern/diff/D81800024/) [ghstack-poisoned]

facebook-github-bot · 2025-09-07T18:41:07Z

This pull request was exported from Phabricator. Differential Revision: D81800024

Pull Request resolved: #14019 The current implementations of `choose_qparams` are too slow to be practically usable. As a temporary workaround to unblock LLM optimizations, this diff/PR introduces a fast path for computing per-channel quantization parameters for 2D matrices in the form of the choose_qparams_per_row shader. ghstack-source-id: 308092877 @exported-using-ghexport Differential Revision: [D81800024](https://our.internmc.facebook.com/intern/diff/D81800024/)

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 5, 2025

facebook-github-bot added the fb-exported label Sep 5, 2025

manuelcandales approved these changes Sep 5, 2025

View reviewed changes

ssjia and others added 2 commits September 7, 2025 10:54

facebook-github-bot merged commit 6ec3deb into gh/SS-JIA/320/base Sep 8, 2025
115 of 118 checks passed

facebook-github-bot deleted the gh/SS-JIA/320/head branch September 8, 2025 00:05

facebook-github-bot temporarily deployed to cherry-pick-bot September 8, 2025 00:05 — with GitHub Actions Inactive

pytorchbot mentioned this pull request Sep 8, 2025

[ET-VK] Fast path for choose_qparams #14045

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ET-VK] Fast path for choose_qparams #14019

[ET-VK] Fast path for choose_qparams #14019

Uh oh!

SS-JIA commented Sep 5, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Sep 5, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 5, 2025

Uh oh!

facebook-github-bot commented Sep 5, 2025

Uh oh!

facebook-github-bot commented Sep 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[ET-VK] Fast path for choose_qparams #14019

[ET-VK] Fast path for choose_qparams #14019

Uh oh!

Conversation

SS-JIA commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14019

❌ 1 New Failure

Uh oh!

github-actions bot commented Sep 5, 2025

This PR needs a release notes: label

Uh oh!

facebook-github-bot commented Sep 5, 2025

Uh oh!

facebook-github-bot commented Sep 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SS-JIA commented Sep 5, 2025 •

edited

Loading

pytorch-bot bot commented Sep 5, 2025 •

edited

Loading

This PR needs a `release notes:` label