Added mse range setting #11857

rohansjoshi · 2025-06-23T18:05:13Z

Summary:
Added option to use MSE range setting algorithm. This algorithm does a linear grid search over scales and selects those which minimizes mean squared error (see the line_search method in the class PerChannelParamsObserver). This method is applied for quantizing weights per channel. Accuracy is still poor, but somewhat better than using MinMax.

On wikitext task, with grid size 200:

Model Name	max_seq_len	ptq	word_perplexity
Llama 3.2-1B Instruct	128	16a4w	2367107
Llama 3.2-1B Instruct	128	16a4w_block	5523977
Llama 3.2-1B Instruct	128	8a8w	501663

Reviewed By: cccclai

Differential Revision: D77055545

pytorch-bot · 2025-06-23T18:05:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11857

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 Cancelled Jobs

As of commit 83dc888 with merge base 2f55193 ():

CANCELLED JOBS - The following jobs were cancelled. Please retry:

pull / test-models-linux (mobilebert, portable, linux.2xlarge) / linux-job (gh)
##[error]The operation was canceled.
pull / test-models-linux (mobilebert, xnnpack-quantization-delegation, linux.2xlarge) / linux-job (gh)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-06-23T18:05:24Z

This pull request was exported from Phabricator. Differential Revision: D77055545

github-actions · 2025-06-23T18:06:08Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Summary: Added option to use MSE range setting algorithm. This algorithm does a linear grid search over scales and selects those which minimizes mean squared error (see the line_search method in the class PerChannelParamsObserver). This method is applied for quantizing weights per channel. On wikitext task: | Model Name | max_seq_len | ptq | range_setting | word_perplexity |----------|----------|----------|-----------|-----------| | Llama 3.2-1B Instruct | 1024 | 16a4w | MinMax | 184.06065814138435 | | Llama 3.2-1B Instruct | 1024 | 16a4w | MSE (grid size 200) | 292.199408962586 | Reviewed By: cccclai Differential Revision: D77055545

facebook-github-bot · 2025-07-04T01:11:46Z

This pull request was exported from Phabricator. Differential Revision: D77055545

Summary: Added option to use MSE range setting algorithm. This algorithm does a linear grid search over scales and selects those which minimizes mean squared error (see the line_search method in the class PerChannelParamsObserver). This method is applied for quantizing weights per channel. On wikitext task: | Model Name | max_seq_len | ptq | range_setting | word_perplexity |----------|----------|----------|-----------|-----------| | Llama 3.2-1B Instruct | 1024 | 16a4w | MinMax | 184.06065814138435 | | Llama 3.2-1B Instruct | 1024 | 16a4w | MSE (grid size 200) | 292.199408962586 | Reviewed By: cccclai Differential Revision: D77055545

Summary: Pull Request resolved: pytorch#11857 Added option to use MSE range setting algorithm. This algorithm does a linear grid search over scales and selects those which minimizes mean squared error (see the line_search method in the class PerChannelParamsObserver). This method is applied for quantizing weights per channel. On wikitext task: | Model Name | max_seq_len | ptq | range_setting | word_perplexity |----------|----------|----------|-----------|-----------| | Llama 3.2-1B Instruct | 1024 | 16a4w | MinMax | 184.06065814138435 | | Llama 3.2-1B Instruct | 1024 | 16a4w | MSE (grid size 200) | 292.199408962586 | Reviewed By: cccclai Differential Revision: D77055545

facebook-github-bot · 2025-07-06T18:00:25Z

This pull request was exported from Phabricator. Differential Revision: D77055545

Differential Revision: D77055545 Pull Request resolved: pytorch#11857

rohansjoshi requested a review from cccclai as a code owner June 23, 2025 18:05

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 23, 2025

facebook-github-bot added the fb-exported label Jun 23, 2025

rohansjoshi force-pushed the export-D77055545 branch from 74f00b7 to f7b1ecf Compare July 4, 2025 01:11

cccclai approved these changes Jul 4, 2025

View reviewed changes

rohansjoshi force-pushed the export-D77055545 branch from f7b1ecf to 83e056f Compare July 6, 2025 17:53

rohansjoshi force-pushed the export-D77055545 branch from 83e056f to 83dc888 Compare July 6, 2025 18:00

facebook-github-bot merged commit 6669637 into pytorch:main Jul 7, 2025
101 of 105 checks passed

Tanish2101 pushed a commit to Tanish2101/executorch that referenced this pull request Jul 9, 2025

Added mse range setting

0464996

Differential Revision: D77055545 Pull Request resolved: pytorch#11857

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added mse range setting #11857

Added mse range setting #11857

Uh oh!

rohansjoshi commented Jun 23, 2025

Uh oh!

pytorch-bot bot commented Jun 23, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Jun 23, 2025

Uh oh!

github-actions bot commented Jun 23, 2025

Uh oh!

facebook-github-bot commented Jul 4, 2025

Uh oh!

facebook-github-bot commented Jul 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Added mse range setting #11857

Added mse range setting #11857

Uh oh!

Conversation

rohansjoshi commented Jun 23, 2025

Uh oh!

pytorch-bot bot commented Jun 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/11857

❌ 2 Cancelled Jobs

Uh oh!

facebook-github-bot commented Jun 23, 2025

Uh oh!

github-actions bot commented Jun 23, 2025

This PR needs a release notes: label

Uh oh!

facebook-github-bot commented Jul 4, 2025

Uh oh!

facebook-github-bot commented Jul 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pytorch-bot bot commented Jun 23, 2025 •

edited

Loading

This PR needs a `release notes:` label