Conversation
@navsud navsud commented Aug 11, 2025

Summary: The 8-bit activation qconfig should not use reduce_range=True, which limits the quantization range to [0, 127]. This diff fixes that issue.

Differential Revision: D80007226
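
For reference, a minimal sketch of what the corrected activation config would look like. This is an illustrative snippet, not the patched file itself: it assumes the same FusedMovingAvgObsFakeQuantize-style config as the weight config quoted later in this thread, and the exact argument names in the file may differ.

    import torch
    from torch.ao.quantization import MovingAverageMinMaxObserver
    from torch.ao.quantization.fake_quantize import FusedMovingAvgObsFakeQuantize

    act_fake_quant_ctr = FusedMovingAvgObsFakeQuantize.with_args(
        dtype=torch.uint8,
        quant_min=torch.iinfo(torch.uint8).min,  # 0
        quant_max=torch.iinfo(torch.uint8).max,  # 255
        qscheme=torch.per_tensor_affine,
        reduce_range=False,  # the fix: keep the full 8-bit range [0, 255]
        observer=MovingAverageMinMaxObserver,
    )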

@navsud navsud requested a review from cccclai as a code owner August 11, 2025 16:12

pytorch-bot bot commented Aug 11, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13284

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 5 Unrelated Failures

As of commit 9f528a0 with merge base 310a05d:

NEW FAILURES - The following jobs have failed:

FLAKY - The following job failed, likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but were already failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed label Aug 11, 2025
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D80007226

@navsud navsud added the release notes: none label Aug 11, 2025
navsud added a commit to navsud/executorch that referenced this pull request Aug 11, 2025
Summary:

The 8-bit activation qconfig should not use reduce_range=True, which limits the quantization range to [0, 127]. This diff fixes that issue.

Differential Revision: D80007226
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D80007226


cccclai commented Aug 12, 2025

The 8-bit activation qconfig should not use reduce_range=True, which limits the quantization range to [0, 127]

What range should it be instead?


navsud commented Aug 12, 2025

The 8-bit activation qconfig should not use reduce_range=True, which limits the quantization range to [0, 127]

What range should it be instead?

Without reduce_range=True, it uses the default range of [0, 255], which is the correct range for uint8. With this change, I was able to recover the training loss during QAT back to that of the fp32 training.
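
To make the effect concrete, here is a standalone snippet (hypothetical, not from this PR) that prints the quant range a uint8 observer picks with and without reduce_range:

    import torch
    from torch.ao.quantization.observer import MinMaxObserver

    x = torch.randn(4, 8)
    for reduce_range in (True, False):
        obs = MinMaxObserver(dtype=torch.quint8, reduce_range=reduce_range)
        obs(x)  # observe some activations
        print(reduce_range, obs.quant_min, obs.quant_max)
    # reduce_range=True  -> quant range [0, 127] (only 7 usable bits)
    # reduce_range=False -> quant range [0, 255] (the full 8 bits)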


@cccclai cccclai left a comment


Thanks for the fix!

navsud added a commit to navsud/executorch that referenced this pull request Aug 13, 2025
Summary:

The 8-bit activation qconfig should not use reduce_range=True, which limits the quantization range to [0, 127]. This diff fixes that issue.

Reviewed By: cccclai

Differential Revision: D80007226
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D80007226

@facebook-github-bot facebook-github-bot merged commit e032ca3 into pytorch:main Aug 13, 2025
98 of 106 checks passed
agrima1304 pushed a commit to agrima1304/executorch that referenced this pull request Aug 26, 2025
Differential Revision: D80007226

Pull Request resolved: pytorch#13284
@Novelfor

I found that the weight fake-quant config uses reduce_range too:

    weight_fake_quant_ctr = FusedMovingAvgObsFakeQuantize.with_args(
        dtype=torch.int8,
        quant_min=torch.iinfo(torch.int8).min + 1,
        quant_max=torch.iinfo(torch.int8).max,
        qscheme=torch.per_tensor_symmetric,
        reduce_range=True,
        observer=MovingAverageMinMaxObserver,
    )

I think changing it to False would improve accuracy here too.
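
For concreteness, a sketch of the change being suggested (only the reduce_range flag flips; all other arguments stay as quoted above):

    weight_fake_quant_ctr = FusedMovingAvgObsFakeQuantize.with_args(
        dtype=torch.int8,
        quant_min=torch.iinfo(torch.int8).min + 1,  # -127, symmetric around zero
        quant_max=torch.iinfo(torch.int8).max,      # 127
        qscheme=torch.per_tensor_symmetric,
        reduce_range=False,  # suggested change
        observer=MovingAverageMinMaxObserver,
    )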


cccclai commented Aug 26, 2025

I found that the weight fake-quant config uses reduce_range too:

    weight_fake_quant_ctr = FusedMovingAvgObsFakeQuantize.with_args(
        dtype=torch.int8,
        quant_min=torch.iinfo(torch.int8).min + 1,
        quant_max=torch.iinfo(torch.int8).max,
        qscheme=torch.per_tensor_symmetric,
        reduce_range=True,
        observer=MovingAverageMinMaxObserver,
    )

I think changing it to False would improve accuracy here too.

cc: @haowhsu-quic @winskuo-quic @shewu-quic @DannyYuyang-quic


navsud commented Aug 26, 2025

@haowhsu-quic
I see reduce_range=True in many places in this file. Do we really need to reduce the range by one bit for the Qualcomm backends? If not, we should remove those flags to get that bit of range back, which would improve model accuracy.
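
For the symmetric int8 case, the same kind of standalone check (a hypothetical snippet; behavior with an explicitly customized quant_min/quant_max may vary across torch versions) shows the one-bit loss:

    import torch
    from torch.ao.quantization.observer import MinMaxObserver

    x = torch.randn(4, 8)
    for reduce_range in (True, False):
        obs = MinMaxObserver(
            dtype=torch.qint8,
            qscheme=torch.per_tensor_symmetric,
            reduce_range=reduce_range,
        )
        obs(x)
        print(reduce_range, obs.quant_min, obs.quant_max)
    # reduce_range=True  -> [-64, 63]   (7-bit range)
    # reduce_range=False -> [-128, 127] (full 8-bit range)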
