Quant doc updates #12240

metascroy · 2025-07-04T21:30:17Z

Initial draft of quantization doc updates (#10603)

pytorch-bot · 2025-07-04T21:30:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12240

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures, 15 Pending

As of commit dd47664 with merge base 9d599c9 ():

NEW FAILURES - The following jobs have failed:

Build Presets / zephyr (zephyr) / build (gh)
RuntimeError: Command docker exec -t c0de2dbf6d0e713d48749f2bf5491bc618056281c2db4d2e7f83b0bc1d9fa4ff /exec failed with exit code 1
pull / unittest / linux / linux-job (gh)
backends/xnnpack/test/ops/test_gelu.py::TestGelu::test_fp16_gelu
pull / unittest-editable / linux / linux-job (gh)
backends/xnnpack/test/ops/test_gelu.py::TestGelu::test_fp16_gelu

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-07-04T21:30:55Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

GregoryComer

Looks great. Thanks for updating this.

metascroy · 2025-07-07T20:31:48Z

@jerryzh168 can you give the changes a look over too?

jerryzh168 · 2025-07-07T21:53:19Z

docs/source/quantization-overview.md

+3. Lower the model to the target backend

-In addition to export based quantization (described above), ExecuTorch wants to highlight source based quantizations, accomplished via [torchao](https://github.com/pytorch/ao). Unlike export based quantization, source based quantization directly modifies the model prior to export. One specific example is `Int8DynActInt4WeightQuantizer`.
+## 1. Create a Backend-Specific Quantizer


nit: I think Configure would be more accurate

jerryzh168 · 2025-07-07T21:53:45Z

docs/source/quantization-overview.md

+
+## 3. Lower the model
+
+The final step is to lower the quantized_model to the desired backend, as you would an unquantized one.  See backend-specific pages for lowering information.


nit: link to docs as well?

jerryzh168

makes sense, wondering if we want to connect with transformer (or optimum-executorch?) as well, like what we did in https://docs.pytorch.org/ao/main/serving.html

metascroy · 2025-07-08T00:16:53Z

makes sense, wondering if we want to connect with transformer (or optimum-executorch?) as well, like what we did in https://docs.pytorch.org/ao/main/serving.html

The LLM doc overhaul is separate

metascroy · 2025-07-08T00:59:28Z

@pytorchbot cherry-pick --onto release/0.7 -c docs

Initial draft of quantization doc updates (#10603) (cherry picked from commit 8497ea7)

pytorchbot · 2025-07-08T01:01:39Z

Cherry picking #12240

The cherry pick PR is at #12260 The following tracker issues are updated:

[v0.7.0] Release Tracker #11075 (comment)

Details for Dev Infra team

Raised by workflow job

metascroy · 2025-07-08T01:01:54Z

@pytorchbot cherry-pick --onto release/0.7 -c “docs”

pytorch-bot · 2025-07-08T01:01:57Z

❌ 🤖 pytorchbot command failed:

@pytorchbot cherry-pick: error: argument -c/--classification: invalid choice: '“docs”' (choose from 'regression', 'critical', 'fixnewfeature', 'docs', 'release')

usage: @pytorchbot cherry-pick --onto ONTO [--fixes FIXES] -c
                               {regression,critical,fixnewfeature,docs,release}

Try @pytorchbot --help for more info.

Initial draft of quantization doc updates (pytorch#10603)

metascroy added 3 commits July 2, 2025 21:56

update quant overview page

174a6a6

up

5a4c839

up

df0e7a0

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 4, 2025

metascroy changed the title ~~Quant docs~~ Quant doc updates Jul 4, 2025

metascroy requested review from GregoryComer and jerryzh168 July 4, 2025 21:30

up

78a8a8d

metascroy marked this pull request as ready for review July 7, 2025 19:44

metascroy requested a review from mergennachin as a code owner July 7, 2025 19:44

GregoryComer approved these changes Jul 7, 2025

View reviewed changes

jerryzh168 reviewed Jul 7, 2025

View reviewed changes

jerryzh168 approved these changes Jul 7, 2025

View reviewed changes

up

dd47664

metascroy force-pushed the quant-docs branch from 9bf1f8d to dd47664 Compare July 8, 2025 00:16

metascroy merged commit 8497ea7 into main Jul 8, 2025
91 of 96 checks passed

metascroy deleted the quant-docs branch July 8, 2025 00:40

pytorchbot pushed a commit that referenced this pull request Jul 8, 2025

Quant doc updates (#12240)

0ae215d

Initial draft of quantization doc updates (#10603) (cherry picked from commit 8497ea7)

pytorchbot mentioned this pull request Jul 8, 2025

[v0.7.0] Release Tracker #11075

Closed

Tanish2101 pushed a commit to Tanish2101/executorch that referenced this pull request Jul 9, 2025

Quant doc updates (pytorch#12240)

d67c56c

Initial draft of quantization doc updates (pytorch#10603)


		## 3. Lower the model

		The final step is to lower the quantized_model to the desired backend, as you would an unquantized one. See backend-specific pages for lowering information.

Quant doc updates #12240

Quant doc updates #12240

Uh oh!

Conversation

metascroy commented Jul 4, 2025

Uh oh!

pytorch-bot bot commented Jul 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12240

❌ 3 New Failures, 15 Pending

Uh oh!

github-actions bot commented Jul 4, 2025

This PR needs a release notes: label

Uh oh!

GregoryComer left a comment

Choose a reason for hiding this comment

Uh oh!

metascroy commented Jul 7, 2025

Uh oh!

jerryzh168 Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

jerryzh168 Jul 7, 2025

Choose a reason for hiding this comment

Uh oh!

jerryzh168 left a comment

Choose a reason for hiding this comment

Uh oh!

metascroy commented Jul 8, 2025

Uh oh!

Uh oh!

metascroy commented Jul 8, 2025

Uh oh!

pytorchbot commented Jul 8, 2025

Cherry picking #12240

Uh oh!

metascroy commented Jul 8, 2025

Uh oh!

pytorch-bot bot commented Jul 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

pytorch-bot bot commented Jul 4, 2025 •

edited

Loading

This PR needs a `release notes:` label