Adding mixed quantization support #14134
Conversation
See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14134. Note: links to docs will display an error until the docs builds have completed. ✅ No failures as of commit 5188058 with merge base 181ed4d. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
This pull request was exported from Phabricator. Differential Revision: D81519735
Summary:
Pull Request resolved: pytorch#14134

# Context
This diff adds support for mixed-quantization operators in ExecuTorch: weights and biases can now be quantized while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern match.
2. Dequantize nodes are bypassed when they are not needed in the final graph.

Reviewed By: skrtskrtfb, mcremon-meta
Differential Revision: D81519735
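To make the mixed-quantization idea in the summary concrete, here is a minimal, self-contained numerics sketch (illustrative only, not the ExecuTorch implementation): the weight is stored as int8 with a per-tensor scale, while the input and output stay in fp32. The helper names and the symmetric per-tensor scheme are assumptions of this sketch; the real pass may use per-channel parameters and may quantize the bias as well.

```python
import torch

def quantize_weight(w: torch.Tensor):
    # Symmetric per-tensor int8 quantization (an assumption of this sketch).
    scale = w.abs().max() / 127.0
    q = torch.clamp(torch.round(w / scale), -127, 127).to(torch.int8)
    return q, scale

def mixed_linear(x, q_weight, scale, bias):
    # Dequantize the weight on the fly; activations never leave fp32.
    w = q_weight.to(torch.float32) * scale
    return torch.nn.functional.linear(x, w, bias)

x = torch.randn(4, 16)            # fp32 activation
w = torch.randn(8, 16)            # fp32 weight to quantize
b = torch.randn(8)                # bias (the PR allows quantizing this too)
q_w, s = quantize_weight(w)
out = mixed_linear(x, q_w, s, b)  # fp32 output
ref = torch.nn.functional.linear(x, w, b)
print("max abs quantization error:", (out - ref).abs().max().item())
```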
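The two items under "In this diff" describe graph-pass mechanics. Below is a hedged sketch of how such a pass could look on a torch.fx-style graph; the helper names (`is_dequant`, `bypass_dequantize`, `matched_ops`) and the name-based dequantize match are assumptions for illustration, not the actual ExecuTorch pass.

```python
import torch.fx as fx

def is_dequant(node: fx.Node) -> bool:
    # Heuristic name-based match (an assumption of this sketch); a real pass
    # would compare node.target against concrete dequantize op overloads.
    return node.op == "call_function" and "dequantize" in str(node.target)

def bypass_dequantize(gm: fx.GraphModule, matched_ops: set) -> fx.GraphModule:
    """Drop dequantize nodes whose consumers were all returned by the
    pattern match, i.e. ops that can consume the quantized tensor directly."""
    for node in list(gm.graph.nodes):
        if not is_dequant(node):
            continue
        if node.users and all(user in matched_ops for user in node.users):
            # Rewire consumers to the quantized producer (the first dequant
            # argument), then erase the now-dead dequantize node.
            node.replace_all_uses_with(node.args[0])
            gm.graph.erase_node(node)
    gm.graph.lint()
    gm.recompile()
    return gm
```

Gating the rewrite on every consumer being a matched op node mirrors item 1: the pattern match must hand back the op nodes so the pass can tell which consumers can accept the quantized tensor, and a dequantize that still feeds an unmatched fp32 consumer is left in place.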