Skip to content

Conversation

@mgiordy
Copy link
Contributor

@mgiordy mgiordy commented Sep 9, 2025

Summary:

Context

This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

In this diff

  1. Op nodes are returned from each pattern matching
  2. Dequantize nodes are bypassed if not needed in the final graph.

Differential Revision: D81519735

@pytorch-bot
Copy link

pytorch-bot bot commented Sep 9, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14134

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 5188058 with merge base 181ed4d (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 9, 2025
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D81519735

@github-actions
Copy link

github-actions bot commented Sep 9, 2025

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 10, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D81519735

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 10, 2025
Summary:
Pull Request resolved: pytorch#14134

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Differential Revision: D81519735
@mgiordy mgiordy force-pushed the export-D81519735 branch 2 times, most recently from 2ffaef2 to 49f3938 Compare September 16, 2025 17:44
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 16, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 23, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 24, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 25, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 25, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 25, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 26, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 26, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 26, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 27, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 27, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 29, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

mgiordy pushed a commit to mgiordy/executorch that referenced this pull request Sep 29, 2025
Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

Summary:

# Context
This Diff adds support for mixed quantization operators in Executorch. Now weights and biases can be quantized, while inputs and activations are kept in floating point.

# In this diff
1. Op nodes are returned from each pattern matching
2. Dequantize nodes are bypassed if not needed in the final graph.

Reviewed By: skrtskrtfb, mcremon-meta

Differential Revision: D81519735
@facebook-github-bot
Copy link
Contributor

@mgiordy has exported this pull request. If you are a Meta employee, you can view the originating diff in D81519735.

@facebook-github-bot facebook-github-bot merged commit d4d24ec into pytorch:main Sep 30, 2025
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported meta-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants