Add XNN U8 op support via quantization #8330

GregoryComer · 2025-02-09T13:04:48Z

Summary

Support U8 ops in the XNNPACK delegate by treating the input and output tensors as u8 asymmetric-quantized tensors with scale=1 and zero_point=0. This PR adds U8 support for upsample_bilinear2d, cat, slice, and _to_copy (when used to convert u8 to f32). More ops are possible with this method.

Conversion from u8 to f32 is done via transformation into a dequantize op. This is implemented in a new pass - ReplaceU8ConvertWithDqPass. The general U8 to quantized U8 transformation is done in define_tensor in node_visitor.py. U8 inputs are created as quantized tensors with the appropriate qparams (scale=1, zp=0).

Test plan

I've added op-level u8 tests to each of the new ops, as well as tests for the ReplaceU8ConvertWithDqPass. I've also added an end-to-end test for MobileNetV3 with a wrapper to take U8 inputs, resize and crop, and then convert to f32 and run the model.

pytorch-bot · 2025-02-09T13:04:51Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/8330

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1fbaf0e with merge base d99970b ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-02-10T04:23:01Z

@GregoryComer has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

mcr229

This generally looks right to me.

digantdesai · 2025-02-12T22:02:10Z

Ah reviewed internally first, please look at the comments there, next time I will remember.

github-actions · 2025-08-30T00:51:15Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

github-actions · 2025-10-30T00:51:27Z

Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as Stale.
Feel free to remove the Stale label if you feel this was a mistake.
If you are unable to remove the Stale label please contact a maintainer in order to do so.
If you want the bot to never mark this PR stale again, add the no-stale label.
Stale pull requests will automatically be closed after 30 days of inactivity.

GregoryComer requested review from digantdesai and mcr229 February 9, 2025 13:04

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 9, 2025

GregoryComer force-pushed the xnn-u8 branch from 5de9ff0 to 3e2fa60 Compare February 9, 2025 13:07

GregoryComer added the release notes: xnnpack Changes to the XNNPack backend delegate label Feb 9, 2025

GregoryComer force-pushed the xnn-u8 branch 2 times, most recently from 4897af3 to c5134f1 Compare February 10, 2025 02:58

Add XNN U8 op support via quantization

1fbaf0e

GregoryComer force-pushed the xnn-u8 branch from c5134f1 to 1fbaf0e Compare February 10, 2025 04:19

GregoryComer marked this pull request as ready for review February 10, 2025 04:21

GregoryComer changed the title ~~(WIP) Add XNN U8 op support via quantization~~ Add XNN U8 op support via quantization Feb 10, 2025

mcr229 approved these changes Feb 10, 2025

View reviewed changes

github-actions bot added the stale PRs inactive for over 60 days label Aug 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add XNN U8 op support via quantization #8330

Add XNN U8 op support via quantization #8330

Uh oh!

GregoryComer commented Feb 9, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 9, 2025 •

edited

Loading

Uh oh!

facebook-github-bot commented Feb 10, 2025

Uh oh!

mcr229 left a comment

Uh oh!

digantdesai commented Feb 12, 2025

Uh oh!

github-actions bot commented Aug 30, 2025

Uh oh!

github-actions bot commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add XNN U8 op support via quantization #8330

Are you sure you want to change the base?

Add XNN U8 op support via quantization #8330

Uh oh!

Conversation

GregoryComer commented Feb 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

pytorch-bot bot commented Feb 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/8330

✅ No Failures

Uh oh!

facebook-github-bot commented Feb 10, 2025

Uh oh!

mcr229 left a comment

Choose a reason for hiding this comment

Uh oh!

digantdesai commented Feb 12, 2025

Uh oh!

github-actions bot commented Aug 30, 2025

Uh oh!

github-actions bot commented Oct 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

GregoryComer commented Feb 9, 2025 •

edited

Loading

pytorch-bot bot commented Feb 9, 2025 •

edited

Loading