Allow partitioning quantized linear for FP32-only partition #7284

digantdesai · 2024-12-11T06:20:05Z

Summary:
Add overwrite precision of linear op in partitioning.

When using legacy_mode, we will test we don't partition [add]mm given,
(1) We can't assume that weights are always static (non param).
(2) Alternatively, when lowering [add]mm to xnn::bmm we can't support bias.
(2)(a) Only lowering non-bias [add]mm, which is only exposed on legacy_path deemed low ROI.

Added tests to make sure we see this behavior

Differential Revision: D67011716

pytorch-bot · 2024-12-11T06:20:08Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7284

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Cancelled Job

As of commit 9b1bb48 with merge base 9d1a310 ():

NEW FAILURE - The following job has failed:

Check Labels / Check labels (gh)
RuntimeError: Error checking labels: PR does not have required labels

CANCELLED JOB - The following job was cancelled. Please retry:

pull / unittest / macos / macos-job (gh)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-12-11T06:20:32Z

This pull request was exported from Phabricator. Differential Revision: D67011716

github-actions · 2024-12-11T06:21:03Z

This PR needs a `release notes:` label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

…7284) Summary: Pull Request resolved: pytorch#7284 Add overwrite precision of linear op in partitioning. When using legacy_mode, we will test we don't partition [add]mm given, (1) We can't assume that weights are always static (non param). (2) Alternatively, when lowering [add]mm to xnn::bmm we can't support bias. (2)(a) Only lowering non-bias [add]mm, which is only exposed on legacy_path deemed low ROI. Added tests to make sure we see this behavior Differential Revision: D67011716

facebook-github-bot · 2024-12-11T06:48:57Z

This pull request was exported from Phabricator. Differential Revision: D67011716

…7284) Summary: Pull Request resolved: pytorch#7284 Add overwrite precision of linear op in partitioning. When using legacy_mode, we will test we don't partition [add]mm given, (1) We can't assume that weights are always static (non param). (2) Alternatively, when lowering [add]mm to xnn::bmm we can't support bias. (2)(a) Only lowering non-bias [add]mm, which is only exposed on legacy_path deemed low ROI. Added tests to make sure we see this behavior Differential Revision: D67011716

facebook-github-bot · 2024-12-11T07:01:18Z

This pull request was exported from Phabricator. Differential Revision: D67011716

…7284) Summary: Add overwrite precision of linear op in partitioning. When using legacy_mode, we will test we don't partition [add]mm given, (1) We can't assume that weights are always static (non param). (2) Alternatively, when lowering [add]mm to xnn::bmm we can't support bias. (2)(a) Only lowering non-bias [add]mm, which is only exposed on legacy_path deemed low ROI. Added tests to make sure we see this behavior Reviewed By: mcr229 Differential Revision: D67011716

facebook-github-bot · 2024-12-17T05:30:47Z

This pull request was exported from Phabricator. Differential Revision: D67011716

…7284) Summary: Add overwrite precision of linear op in partitioning. When using legacy_mode, we will test we don't partition [add]mm given, (1) We can't assume that weights are always static (non param). (2) Alternatively, when lowering [add]mm to xnn::bmm we can't support bias. (2)(a) Only lowering non-bias [add]mm, which is only exposed on legacy_path deemed low ROI. Added tests to make sure we see this behavior Reviewed By: mcr229 Differential Revision: D67011716

facebook-github-bot · 2024-12-17T06:29:55Z

This pull request was exported from Phabricator. Differential Revision: D67011716

…7284) Summary: Add overwrite precision of linear op in partitioning. When using legacy_mode, we will test we don't partition [add]mm given, (1) We can't assume that weights are always static (non param). (2) Alternatively, when lowering [add]mm to xnn::bmm we can't support bias. (2)(a) Only lowering non-bias [add]mm, which is only exposed on legacy_path deemed low ROI. Added tests to make sure we see this behavior Reviewed By: mcr229 Differential Revision: D67011716

facebook-github-bot · 2024-12-17T16:00:35Z

This pull request was exported from Phabricator. Differential Revision: D67011716

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 11, 2024

facebook-github-bot added the fb-exported label Dec 11, 2024

digantdesai force-pushed the export-D67011716 branch from 2ad1d6c to 7b32bc5 Compare December 11, 2024 06:48

digantdesai force-pushed the export-D67011716 branch from 7b32bc5 to ed167b6 Compare December 11, 2024 07:01

digantdesai force-pushed the export-D67011716 branch from ed167b6 to adf9fc1 Compare December 17, 2024 05:30

digantdesai added the module: xnnpack Issues related to xnnpack delegation and the code under backends/xnnpack/ label Dec 17, 2024

digantdesai force-pushed the export-D67011716 branch from adf9fc1 to 8b47c4d Compare December 17, 2024 06:29

digantdesai force-pushed the export-D67011716 branch from 8b47c4d to 9b1bb48 Compare December 17, 2024 16:00

mcr229 approved these changes Dec 18, 2024

View reviewed changes

facebook-github-bot merged commit 884d16d into pytorch:main Dec 18, 2024
42 of 46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow partitioning quantized linear for FP32-only partition #7284

Allow partitioning quantized linear for FP32-only partition #7284

Uh oh!

digantdesai commented Dec 11, 2024

Uh oh!

pytorch-bot bot commented Dec 11, 2024 •

edited

Loading

Uh oh!

facebook-github-bot commented Dec 11, 2024

Uh oh!

github-actions bot commented Dec 11, 2024

Uh oh!

facebook-github-bot commented Dec 11, 2024

Uh oh!

facebook-github-bot commented Dec 11, 2024

Uh oh!

facebook-github-bot commented Dec 17, 2024

Uh oh!

facebook-github-bot commented Dec 17, 2024

Uh oh!

facebook-github-bot commented Dec 17, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Allow partitioning quantized linear for FP32-only partition #7284

Allow partitioning quantized linear for FP32-only partition #7284

Uh oh!

Conversation

digantdesai commented Dec 11, 2024

Uh oh!

pytorch-bot bot commented Dec 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/7284

❌ 1 New Failure, 1 Cancelled Job

Uh oh!

facebook-github-bot commented Dec 11, 2024

Uh oh!

github-actions bot commented Dec 11, 2024

This PR needs a release notes: label

Uh oh!

facebook-github-bot commented Dec 11, 2024

Uh oh!

facebook-github-bot commented Dec 11, 2024

Uh oh!

facebook-github-bot commented Dec 17, 2024

Uh oh!

facebook-github-bot commented Dec 17, 2024

Uh oh!

facebook-github-bot commented Dec 17, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pytorch-bot bot commented Dec 11, 2024 •

edited

Loading

This PR needs a `release notes:` label