[TorchToLinalg] Use Op with native channel order for quantized conv2d #3807

ubfx · 2024-10-20T17:43:19Z

I've upstreamed the necessary quantized linalg Op with the "channel-first" ordering used by torch (llvm/llvm-project#107740) for 2d convolution.

This patch changes the lowering for the quantized 2d case of `aten.convolution` accordingly, which saves three transpositions per convolution (input, weights, result) and therefore removes the need to optimize them away in downstream passes.
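To illustrate where the three transpositions come from, here is a minimal pure-Python sketch (not torch-mlir or MLIR code). It assumes stride 1, no padding, per-tensor zero points, a channel-first layout of NCHW input with FCHW weights, and a channel-last layout of NHWC input with HWCF weights. All function names are hypothetical and exist only for this illustration.

```python
from itertools import product


def shape4(t):
    """Shape of a 4-D nested list."""
    return (len(t), len(t[0]), len(t[0][0]), len(t[0][0][0]))


def zeros4(d0, d1, d2, d3):
    """4-D nested list filled with zeros."""
    return [[[[0] * d3 for _ in range(d2)] for _ in range(d1)] for _ in range(d0)]


def transpose4(t, perm):
    """Permute axes so that output axis k is input axis perm[k]."""
    dims = shape4(t)
    out = zeros4(*(dims[p] for p in perm))
    for idx in product(*(range(d) for d in dims)):
        j = [idx[p] for p in perm]
        out[j[0]][j[1]][j[2]][j[3]] = t[idx[0]][idx[1]][idx[2]][idx[3]]
    return out


def conv2d_nchw_fchw_q(x, w, x_zp, w_zp):
    """Quantized conv directly in channel-first layout (integer accumulators)."""
    n_, c_, h_, w_in = shape4(x)
    f_, _, kh_, kw_ = shape4(w)
    oh_, ow_ = h_ - kh_ + 1, w_in - kw_ + 1
    out = zeros4(n_, f_, oh_, ow_)
    for n, f, oh, ow in product(range(n_), range(f_), range(oh_), range(ow_)):
        out[n][f][oh][ow] = sum(
            (x[n][c][oh + kh][ow + kw] - x_zp) * (w[f][c][kh][kw] - w_zp)
            for c, kh, kw in product(range(c_), range(kh_), range(kw_))
        )
    return out


def conv2d_nhwc_hwcf_q(x, w, x_zp, w_zp):
    """The same computation, but expecting channel-last operands."""
    n_, h_, w_in, c_ = shape4(x)
    kh_, kw_, _, f_ = shape4(w)
    oh_, ow_ = h_ - kh_ + 1, w_in - kw_ + 1
    out = zeros4(n_, oh_, ow_, f_)
    for n, oh, ow, f in product(range(n_), range(oh_), range(ow_), range(f_)):
        out[n][oh][ow][f] = sum(
            (x[n][oh + kh][ow + kw][c] - x_zp) * (w[kh][kw][c][f] - w_zp)
            for kh, kw, c in product(range(kh_), range(kw_), range(c_))
        )
    return out


def conv_via_channel_last(x, w, x_zp, w_zp):
    """Channel-first data pushed through a channel-last op: three transposes."""
    x_nhwc = transpose4(x, (0, 2, 3, 1))      # 1) input:   NCHW -> NHWC
    w_hwcf = transpose4(w, (2, 3, 1, 0))      # 2) weights: FCHW -> HWCF
    y_nhwc = conv2d_nhwc_hwcf_q(x_nhwc, w_hwcf, x_zp, w_zp)
    return transpose4(y_nhwc, (0, 3, 1, 2))   # 3) result:  NHWC -> NCHW


# Small deterministic example: 1x2x3x3 input, 2x2x2x2 weights.
x = [[[[(c * 9 + h * 3 + v) % 7 for v in range(3)] for h in range(3)] for c in range(2)]]
w = [[[[(f + c + kh * 2 + kw) % 5 for kw in range(2)] for kh in range(2)]
      for c in range(2)] for f in range(2)]

direct = conv2d_nchw_fchw_q(x, w, x_zp=1, w_zp=2)
roundabout = conv_via_channel_last(x, w, x_zp=1, w_zp=2)
print(direct == roundabout)
```

Both paths produce identical accumulators; the direct channel-first version simply never materializes the three transposed tensors.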

ubfx · 2024-10-20T17:44:51Z

I'm upstreaming the necessary linalg ops for the conv 3d and depthwise cases into MLIR next.

zjgarvey

Nice.

ubfx · 2024-10-21T08:36:28Z

I'm not merging this yet because I noticed that none of the quantized Conv2D E2E tests are currently being evaluated properly (i.e. they're all in the XFAIL set), because the fx_importer frontend can't handle them.

zjgarvey · 2024-10-21T13:54:43Z

Are we no longer running the old importer path in the CI? You should at least be able to test locally with that path.

zjgarvey · 2024-10-21T13:58:27Z

Yeah, try running this locally:

projects/pt1/tools/e2e_test.sh -f Conv2dQInt8Module_basic -v

It should print off IR for that example.

Then I'd check all the Conv2dQ tests with:

projects/pt1/tools/e2e_test.sh -f Conv2dQ -v

ubfx · 2024-10-21T15:07:21Z

> Are we no longer running the old importer path in the CI?

No, it seems like only onnx and fx_importer are now run on CI.

> You should at least be able to test locally with that path.

Yeah, I was able to test locally, but it seems dangerous not to have the quantized tests run on CI at all. By now I have also retired the jit_ir_importer locally, though. This seems like a good opportunity to move the quantized tests over to the DQ/Q paradigm, which we use everywhere else: #3809
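For readers unfamiliar with the DQ/Q (dequantize/quantize) style, the following is a minimal sketch of the standard per-tensor affine mapping it relies on. The function names, the int8 range defaults, and the rounding mode are assumptions for illustration and are not taken from the torch-mlir test suite.

```python
def quantize(x, scale, zero_point, qmin=-128, qmax=127):
    """Map a float to an integer: round(x / scale) + zero_point, clamped to the integer range."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def dequantize(q, scale, zero_point):
    """Map an integer back to a float: (q - zero_point) * scale."""
    return (q - zero_point) * scale


# In a DQ/Q-style graph, quantized tensors are dequantized, the float op is
# applied, and the result is quantized again.
q = quantize(0.5, scale=0.25, zero_point=3)
print(q, dequantize(q, scale=0.25, zero_point=3))
```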

ubfx requested review from rsuderman, stellaraccident and zjgarvey October 20, 2024 17:43

zjgarvey approved these changes Oct 20, 2024

ubfx force-pushed the conv2d-q-new-op branch from ebf59f5 to 688b178 Compare October 22, 2024 17:06

ubfx merged commit aca33f1 into llvm:main Oct 22, 2024
3 checks passed

ubfx deleted the conv2d-q-new-op branch October 24, 2024 10:08
