@mcfi
Contributor

@mcfi mcfi commented Nov 13, 2025

Summary:
This diff adds convolution support to FBGEMM on arm64 by reusing the existing reference implementations.

  1. Introduced conv_requant_ref, which invokes the reference conv_ref and requantize_u8acc32_ref, and used it in places where only x86 conv implementations are available.
  2. Changed weight matrix packing to either be a no-op or call transposeConvWeights.
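
A minimal sketch of the pattern in item 1, not the actual FBGEMM code: a naive reference convolution accumulates into int32, and a requantization step maps the accumulators back to uint8. The names `conv1d_ref_sketch`, `requantize_sketch`, and `conv_requant_sketch`, the 1-D single-channel shape, and the float multiplier are all hypothetical and chosen only to keep the example short. The real `conv_ref` and `requantize_u8acc32_ref` signatures are not shown in this description.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a reference convolution: 1-D, single channel,
// stride 1, no padding. Accumulates (a - act_zero_point) * w into int32.
std::vector<std::int32_t> conv1d_ref_sketch(
    const std::vector<std::uint8_t>& act,
    std::int32_t act_zero_point,
    const std::vector<std::int8_t>& weights) {
  if (weights.empty() || act.size() < weights.size()) {
    return {};  // no valid output positions
  }
  const std::size_t out_len = act.size() - weights.size() + 1;
  std::vector<std::int32_t> acc(out_len, 0);
  for (std::size_t o = 0; o < out_len; ++o) {
    for (std::size_t k = 0; k < weights.size(); ++k) {
      acc[o] += (static_cast<std::int32_t>(act[o + k]) - act_zero_point) *
          static_cast<std::int32_t>(weights[k]);
    }
  }
  return acc;
}

// Hypothetical stand-in for requantization: scale each int32 accumulator,
// round to nearest, add the output zero point, and saturate to [0, 255].
std::vector<std::uint8_t> requantize_sketch(
    const std::vector<std::int32_t>& acc,
    float multiplier,
    std::int32_t out_zero_point) {
  std::vector<std::uint8_t> out(acc.size());
  for (std::size_t i = 0; i < acc.size(); ++i) {
    const long rounded = std::lrintf(static_cast<float>(acc[i]) * multiplier);
    const long shifted = rounded + out_zero_point;
    out[i] = static_cast<std::uint8_t>(
        std::min<long>(255, std::max<long>(0, shifted)));
  }
  return out;
}

// The composition described in item 1: reference conv, then requantize.
std::vector<std::uint8_t> conv_requant_sketch(
    const std::vector<std::uint8_t>& act,
    std::int32_t act_zero_point,
    const std::vector<std::int8_t>& weights,
    float multiplier,
    std::int32_t out_zero_point) {
  return requantize_sketch(
      conv1d_ref_sketch(act, act_zero_point, weights),
      multiplier,
      out_zero_point);
}
```

Composing two reference routines like this favors correctness and portability over speed, which matches the stated plan to optimize each convolution kind in follow-up diffs.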

This diff unblocks FBGEMM users' convolution code on arm64. We plan to add follow-up diffs to optimize each kind of convolution (e.g., depthwise, directconv).
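
The weight-packing change in item 2 can be pictured with a small sketch. It assumes, purely for illustration, that "packing" on the fallback path either passes the weights through unchanged or re-lays out a row-major matrix by transposing it. The names `transpose_weights_sketch` and `pack_weights_sketch`, the `needs_transpose` flag, and the 2-D layout are hypothetical; the actual layouts handled by transposeConvWeights are not stated in this description.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical: transpose a row-major [rows][cols] int8 matrix
// into a row-major [cols][rows] matrix.
std::vector<std::int8_t> transpose_weights_sketch(
    const std::vector<std::int8_t>& src,
    std::size_t rows,
    std::size_t cols) {
  std::vector<std::int8_t> dst(src.size());
  for (std::size_t r = 0; r < rows; ++r) {
    for (std::size_t c = 0; c < cols; ++c) {
      dst[c * rows + r] = src[r * cols + c];
    }
  }
  return dst;
}

// Hypothetical fallback "packing": either a pass-through copy (no-op)
// or a layout change via the transpose above.
std::vector<std::int8_t> pack_weights_sketch(
    const std::vector<std::int8_t>& src,
    std::size_t rows,
    std::size_t cols,
    bool needs_transpose) {
  return needs_transpose ? transpose_weights_sketch(src, rows, cols) : src;
}
```

Because a reference kernel reads weights in a plain layout, the packing step has no blocked or vectorized format to produce, which is why it can shrink to a copy or a single transpose.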

Differential Revision: D86548699

@meta-cla meta-cla bot added the cla signed label Nov 13, 2025
@meta-codesync
Contributor

meta-codesync bot commented Nov 13, 2025

@mcfi has exported this pull request. If you are a Meta employee, you can view the originating Diff in D86548699.

mcfi added a commit to mcfi/FBGEMM that referenced this pull request Nov 14, 2025
… APIs (pytorch#5126)

Summary:
X-link: facebookresearch/FBGEMM#2128


This diff adds convolution support to FBGEMM on arm64 by reusing the existing reference implementations.
1. Introduced conv_requant_ref, which invokes the reference conv_ref and requantize_u8acc32_ref, and used it in places where only x86 conv implementations are available.
2. Changed weight matrix packing to either be a no-op or call transposeConvWeights.

This diff unblocks FBGEMM users' convolution code on arm64. We plan to add follow-up diffs to optimize each kind of convolution (e.g., depthwise, directconv).

Differential Revision: D86548699
mcfi added a commit to mcfi/FBGEMM that referenced this pull request Nov 15, 2025
… APIs (pytorch#5126)