
Conversation


@leafs1 leafs1 commented Jul 3, 2025

Summary

This PR adds support for the `to_dim_order_copy` operation in the XNNPACK delegate partitioner, enabling direct handling of memory-format conversions initiated by users via `.to(memory_format=)` calls. This significantly improves performance by producing more compact graphs that avoid unnecessary partition boundaries at memory-format conversion points. By delegating these operations directly to XNNPACK, we eliminate the overhead of context switching between the runtime and the delegate, reducing both execution time and memory footprint. The implementation leverages XNNPACK's highly optimized memory-format conversion routines, which are designed for efficient tensor layout transformations on various hardware targets.
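
For illustration (this snippet is not from the PR; the module, shapes, and lowering calls are a sketch of the standard ExecuTorch export flow), this is the kind of user code whose conversion can now stay inside the delegated partition:

```python
import torch
from executorch.backends.xnnpack.partition.xnnpack_partitioner import XnnpackPartitioner
from executorch.exir import to_edge_transform_and_lower

class ConvertAndConv(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = torch.nn.Conv2d(3, 8, kernel_size=3)

    def forward(self, x):
        # A user-initiated memory-format conversion: previously this could
        # end the XNNPACK partition; with this PR it can be lowered as a
        # to_dim_order_copy inside the delegate.
        x = x.to(memory_format=torch.channels_last)
        return self.conv(x)

ep = torch.export.export(ConvertAndConv(), (torch.randn(1, 3, 16, 16),))
edge = to_edge_transform_and_lower(ep, partitioner=[XnnpackPartitioner()])
```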

Test plan

Confirmed expected output for user-specified dim-order conversions, as well as correct partitioning. I did this by writing individual tests for the `to_copy` op, ensuring it changes dim order and dtype when appropriate. Also added a test module to confirm that the `to_copy` nodes are partitioned with the surrounding graph rather than left in a separate partition.
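
A minimal sketch of the property those tests assert (illustrative only; not the actual test code from this PR):

```python
import torch

# The op under test should change dim order and dtype as requested.
x = torch.randn(1, 3, 8, 8)  # contiguous (NCHW) dim order
y = x.to(memory_format=torch.channels_last, dtype=torch.float16)

assert y.is_contiguous(memory_format=torch.channels_last)  # dim order changed
assert y.dtype == torch.float16                            # dtype changed
```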

@leafs1 requested review from digantdesai and mcr229 as code owners July 3, 2025 23:09

pytorch-bot bot commented Jul 3, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/12220

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures

As of commit 365de21 with merge base a8070ec:


This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label Jul 3, 2025

leafs1 commented Jul 3, 2025

@pytorchbot label "release notes: none"

@pytorch-bot added the release notes: none label Jul 3, 2025
    # The node requires nchw inputs
    for input_node in node.all_input_nodes:
        self.input_to_nchw(graph_module, input_node, node)
elif node.target == exir_ops.edge.aten._to_copy.default:

So the reason we still have `to_copy` even after we partition `to_dim_order_copy` is that we revert it back to `to_copy`. When we add a node visitor for `to_dim_order_copy` next, we should remove the revert pass.
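
To make the distinction concrete, a sketch (the `dim_order_ops` target path is my assumption about the namespace, not code from this PR):

```python
from executorch.exir.dialects._ops import ops as exir_ops

def is_to_copy(node) -> bool:
    # Today: a revert pass turns _to_dim_order_copy back into aten._to_copy
    # before delegation, so passes match on the aten op.
    return node.target == exir_ops.edge.aten._to_copy.default

def is_to_dim_order_copy(node) -> bool:
    # Once a node visitor serializes the dim-order op directly, the revert
    # pass can be dropped and checks would target the dim_order_ops variant.
    return node.target == exir_ops.edge.dim_order_ops._to_dim_order_copy.default
```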


I see. Should I make those changes in a follow-up PR, or would it be better to keep them here?

@@ -0,0 +1,85 @@
# Copyright (c) Meta Platforms, Inc. and affiliates.

As I said earlier, using `to_copy` is OK, but we can just as easily move to `to_dim_order_copy` and remove the dim-order ops revert pass.

@leafs1 force-pushed the milestone2.1 branch 3 times, most recently from bbc194c to c26c56b on July 11, 2025 21:26
        return True

    def supported_precision_types(self) -> List[ConfigPrecisionType]:
        return [ConfigPrecisionType.FP32]

Add `ConfigPrecisionType.STATIC_QUANT`.
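
That is, roughly (a sketch of the suggested edit; the import path for `ConfigPrecisionType` is assumed):

```python
from typing import List

from executorch.backends.xnnpack.partition.config.xnnpack_config import (
    ConfigPrecisionType,
)

def supported_precision_types(self) -> List[ConfigPrecisionType]:
    # Advertise static-quant support in addition to fp32 so statically
    # quantized graphs can also match this config.
    return [
        ConfigPrecisionType.FP32,
        ConfigPrecisionType.STATIC_QUANT,
    ]
```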


mcr229 commented Jul 14, 2025

Let's wait for CI, but looking good!

@leafs1 merged commit dd6caa3 into pytorch:main Jul 15, 2025
95 of 98 checks passed
SS-JIA added a commit that referenced this pull request Jul 16, 2025
…12220)"

This reverts commit dd6caa3.

ghstack-source-id: f7e75ad
ghstack-comment-id: 3079037022
Pull Request resolved: #12542
SS-JIA added a commit that referenced this pull request Jul 16, 2025
…12220)" (#12542)

This reverts commit dd6caa3.

The imported diff is breaking an internal test:
[D78368033](https://www.internalfb.com/diff/D78368033). Please see the
diff for more details.
lucylq pushed a commit that referenced this pull request Jul 17, 2025
lucylq pushed a commit that referenced this pull request Jul 17, 2025
…12220)" (#12542)

This reverts commit dd6caa3.

The imported diff is breaking an internal test:
[D78368033](https://www.internalfb.com/diff/D78368033). Please see the
diff for more details.