-
Notifications
You must be signed in to change notification settings - Fork 742
Move the transpose matmul pass to OSS and run it earlier in the flow #10433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10433
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit c2f2316 with merge base 6b877de ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
This pull request was exported from Phabricator. Differential Revision: D73600069 |
…10433) Summary: That pass is doing a lot more than it looks, and it's just easier to move it back to where it was. CPU backends will possibly see more cycles due to added permutes, but we don't care about that. All DSP backends should be more efficient on transposed matmuls. Should that not be the case in the future, we can re-evaluate. When we do the survey of passes and reorder them properly, we can think about this more. Differential Revision: D73600069
47c55ec to
8eb2b6e
Compare
|
This pull request was exported from Phabricator. Differential Revision: D73600069 |
…10433) Summary: Pull Request resolved: #10433 That pass is doing a lot more than it looks, and it's just easier to move it back to where it was. CPU backends will possibly see more cycles due to added permutes, but we don't care about that. All DSP backends should be more efficient on transposed matmuls. Should that not be the case in the future, we can re-evaluate. When we do the survey of passes and reorder them properly, we can think about this more. Differential Revision: D73600069
8eb2b6e to
689a082
Compare
…10433) Summary: That pass is doing a lot more than it looks, and it's just easier to move it back to where it was. CPU backends will possibly see more cycles due to added permutes, but we don't care about that. All DSP backends should be more efficient on transposed matmuls. Should that not be the case in the future, we can re-evaluate. When we do the survey of passes and reorder them properly, we can think about this more. Reviewed By: hsharma35 Differential Revision: D73600069
689a082 to
8c3b621
Compare
|
This pull request was exported from Phabricator. Differential Revision: D73600069 |
…10433) Summary: That pass is doing a lot more than it looks, and it's just easier to move it back to where it was. CPU backends will possibly see more cycles due to added permutes, but we don't care about that. All DSP backends should be more efficient on transposed matmuls. Should that not be the case in the future, we can re-evaluate. When we do the survey of passes and reorder them properly, we can think about this more. Reviewed By: hsharma35 Differential Revision: D73600069
8c3b621 to
13e7cc7
Compare
…10433) Summary: Pull Request resolved: #10433 That pass is doing a lot more than it looks, and it's just easier to move it back to where it was. CPU backends will possibly see more cycles due to added permutes, but we don't care about that. All DSP backends should be more efficient on transposed matmuls. Should that not be the case in the future, we can re-evaluate. When we do the survey of passes and reorder them properly, we can think about this more. Reviewed By: hsharma35 Differential Revision: D73600069
|
This pull request was exported from Phabricator. Differential Revision: D73600069 |
13e7cc7 to
c2f2316
Compare
Summary:
That pass is doing a lot more than it looks, and it's just easier to move it back to where it was. CPU backends will possibly see more cycles due to added permutes, but we don't care about that. All DSP backends should be more efficient on transposed matmuls. Should that not be the case in the future, we can re-evaluate.
When we do the survey of passes and reorder them properly, we can think about this more.
Differential Revision: D73600069