-
Couldn't load subscription status.
- Fork 700
[ET-VK] Removing manual unroll in linear shader to improve overall performance. #15110
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15110
Note: Links to docs will display an error until the docs builds have been completed. ❌ 3 New FailuresAs of commit b19c6be with merge base 4c4f235 ( NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
@trivedivivek has exported this pull request. If you are a Meta employee, you can view the originating Diff in D84571616. |
…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Differential Revision: D84571616
9c3d279 to
c1369bc
Compare
…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Differential Revision: D84571616
…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Differential Revision: D84571616
…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Reviewed By: SS-JIA Differential Revision: D84571616
c1369bc to
280b330
Compare
…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Reviewed By: SS-JIA Differential Revision: D84571616
280b330 to
b19c6be
Compare
…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Reviewed By: SS-JIA Differential Revision: D84571616
…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Reviewed By: SS-JIA Differential Revision: D84571616
Summary:
Summary
This diff improves the overall performance of the linear shader by removing manual unrolling in the
linear_qcsnw_tiled.glslfile.The changes include:
[[unroll]]directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance.mat1fromVEC4_T[TILE_ROWS]toT[TILE_ROWS][4]to better match the access pattern in the loop.Differential Revision: D84571616