Skip to content

Conversation

@victor-eds
Copy link
Contributor

Support repCluster[0] > 2 by using 7-D tensors and adding a convert_layout operation before the final reshape.

See code for implementation details.

Signed-off-by: victor-eds [email protected]

@victor-eds victor-eds self-assigned this Oct 22, 2024
@victor-eds
Copy link
Contributor Author

Part of #2266.

…ort `repCluster[0] > 2`

Support `repCluster[0] > 2` by using 7-D tensors and adding a `convert_layout` operation before the final `reshape`.

See code for implementation details.

Signed-off-by: victor-eds <[email protected]>
@victor-eds victor-eds force-pushed the fast-sub-group-transpose-extend branch from fdd0512 to 6f74535 Compare October 22, 2024 13:45
@victor-eds victor-eds requested a review from jopperm October 24, 2024 10:13
@victor-eds victor-eds enabled auto-merge (squash) October 24, 2024 10:52
@victor-eds victor-eds disabled auto-merge October 24, 2024 11:37
@vlad-penkin vlad-penkin linked an issue Oct 30, 2024 that may be closed by this pull request
Copy link
Contributor

@jopperm jopperm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some minor documentation nits, but LGTM otherwise!

@victor-eds victor-eds merged commit 04a7b65 into intel:main Oct 31, 2024
4 checks passed
@victor-eds victor-eds deleted the fast-sub-group-transpose-extend branch October 31, 2024 10:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Port "sub-group transpose reduction" to default path

3 participants