Skip to content

Conversation

@victor-eds
Copy link
Contributor

Support repCluster[0] > 2 by using 7-D tensors and adding a convert_layout operation before the final reshape.

See code for implementation details.

…ort `repCluster[0] > 2`

Support `repCluster[0] > 2` by using 7-D tensors and adding a `convert_layout` operation before the final `reshape`.

See code for implementation details.

Signed-off-by: victor-eds <[email protected]>
@victor-eds victor-eds requested review from a team, etiotto and whitneywhtsang October 21, 2024 12:45
@victor-eds victor-eds self-assigned this Oct 21, 2024
@victor-eds
Copy link
Contributor Author

victor-eds commented Oct 21, 2024

This depends on #2491

@victor-eds
Copy link
Contributor Author

Part of #2266.

@etiotto etiotto linked an issue Oct 21, 2024 that may be closed by this pull request
@etiotto etiotto deleted the branch intel:fast-sub-group-transpose October 22, 2024 13:31
@etiotto etiotto closed this Oct 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Port "sub-group transpose reduction" to default path

2 participants