Skip to content

Conversation

@dkarrasch
Copy link
Contributor

This is a companion PR to JuliaLang/julia#52439. There, we avoid constructing MulAddMul objects, which are, however, used here in the most generic multiplication kernel. Along the way, I rearranged the call chain slightly to avoid unpacking already existing MulAddMul objects, just to reconstruct it within the multiplication kernel.

@maleadt maleadt marked this pull request as draft March 21, 2024 08:48
@maleadt maleadt marked this pull request as ready for review May 23, 2024 07:58
@maleadt maleadt merged commit 94bdb43 into JuliaGPU:master May 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants