Skip to content

Conversation

@shalinib-ibm
Copy link
Owner

Inside the 8x8 kernel, isoalate the packing and MMA Computation.

Not much performance differnce from 4.3 t/s to 4.1 t/s (llama-bench Q4 model p 128 n 1 t 1 )

Make sure to read the contributing guidelines before submitting a PR

Inside the 8x8 kernel, isoalate the packing and MMA Computation.

Not much performance differnce from 4.3 t/s to 4.1 t/s
(llama-bench Q4 model p 128 n 1 t 1 )

Signed-off-by: Shalini Salomi Bodapati <[email protected]>
@github-actions github-actions bot added the ggml label Oct 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants