Replies: 1 comment
-
Have you tried with the latest version of triton ? By reproducing the tutorial and changing the shapes as above, I have:
as expected. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
In the Matrix Multiplication tutorial, there is a comment and an assertion that claim if
K % BLOCK_SIZE_K != 0
, there would be out-of-bound memory access (which makes sense).But if I change the input matrices to shape (512, 513) and (513, 512) (i.e. K=513), with BLOCK_SIZE_K=32, the code still works and result is close to cublas result. What's going on?
Beta Was this translation helpful? Give feedback.
All reactions