Can someone please help me debug this code #7690
Unanswered
conquerer-wo
asked this question in
Q&A
Replies: 1 comment 1 reply
-
There don't seem to be any problems on triton/main.
-
I am new to Triton and have been trying to write a kernel that fuses two matrix multiplications into one, but I keep receiving a PassManager::run failed error. Could you please help me figure out where the problem is? (By the way, I'm on Triton version 3.3.)
This is the code:
import torch
import triton
import triton.language as tl

@triton.jit
def matmul_kernel(
):

def matmul(a: torch.Tensor, b: torch.Tensor, a2: torch.Tensor, b2: torch.Tensor):
    M, K = a.shape
    _, N = b.shape
    B, O = a2.shape
    _, S = b2.shape

f_t = torch.rand((4096, 65536), dtype=torch.float16, device='cuda')
s_t = torch.rand((4096, 1024), dtype=torch.float16, device='cuda')
e_t = torch.rand((4096, 65536), dtype=torch.float16, device='cuda')
b_t = torch.rand((65536, 512), dtype=torch.float16, device='cuda')
c, c1 = matmul(f_t.t().contiguous(), s_t, e_t, b_t)
print(c.shape)
print(c1.shape)
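For reference, the shapes passed to matmul above can be checked without a GPU. The following is a minimal sketch in plain Python; matmul_out_shape is a hypothetical helper written for illustration (it is not part of Triton or PyTorch), and it assumes both products are ordinary 2-D matrix multiplications, which the post suggests but does not show.

```python
from typing import Tuple

Shape = Tuple[int, int]


def matmul_out_shape(lhs: Shape, rhs: Shape) -> Shape:
    """Hypothetical helper: shape of lhs @ rhs, or raise if the inner dims differ."""
    m, k = lhs
    k2, n = rhs
    if k != k2:
        raise ValueError(f"inner dimensions differ: {k} vs {k2}")
    return (m, n)


# Shapes taken from the post. f_t is transposed before the call,
# so the first operand is (65536, 4096) rather than (4096, 65536).
a_shape = (65536, 4096)    # f_t.t().contiguous()
b_shape = (4096, 1024)     # s_t
a2_shape = (4096, 65536)   # e_t
b2_shape = (65536, 512)    # b_t

c_shape = matmul_out_shape(a_shape, b_shape)
c1_shape = matmul_out_shape(a2_shape, b2_shape)
print(c_shape)   # (65536, 1024)
print(c1_shape)  # (4096, 512)
```

Under these assumptions the inner dimensions match for both products, and the two outputs have different shapes, so a fused kernel would presumably need to cover two different output tile grids.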