Commit 3d6e242

bug fix
Signed-off-by: cliu-us <[email protected]>
Parent: c51ce60

1 file changed: +3 -1 lines changed

fms_mo/custom_ext_kernels/utils.py

Lines changed: 3 additions & 1 deletion
@@ -613,7 +613,9 @@ def exv2_i4f16_fxinputs_impl(
         (x.shape[0], q4_width), dtype=torch.float16, device=x.device
     )
 
-    exllamav2_kernels.gemm_half_q_half(x, q_handle, output, force_cuda)
+    gptqmodel.exllamav2_kernels.gemm_half_q_half(
+        x, q_handle, output, force_cuda
+    )
     return output.view(outshape)
 
 # Abstract implementation
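
The diff does not show the file's imports, so the exact failure this commit fixes is not visible here. One plausible reading is that only the `gptqmodel` package name is bound in the module, so the bare `exllamav2_kernels` name cannot be resolved at call time. The sketch below illustrates that general Python behavior under that assumption. It uses a hypothetical stand-in package, `fake_gptqmodel`, with a dummy `gemm_half_q_half`, so it runs without gptqmodel or a GPU; it is not the real kernel or the project's code.

```python
import sys
import types


def install_stand_in():
    """Register a hypothetical stand-in package with a kernel submodule."""
    pkg = types.ModuleType("fake_gptqmodel")
    sub = types.ModuleType("fake_gptqmodel.exllamav2_kernels")
    # Dummy replacement for the real kernel: it only reports that it ran.
    sub.gemm_half_q_half = lambda x, q_handle, output, force_cuda: "called"
    pkg.exllamav2_kernels = sub
    sys.modules["fake_gptqmodel"] = pkg
    sys.modules["fake_gptqmodel.exllamav2_kernels"] = sub


install_stand_in()
import fake_gptqmodel  # binds only the package name in this namespace


def call_unqualified():
    # The bare submodule name was never bound here, so the lookup fails.
    try:
        return exllamav2_kernels.gemm_half_q_half(None, None, None, False)
    except NameError as err:
        return f"NameError: {err}"


def call_qualified():
    # Attribute access through the imported package finds the submodule.
    return fake_gptqmodel.exllamav2_kernels.gemm_half_q_half(
        None, None, None, False
    )


print(call_unqualified())
print(call_qualified())
```

If the original module does bind `exllamav2_kernels` some other way (for example through a conditional import), the underlying cause may differ; the qualified call in the commit works in either case as long as `gptqmodel` is imported and exposes the submodule.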
