musa: fix failures in test-backend-ops for mul_mat_id op #15236

yeahdongcn · 2025-08-11T11:21:19Z

Make sure to read the contributing guidelines before submitting a PR

Testing Done

ToT:

root@xiaodongye-s80:/ws# ./build/bin/test-backend-ops  | grep FAIL
ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 MUSA devices:
  Device 0: MTT S80, compute capability 2.1, VMM: yes
[MUL_MAT_ID] NMSE = 0.473073968 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=0,m=512,n=32,k=256): FAIL
[MUL_MAT_ID] NMSE = 0.602911949 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=1,b=1,m=512,n=32,k=256): FAIL
[MUL_MAT_ID] NMSE = 0.801315053 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=0,m=512,n=32,k=256): FAIL
[MUL_MAT_ID] NMSE = 2.501388765 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=4,n_used=2,b=1,m=512,n=32,k=256): FAIL
[MUL_MAT_ID] NMSE = 1.142949490 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=0,m=512,n=129,k=256): FAIL
[MUL_MAT_ID] NMSE = 0.820664836 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=1,b=1,m=512,n=129,k=256): FAIL
[MUL_MAT_ID] NMSE = 0.730463410 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=0,m=512,n=32,k=256): FAIL
[MUL_MAT_ID] NMSE = 1.249068448 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=2,b=1,m=512,n=32,k=256): FAIL
[MUL_MAT_ID] NMSE = 1.097372514 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=0,m=512,n=32,k=256): FAIL
[MUL_MAT_ID] NMSE = 0.734338886 > 0.000500000   MUL_MAT_ID(type_a=f32,type_b=f32,n_mats=8,n_used=4,b=1,m=512,n=32,k=256): FAIL
  Backend MUSA0: FAIL
FAIL

With this fix:

root@xiaodongye-s80:/ws# ./build/bin/test-backend-ops | grep FAIL
ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 MUSA devices:
  Device 0: MTT S80, compute capability 2.1, VMM: yes

Signed-off-by: Xiaodong Ye <[email protected]>

JohannesGaessler

Github doesn't let me make the correct suggestion, but please apply the same fix to cp_async_available from which I copied the logic (and where it seems the defect doesn't manifest as a bug).

Signed-off-by: Xiaodong Ye <[email protected]>

musa: fix failures in test-backend-ops for mul_mat_id op

c29d5d4

Signed-off-by: Xiaodong Ye <[email protected]>

yeahdongcn requested a review from JohannesGaessler August 11, 2025 11:21

github-actions bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Aug 11, 2025

JohannesGaessler approved these changes Aug 11, 2025

View reviewed changes

Address review comments

7179053

Signed-off-by: Xiaodong Ye <[email protected]>

yeahdongcn merged commit 25ff6f7 into ggml-org:master Aug 12, 2025
43 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

musa: fix failures in test-backend-ops for mul_mat_id op #15236

musa: fix failures in test-backend-ops for mul_mat_id op #15236

Uh oh!

yeahdongcn commented Aug 11, 2025 •

edited

Loading

Uh oh!

JohannesGaessler left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

musa: fix failures in test-backend-ops for mul_mat_id op #15236

musa: fix failures in test-backend-ops for mul_mat_id op #15236

Uh oh!

Conversation

yeahdongcn commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Testing Done

Uh oh!

JohannesGaessler left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

yeahdongcn commented Aug 11, 2025 •

edited

Loading