Commit d10296c
authored
[release/2.6] fix scaled matmul and test_float8_basics_cuda (#2739)
This PR fixes:
- test_matmul_cuda.py::TestFP8MatmulCudaCUDA::test_float8_basics_cuda -
AssertionError: RuntimeError not raised
-
test_matmul_cuda.py::TestFP8MatmulCudaCUDA::test_scaled_mm_vs_emulated_row_wise_bfloat16_cuda
- AssertionError: Tensor-likes are not close!
need to swap A_SCALE and B_SCALE descriptors data if `use_rowwise` like
as
[HIPBLASLT_VEC_EXT](https://github.com/ROCm/pytorch/blob/78f6ff789a11bcdca072f019305485d1cf06c7eb/aten/src/ATen/cuda/CUDABlas.cpp#L1450-L1454)
Fixes SWDEV-5440981 parent 78f6ff7 commit d10296c
2 files changed
+9
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1447 | 1447 | | |
1448 | 1448 | | |
1449 | 1449 | | |
| 1450 | + | |
| 1451 | + | |
| 1452 | + | |
| 1453 | + | |
| 1454 | + | |
| 1455 | + | |
1450 | 1456 | | |
1451 | 1457 | | |
1452 | 1458 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
| 3 | + | |
3 | 4 | | |
4 | 5 | | |
5 | 6 | | |
| |||
356 | 357 | | |
357 | 358 | | |
358 | 359 | | |
359 | | - | |
| 360 | + | |
| 361 | + | |
360 | 362 | | |
361 | 363 | | |
362 | 364 | | |
| |||
0 commit comments