File tree
6 files changed: +11 -12 lines changed
- kernels/hgemm
- third-party

Lines changed: 3 additions & 4 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 4 | 4 | |
| 5 | 5 | |
| 6 | 6 | |
| 7 | | - |
| 8 | | - |
| 9 | | - |
| 10 | | - |
| | 7 | + |
| | 8 | + |
| | 9 | + |

Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 30 | 30 | |
| 31 | 31 | |
| 32 | 32 | |
| 33 | | - |
| | 33 | + |
| 34 | 34 | |
| 35 | 35 | |
| 36 | 36 | |
| | | |
| 43 | 43 | |
| 44 | 44 | |
| 45 | 45 | |
| 46 | | - |
| | 46 | + |
| 47 | 47 | |
| 48 | 48 | |
| 49 | 49 | |

Lines changed: 0 additions & 1 deletion
This file was deleted.
Lines changed: 4 additions & 4 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 27 | 27 | |
| 28 | 28 | |
| 29 | 29 | |
| 30 | | - |
| 31 | | - |
| 32 | | - |
| 33 | | - |
| | 30 | + |
| | 31 | + |
| | 32 | + |
| | 33 | + |
| 34 | 34 | |
| 35 | 35 | |
| 36 | 36 | |

- examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling.cu (+13, -6)
- examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/67_hopper_fp8_warp_specialized_gemm_with_groupwise_scaling.cu (+59, -25)
- examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/hopper_fp8_commandline.hpp (+7, -3)
- examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/reference/host/gemm_with_groupwise_scaling.h (+20, -13)
- examples/68_hopper_fp8_warp_specialized_grouped_gemm_with_blockwise_scaling/68_hopper_fp8_warp_specialized_grouped_gemm_with_blockwise_scaling.cu (+841)
- examples/68_hopper_fp8_warp_specialized_grouped_gemm_with_blockwise_scaling/CMakeLists.txt (+61)
- examples/68_hopper_fp8_warp_specialized_grouped_gemm_with_blockwise_scaling/hopper_fp8_commandline.hpp (+211)
- examples/68_hopper_fp8_warp_specialized_grouped_gemm_with_blockwise_scaling/reference/host/gemm_with_groupwise_scaling.h (+520)
- examples/81_blackwell_gemm_blockwise/81_blackwell_gemm_blockwise.cu (+585)
- examples/81_blackwell_gemm_blockwise/81_blackwell_gemm_groupwise.cu (+589)
- examples/81_blackwell_gemm_blockwise/CMakeLists.txt (+57)
- examples/CMakeLists.txt (+2)
- include/cutlass/detail/sm100_blockwise_scale_layout.hpp (+189)
- include/cutlass/gemm/collective/builders/sm100_blockwise_umma_builder.inl (+304)
- include/cutlass/gemm/collective/builders/sm90_gmma_builder.inl (+20, -15)
- include/cutlass/gemm/collective/collective_builder.hpp (+1)
- include/cutlass/gemm/collective/collective_mma.hpp (+2)
- include/cutlass/gemm/collective/fp8_accumulation.hpp (+24)
- include/cutlass/gemm/collective/sm100_mma_warpspecialized_blockwise_scaling.hpp (+1.2k)
- include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_rs_warpspecialized_mixed_input.hpp (+14)
- include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized.hpp (+13)
- include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8.hpp (+12)
- include/cutlass/gemm/collective/sm90_mma_array_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp (+1.0k)
- include/cutlass/gemm/collective/sm90_mma_tma_gmma_ss_warpspecialized_fp8_blockwise_scaling.hpp (+209, -125)
- include/cutlass/gemm/dispatch_policy.hpp (+91, -8)
- include/cutlass/gemm/kernel/gemm_universal.hpp (+1)
- include/cutlass/gemm/kernel/sm100_gemm_tma_warpspecialized_mma_transform.hpp (+1.0k)
- include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_cooperative.hpp (+5, -2)
- include/cutlass/gemm/kernel/sm90_gemm_array_tma_warpspecialized_pingpong.hpp (+3)