Skip to content

Commit 4e94523

Browse files
authored
[BENCHMARK][GEMM] Fix poor performance when running CUTLASS Benchmark on BMG (#4338)
This PR fixes the CUTLASS benchmark, which was running poorly on BMG. It modifies the pinned commit to point to CUTLASS release 3.9.2 and also adds the missing IGC arguments, as mentioned [here](https://github.com/codeplaysoftware/cutlass-sycl/blob/sycl-develop/media/docs/cpp/build/building_with_sycl_support.md) BMG CI: https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/15305727000/job/43057817132 --------- Signed-off-by: Jefferson Le Quellec <[email protected]>
1 parent 83345de commit 4e94523

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

benchmarks/cutlass_kernel/CMakeLists.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@ set(CUTLASS_KERNEL_FLAGS ${CUTLASS_KERNEL_FLAGS}
44
-fsycl-targets=intel_gpu_pvc,intel_gpu_bmg_g21
55
"SHELL:-Xspirv-translator=intel_gpu_pvc --spirv-ext=+SPV_INTEL_split_barrier"
66
"SHELL:-Xspirv-translator=intel_gpu_bmg_g21 --spirv-ext=+SPV_INTEL_split_barrier"
7+
-Xs "-options \"-igc_opts 'VISAOptions=-perfmodel,VectorAliasBBThreshold=1000,ExtraOCLOptions=-cl-intel-256-GRF-per-thread'\" -options -ze-opt-large-register-file"
78
)
89

910
Python3_add_library(cutlass_kernel MODULE WITH_SOABI python_main.cpp)
Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
bb48e86d2fe7cb09eab2e719e78d5811d3da3131
1+
dee33709bdc0cc579df49f251da894d4546b2624

0 commit comments

Comments
 (0)