Skip to content

Commit 6ad4d9a

Browse files
committed
[easywave][SYCL] Avoid invalid work-group size with DPC++
Instead of just checking `icpx`, also check for `clang++` in `#if` directive. Signed-off-by: Victor Perez <[email protected]>
1 parent a357dbd commit 6ad4d9a

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

easywave/SYCL/src/ewGpuNode.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -409,7 +409,7 @@ int CGpuNode::run()
409409
sycl::range<1> boundary_workgroup_size(256);
410410
sycl::range<1> boundary_size(INT_CEIL(std::max(dp.nI, dp.nJ), boundary_workgroup_size[0]));
411411

412-
#if defined(SYCL_LANGUAGE_VERSION) && defined(__INTEL_LLVM_COMPILER)
412+
#if (defined(SYCL_LANGUAGE_VERSION) && defined(__INTEL_LLVM_COMPILER)) || defined(SYCL_IMPLEMENTATION_INTEL)
413413
/* For Intel, prevent the nd_range_error: "Non-uniform work-groups are not supported by the target device -54 (CL_INVALID_WORK_GROUP_SIZE))". */
414414
/* Originally we had n = 128 threads, 32 for x and 128/x = 4 threads, hardcoded in the CUDA code. */
415415
sycl::range<2> compute_wnd_workgroup_size(4, 32);

0 commit comments

Comments
 (0)