Commit f54f61d

[ESIMD] Fix implementations of block_load(usm, ...) and block_load(acc) (#11797)
1) Fix the E2E test for block_load(): it did not actually check the mask variant, and it used wrong alignments.
2) Fix the comments for the USM and ACC block_load implementations.
3) Minor optimization for ACC block_load functions that do not accept the byte_offset operand: align16 can be assumed for them.

Signed-off-by: Klochkov, Vyacheslav N <[email protected]>
1 parent 3d0edd4 commit f54f61d

5 files changed: +377 -240 lines changed

sycl/include/sycl/ext/intel/esimd/detail/memory_intrin.hpp

Lines changed: 1 addition & 1 deletion
@@ -212,7 +212,7 @@ __ESIMD_INTRIN __ESIMD_DNS::vector_type_t<T, N * __ESIMD_DNS::to_int<VS>()>
 __esimd_lsc_load_merge_bti(
     __ESIMD_DNS::simd_mask_storage_t<N> pred,
     __ESIMD_DNS::vector_type_t<uint32_t, N> offsets, SurfIndAliasT surf_ind,
-    __ESIMD_DNS::vector_type_t<T, N * __ESIMD_DNS::to_int<VS>()> PassThru = 0)
+    __ESIMD_DNS::vector_type_t<T, N * __ESIMD_DNS::to_int<VS>()> PassThru)
 #ifdef __SYCL_DEVICE_ONLY__
     ;
 #else // __SYCL_DEVICE_ONLY__
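
The hunk above drops the `= 0` default from the `PassThru` parameter, so every caller must now pass the pass-through vector explicitly. As a rough illustration of what such a parameter usually means in a masked "merge" load, here is a plain host-side C++ sketch. It is not the ESIMD intrinsic: the function name `masked_load_merge`, the use of element indices instead of byte offsets, and the assumed semantics (lanes with a false predicate keep the pass-through value) are all assumptions made for this example.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical host-side model of a masked "merge" load.
// Assumption: lanes whose predicate is false keep the pass_thru value;
// lanes whose predicate is true read surface[offsets[i]].
// Offsets are element indices here (an assumption; the real intrinsic
// takes a vector of uint32_t offsets into a surface).
template <typename T, std::size_t N>
std::array<T, N> masked_load_merge(const std::vector<T> &surface,
                                   const std::array<std::uint32_t, N> &offsets,
                                   const std::array<bool, N> &pred,
                                   const std::array<T, N> &pass_thru) {
  // Start from the pass-through values; no default is provided,
  // mirroring the removal of "= 0" in the diff.
  std::array<T, N> result = pass_thru;
  for (std::size_t i = 0; i < N; ++i) {
    if (pred[i]) {
      // Only enabled lanes touch memory, so only they are bounds-checked.
      assert(offsets[i] < surface.size());
      result[i] = surface[offsets[i]];
    }
  }
  return result;
}
```

With no default value, a call that omits the last argument fails to compile, which makes the choice of values for masked-off lanes explicit at each call site.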
