-
Notifications
You must be signed in to change notification settings - Fork 65
Closed
Description
We recently set up a GPU CI for all the GPU backends of AcceleratedKernels.jl - it seems AMDGPU is the only platform where the call to strides(::ROCArray) fails inside a KernelAbstractions.jl kernel:
A few notes: I know @pxl-th ran the N-dimensional reduction and it worked on his machine, but it may have been with GPUArraysCore=0.2. The ecosystem hasn't yet fully updated to that version, so I had to add GPUArraysCore="0.1, 0.2" to [compat] - it may have been due to some combination of Adapt, Metal, and GPUArrays resulting in version conflicts.
I can do a hacky fix and pass the strides to the kernel as an explicit argument, but I wanted to know if this is a known issue and where / how it could be solved.
Metadata
Metadata
Assignees
Labels
No labels
