-
Notifications
You must be signed in to change notification settings - Fork 65
Closed
Description
I think I'm missing something basic with synchronization.
When using a simple @roc kernel launch inside a function we get an error in this AMDGPU.synchronize() line. The stacktrace can be seen in our CI using a recent AMDGPU.jl v0.8.6 on a MI100 with rocm 6.
I don't know if the first message in AMDGPU.jl in the stacktrace: [4] synchronize (repeats 2 times) @ ~/.julia/packages/AMDGPU/rrvsy/src/highlevel.jl:49 [inlined] provides any hints.
Works:
@roc groupsize = threads gridsize = threads * blocks _parallel_for_amdgpu(f, x...)
end
Fails:
@roc groupsize = threads gridsize = threads * blocks _parallel_for_amdgpu(f, x...)
AMDGPU.synchronize()
end
For reference the CUDA code works fine:
CUDA.@sync @cuda threads = threads blocks = blocks _parallel_for_cuda(f, x...)
end
Any help would be appreciated!
Metadata
Metadata
Assignees
Labels
No labels