Error triggered by synchronize() #603

@williamfgc

I think I'm missing something basic with synchronization.

When we launch a simple @roc kernel inside a function, the AMDGPU.synchronize() call that follows it raises an error. The stack trace can be seen in our CI, which uses a recent AMDGPU.jl release (v0.8.6) on an MI100 with ROCm 6.
I don't know whether the first AMDGPU.jl frame in the stack trace, [4] synchronize (repeats 2 times) @ ~/.julia/packages/AMDGPU/rrvsy/src/highlevel.jl:49 [inlined], provides any hints.

Works:

  @roc groupsize = threads gridsize = threads * blocks _parallel_for_amdgpu(f, x...)
end

Fails:

  @roc groupsize = threads gridsize = threads * blocks _parallel_for_amdgpu(f, x...)
  AMDGPU.synchronize()
end

For reference, the CUDA code works fine:

  CUDA.@sync @cuda threads = threads blocks = blocks _parallel_for_cuda(f, x...)
end

Any help would be appreciated!
