**Is your feature request related to a problem? Please describe.** GPU implementation of barrier functions **Additional context** https://kemenghuang.github.io/img/ugipc-arxiv.pdf