What
MPI language bindings for partitioned communication functions to be callable in an accelerator context.
Why
Performing MPI_Pready or MPI_Parrived operations from the accelerator context can provide an efficient/convenient way to overlap communication with computation on the accelerator.
Presentations
Resources