https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101581
--- Comment #2 from Tobias Burnus <burnus at gcc dot gnu.org> --- Thomas pointed out that, at least with CUDA, the cuMemcpyPeer function could be used (and cuMemcpy3DPeer for PR110813). (For cuMemcpy, there is also CU_MEMORYTYPE_UNIFIED which might be of some use.) BTW: "[patch] OpenMP: Call cuMemcpy2D/cuMemcpy3D for nvptx for omp_target_memcpy_rect" has been committed as r14-2792-g25072a477a56a7