sunce4t commented on issue #3102: URL: https://github.com/apache/brpc/issues/3102#issuecomment-3509403429
> > > 请教下这个需要区分Nv的卡型或者架构么?需要gdrcopy的库支持么? > > > > > > GDR本身没有依赖卡型和架构;cuda版本和ofed版本都需要特定版本及以上(通常是满足的,不必特定去安装);gdrcopy我们目前有使用,加速D2H拷贝,也可以不用,直接使用cudaMemCpy; > > 目前我们这边正在整理代码,先提供一版不使用gdrcopy的; > > 需要注意的就是:在H系列的卡之前,用GDR的话,不能保证RDMA写入立刻对GPU可见,需要一次额外的访问([https://github.com/NVIDIA/nccl/issues/1702)](https://github.com/NVIDIA/nccl/issues/1702%EF%BC%89) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
