[OMPI devel] RDMA with ob1 and openib

Sylvain Jeaugey Tue, 27 Apr 2010 10:15:43 -0400

Hi list,

I'm currently working on IB bandwidth improvements and maybe some of youmay help me understanding some things. I'm trying to align every IB RDMAoperation to 64 bytes, because having it unaligned can hurt yourperformance from lightly to very badly, depending on your architecture.

So, I'm trying to understand the RDMA protocol (PUT and GET), and here iswhat I understood :

* if we have one btl, RDMA is performed with only one GET operation,otherwise, we use multiple PUT operations. I can understand that the GEToperation improves asynchronous aspects. So, why not always use GEToperations ?

* if mpi_leave_pinned is 0, this is becoming more strange. We start arendez-vous (not RDMA) with a size equal to the eager limit, then weswitch to RDMA because the remote peer asks for RDMA PUTs (even ifbtl_openib_flags does not have the PUT operation btw). Why this cornercase ? Why not starting a normal RDMA (especially since we switch back toRDMA afterwards) ?

* the openib btl has a "buffer alignment" parameter. Fantastic, just whatI needed. Unfortunately, I can't see where it is used (and indeedperformance is bad if my buffers are not aligned to 64 bytes). Am Imissing something ?

* I did a prototype to split GET operations in openib into two operations: a small one to correct buffer alignment and a big aligned one. It wouldcertainly be better to perform the first one with a normal send/recv, butfor the prototype, doing it inside the openib GET was simpler. Performanceon unaligned buffers is much better (but this is just a prototype). Isthere anyone working on this right now or should I pursue my effort tomake it clean and stable ?


Thanks in advance for any feedback,
Sylvain

[OMPI devel] RDMA with ob1 and openib

Reply via email to