Re: [Beowulf] Performance characterising a HPC application

Scott Atchley Fri, 30 Mar 2007 17:44:18 -0800

On Mar 26, 2007, at 1:04 PM, Gilad Shainer wrote:

When Mellanox refers to transport offload, it mean full transport
offload - for all transport semantics. InfiniBand, as you probably
know, provides RDMA AND Send/Receive semantics, and in both cases
you can do Zero-copy operations.

This full flexibility provides the programmer with the ability tochoose

the
best semantics for his use. Some programmers choose Send/Receive and
some RDMA. It is all depends on their application.
From your response, I see that Qlogic does not provide this kind
of flexibility.


Gilad,

I have seen you make that point many times. This may be a sillyquestion, but it latency and throughput equivalent for both APIs forlarge and small messages?

I ask because I wrote the ports of Lustre and PVFS2 for MX and Ispent a lot of time looking at their existing IB code. I see them useSend/Recv for small and/or unexpected messages. Both use IB write forlarge payloads.

Although both use IB write (one-sided, no?) for the large payload,both require one or two small Send/Recv messages to serve as RTS andCTS before they can initiate the one-sided implementation. In effect,they have to write their own Send/Recv (two-sided) semantics on ofIB's RDMA.

If Send/Recv performance is on par with RDMA on IB, why not use thatAPI for large messages? Why re-write Send/Recv every time they useRDMA? The code to implement PVFS2 on MX is over 30% smaller than theIB code because I did not have to re-write Send/Recv.


Scott
_______________________________________________
Beowulf mailing list, [email protected]
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] Performance characterising a HPC application

Reply via email to