Re: [OMPI devel] openib btl header caching

Jeff Squyres Mon, 13 Aug 2007 10:34:28 -0400

On Aug 12, 2007, at 3:49 PM, Gleb Natapov wrote:

- Mellanox tested MVAPICH with the header caching; latency was around
1.4us
- Mellanox tested MVAPICH without the header caching; latency was
around 1.9us

As far as I remember the Mellanox results, and according to our
testing, the difference between MVAPICH with header caching and OMPI
is 0.2-0.3us, not 0.5us. And MVAPICH without header caching is
actually worse than OMPI for small messages.

I guess reading the graph that Pasha sent is difficult; Pasha -- can
you send the actual numbers?

Given that OMPI is the lone outlier around 1.9us, I think we have no
choice except to implement the header caching and/or examine our
header to see if we can shrink it.  Mellanox has volunteered to
implement header caching in the openib btl.

I think we have a choice: not implement header caching, but just
change the osu_latency benchmark to send each message with a
different tag :)


If only.  :-)

But that misses the point (and the fact that all the common ping-pong
benchmarks use a single tag: NetPIPE, IMB, osu_latency, etc.).  *All
other MPIs* give us latency around 1.4us, but Open MPI is around
1.9us.  So we need to do something.

Are we optimizing for a benchmark?  Yes.  But we have to do it.  Many
people know that these benchmarks are fairly useless, but not enough
-- too many customers do not, and education is not enough.  "Sure
this MPI looks slower but, really, it isn't.  Trust me; my name is
Joe Isuzu."  That's a hard sell.

I am not against header caching per se, but if it will complicate the
code even a little bit I don't think we should implement it just to
benefit one fabricated benchmark (AFAIR before header caching was
implemented in MVAPICH, mpi_latency actually sent messages with
different tags).

That may be true and a reason for us to wail and gnash our teeth, but
it doesn't change the current reality.

Also there is really nothing to cache in the openib BTL. The openib
BTL header is 4 bytes long. The caching will have to be done in OB1,
and there it will affect every other interconnect.

Surely there is *something* we can do -- what, exactly, is the
objection to peeking inside the PML header down in the btl?  Is it
really so horrible for a btl to look inside the upper layer's
header?  I agree that the PML looking into a btl header would
[obviously] be Bad.

All this being said -- is there another reason to lower our latency?
My main goal here is to lower the latency.  If header caching is
unattractive, then another method would be fine.
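
For anyone following along who hasn't looked at the technique, here is
a rough sketch of what "header caching" means in general, and why a
single-tag ping-pong benefits from it while a benchmark that changes
the tag on every message would not.  This is only an illustration
under assumptions: match_hdr_t, hdr_cache_t, hdr_encode and hdr_decode
are hypothetical names, the field layout is invented, and none of it
is the actual OB1, openib or MVAPICH code.  The sender keeps the last
match header it sent to a peer; if the next message differs only by
the expected sequence increment, it sends a one-byte marker instead of
the full header, and the receiver rebuilds the header from its own
copy.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical match header; NOT the real OB1 layout. */
typedef struct {
    uint16_t ctx;  /* communicator context id */
    int32_t  src;  /* source rank */
    int32_t  tag;  /* MPI tag */
    uint16_t seq;  /* per-peer sequence number */
} match_hdr_t;

/* One cache per peer, kept on both the sending and receiving side. */
typedef struct {
    int         valid;  /* nonzero once a full header has been seen */
    match_hdr_t last;   /* last header sent to / received from the peer */
} hdr_cache_t;

enum { HDR_FULL = 0, HDR_CACHED = 1 };

/* Sender side: write the header to `wire`, return bytes written.
 * `wire` must have room for 1 + sizeof(match_hdr_t) bytes. */
size_t hdr_encode(hdr_cache_t *c, const match_hdr_t *h, uint8_t *wire)
{
    assert(c != NULL && h != NULL && wire != NULL);
    if (c->valid &&
        c->last.ctx == h->ctx && c->last.src == h->src &&
        c->last.tag == h->tag &&
        h->seq == (uint16_t)(c->last.seq + 1)) {
        /* Cache hit: only the sequence number advanced. */
        wire[0] = HDR_CACHED;
        c->last.seq = h->seq;
        return 1;
    }
    /* Cache miss (first message, or ctx/src/tag changed): full header. */
    wire[0] = HDR_FULL;
    memcpy(wire + 1, h, sizeof *h);
    c->last = *h;
    c->valid = 1;
    return 1 + sizeof *h;
}

/* Receiver side: rebuild the header, return bytes consumed.
 * Returns 0 if a cached marker arrives before any full header. */
size_t hdr_decode(hdr_cache_t *c, const uint8_t *wire, match_hdr_t *out)
{
    assert(c != NULL && wire != NULL && out != NULL);
    if (wire[0] == HDR_CACHED) {
        if (!c->valid)
            return 0;
        c->last.seq = (uint16_t)(c->last.seq + 1);
        *out = c->last;
        return 1;
    }
    memcpy(out, wire + 1, sizeof *out);
    c->last = *out;
    c->valid = 1;
    return 1 + sizeof *out;
}
```

With one tag, every message after the first hits the cache and costs
one byte of header in this sketch; change the tag each time and every
message takes the full-header path.  Note also that the comparison has
to look at upper-layer fields (ctx, src, tag), which is exactly the
layering question above.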


--
Jeff Squyres
Cisco Systems
