On Jun 4, 2009, at 3:53 AM, Lars Andersson wrote:

In my second test, I simply put a sleep(3) at point 2), and expected
the MPI_Wait() call at 3) to finish almost instantly, since I assumed
that the message would have been transferred during the sleep. To my
disappointment though, it took more or less the same time to finish the
MPI_Wait as without any sleep.


As you found by googling, and as Bogdan infers, Open MPI doesn't currently make much progress over TCP-based networks "in the background." And you're right that putting an MPI_WAIT in a progress thread would cause that thread to spin heavily, effectively taking many of your CPU cycles away from you, and possibly even having other bad effects (e.g., cache thrashing, context switching, etc.).

I'd say that your best workaround here is to intersperse MPI_TEST calls periodically. This will trigger OMPI's pipelined protocol for large messages, and should allow partial bursts of progress while you're presumably off doing useful work. If this is difficult because the work is being done in library code that you can't change, then perhaps a pre-spawned "work" thread could be used to call MPI_TEST periodically. That way, it won't steal huge amounts of CPU cycles (like MPI_WAIT would). You still might get some cache thrashing, context switching, etc. -- YMMV.
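
To make the first suggestion concrete, here's a minimal sketch (not from the original mail) of interspersing MPI_Test calls with computation so that the pipelined transfer keeps moving; the message size, the loop count, and do_some_work() are hypothetical placeholders:

/* Sketch: overlap a large nonblocking transfer with computation by
 * poking MPI_Test between work steps, so Open MPI can keep pushing the
 * pipelined TCP transfer along.  N, the loop count, and do_some_work()
 * are placeholders -- adapt to your application. */
#include <mpi.h>
#include <stdlib.h>

#define N (8 * 1024 * 1024)        /* assumed large message: 8M doubles */

static void do_some_work(int step) { (void)step; /* real computation here */ }

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);

    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    double *buf = malloc(N * sizeof(double));
    MPI_Request req = MPI_REQUEST_NULL;
    int done = 0;

    if (rank == 0)
        MPI_Isend(buf, N, MPI_DOUBLE, 1, 0, MPI_COMM_WORLD, &req);
    else if (rank == 1)
        MPI_Irecv(buf, N, MPI_DOUBLE, 0, 0, MPI_COMM_WORLD, &req);

    for (int step = 0; step < 1000; ++step) {
        do_some_work(step);                            /* useful work */
        if (!done && req != MPI_REQUEST_NULL)
            MPI_Test(&req, &done, MPI_STATUS_IGNORE);  /* nudge progress */
    }

    if (req != MPI_REQUEST_NULL)
        MPI_Wait(&req, MPI_STATUS_IGNORE);             /* finish the rest */

    free(buf);
    MPI_Finalize();
    return 0;
}

Run it with at least two processes (e.g., mpirun -np 2 ./a.out); rank 0 sends, rank 1 receives, and each MPI_Test gives the library a chance to move another chunk of the message.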

As for exactly how many times / how often you should call MPI_TEST, that is going to be up to you. It's going to depend on a lot of factors -- how big the message is, how well synchronized you are with the receiver, what strategy you use to call MPI_TEST, etc.
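
And here's a hedged sketch of the pre-spawned "work"-thread variant: a helper thread that polls the request with MPI_Test and sleeps in between, rather than spinning in MPI_Wait. It assumes pthreads, an MPI library initialized via MPI_Init_thread with thread support, and an arbitrary 1 ms poll interval -- tune that for your message sizes:

/* Sketch of the progress-thread idea: poll with MPI_Test, sleep between
 * polls, so the thread doesn't burn a whole core the way MPI_Wait would.
 * overlap_with_work() and the 1 ms interval are hypothetical. */
#include <mpi.h>
#include <pthread.h>
#include <unistd.h>

struct progress_arg {
    MPI_Request *req;         /* the pending MPI_Isend/MPI_Irecv request */
    unsigned int poll_usec;   /* sleep between MPI_Test calls, in usec   */
};

static void *progress_thread(void *p)
{
    struct progress_arg *arg = p;
    int done = 0;
    while (!done) {
        MPI_Test(arg->req, &done, MPI_STATUS_IGNORE);  /* drive progress */
        if (!done)
            usleep(arg->poll_usec);  /* yield the CPU instead of spinning */
    }
    return NULL;
}

/* Hypothetical helper: post your nonblocking call into *req, then let the
 * library code you can't change run while the thread keeps things moving. */
static void overlap_with_work(MPI_Request *req, void (*work)(void))
{
    struct progress_arg arg = { req, 1000 /* 1 ms, assumed */ };
    pthread_t tid;
    pthread_create(&tid, NULL, progress_thread, &arg);
    work();                    /* long-running work you can't change */
    pthread_join(&tid, NULL);  /* transfer is complete once this returns */
}

Since a second thread is making MPI calls, you'd want to request at least MPI_THREAD_SERIALIZED (MPI_THREAD_MULTIPLE to be safe) from MPI_Init_thread, and Open MPI's thread support has its own caveats -- again, YMMV.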

Open MPI may someday handle this better, either by having a blocking form of MPI_WAIT (i.e., not spinning, or spinning considerably less) or by making true TCP progress in the background. But if I had to guess, I'd say that we'll likely do the former before the latter.

--
Jeff Squyres
Cisco Systems