Jeff Squyres wrote:
We get this question so much that I really need to add it to the FAQ. :-\
Open MPI currently always spins for completion for exactly the reason
that Scott cites: lower latency.
Arguably, when using TCP, we could probably get a bit better performance
by blocking and allowing the kernel to make more progress than a single
quick pass through the sockets progress engine, but that involves some
other difficulties such as simultaneously allowing shared memory
progress. We have ideas about how to make this work, but it has unfortunately
remained at a lower priority: the performance difference isn't that
great, and we've been focusing on the other, lower latency interconnects
(shmem, MX, verbs, etc.).
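To put a number on what that spinning looks like from the application
side: a rank that is merely waiting for its next message still occupies
a full core. A minimal (purely illustrative) two-rank reproducer makes
it visible: rank 0 sits in MPI_Recv while rank 1 sleeps, and rank 0's
core shows ~100% in top the whole time.

    #include <mpi.h>
    #include <stdio.h>
    #include <unistd.h>

    /* Illustrative reproducer: rank 0 waits in MPI_Recv while rank 1
     * sleeps for 30 seconds.  While waiting, rank 0's core stays at
     * ~100% even though nothing is arriving on the wire. */
    int main(int argc, char **argv)
    {
        int rank, token = 0;

        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        if (rank == 0) {
            MPI_Recv(&token, 1, MPI_INT, 1, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            printf("rank 0: received %d\n", token);
        } else if (rank == 1) {
            sleep(30);              /* stand-in for an idle front end */
            token = 42;
            MPI_Send(&token, 1, MPI_INT, 0, 0, MPI_COMM_WORLD);
        }

        MPI_Finalize();
        return 0;
    }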
Whilst I understand that you have other priorities, and I am grateful for
the leverage I get by using Open MPI, I would like to offer an
alternative use case, which I believe may become more common.
We're developing parallel software which is designed to be used
*interactively* as well as in batch mode. We want the same SIMD code
to run on a user's quad-core workstation as it does on a 1,000-node cluster.
For the former case (a single workstation), it would be *much* more
user-friendly and interactive if the back-end MPI code did not spin at
100% CPU while it is merely waiting for the next front-end command. As
it stands, the GUI thread never gets a look-in.
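Our current workaround (not what I'm asking for, just to show the shape
of the problem) is to poll with MPI_Iprobe and sleep briefly between
polls, trading a little latency for an otherwise idle core. Something
along these lines, where CMD_TAG and the message layout are just
illustrative:

    #include <mpi.h>
    #include <unistd.h>

    #define CMD_TAG 1   /* hypothetical tag used for front-end commands */

    /* Wait for the next front-end command without pinning a core:
     * poll with MPI_Iprobe and sleep ~1 ms between polls. */
    static int wait_for_command(void)
    {
        MPI_Status status;
        int flag = 0, cmd = 0;

        while (!flag) {
            /* Cheap, non-blocking check for a pending command message. */
            MPI_Iprobe(MPI_ANY_SOURCE, CMD_TAG, MPI_COMM_WORLD,
                       &flag, &status);
            if (!flag)
                usleep(1000);       /* give the core back between polls */
        }
        MPI_Recv(&cmd, 1, MPI_INT, status.MPI_SOURCE, CMD_TAG,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        return cmd;
    }

It works, but it is exactly the polling-versus-latency compromise I'd
rather the library made internally, at a finer grain than an
application reasonably can.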
I can only guess at the difficulties involved, but if the POSIX calls
select() and pthread_cond_wait() can do it for TCP sockets and
shared-memory threads respectively, it can't be impossible!
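To be clear, by "blocking" I mean nothing more exotic than those usual
POSIX primitives, which park the thread in the kernel and burn no CPU
until woken. Very roughly (a sketch of the primitives themselves, not a
proposal for how the progress engine should be structured):

    #include <pthread.h>
    #include <sys/select.h>

    /* Sleep until socket 'fd' becomes readable; the kernel parks the
     * thread, so no CPU is consumed while nothing is arriving. */
    static int wait_readable(int fd)
    {
        fd_set rfds;
        FD_ZERO(&rfds);
        FD_SET(fd, &rfds);
        return select(fd + 1, &rfds, NULL, NULL, NULL);
    }

    /* Sleep until another thread signals that shared-memory work
     * exists; '*ready' is the usual predicate guarded by 'lock'. */
    static void wait_for_work(pthread_mutex_t *lock, pthread_cond_t *cv,
                              int *ready)
    {
        pthread_mutex_lock(lock);
        while (!*ready)             /* re-check: spurious wakeups happen */
            pthread_cond_wait(cv, lock);
        pthread_mutex_unlock(lock);
    }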
Just my 2c,
Simon