There might be one reason for slowing down the application quite a bit. If the timer you're using interacts with libevent (the library we use internally to manage all kinds of events), then we might end up in a situation where we call poll on every iteration of the event library, and this is really expensive.
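To make the mechanism concrete, here is a minimal sketch of the kind of interval-timer registration a sampling tool typically performs (I'm assuming something like setitimer()/SIGPROF here; that's a guess on my part, not something confirmed about OSS). A signal firing in the middle of the event loop is exactly the kind of interaction that could change how often we fall into poll:

    #include <signal.h>
    #include <string.h>
    #include <sys/time.h>

    /* Hypothetical sampling setup -- only to illustrate the mechanism,
     * not the actual OSS code. */
    static void sample_handler(int sig)
    {
        /* record a sample here; the handler can interrupt the event
         * library anywhere, including in or around poll() */
    }

    static void install_sampling_timer(void)
    {
        struct sigaction sa;
        struct itimerval it;

        memset(&sa, 0, sizeof(sa));
        sa.sa_handler = sample_handler;
        sa.sa_flags = SA_RESTART;        /* without this, poll() can return EINTR */
        sigaction(SIGPROF, &sa, NULL);

        it.it_interval.tv_sec = 0;
        it.it_interval.tv_usec = 10000;  /* 10ms sampling period (made-up value) */
        it.it_value = it.it_interval;
        setitimer(ITIMER_PROF, &it, NULL);
    }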

A quick way to figure out if this is the case is to run Open MPI without support for shared memory (--mca btl ^sm). This way we will call poll on a regular basis anyway, and if there is no difference between a normal run and an OSS run, we know at least where to start looking ...
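For example (the process count and application name below are just placeholders):

    mpirun --mca btl ^sm -np 4 ./my_app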

  george.

On Jan 12, 2009, at 13:00 , Jeff Squyres wrote:

On Jan 9, 2009, at 12:39 AM, William Hachfeld wrote:

Can any of the OpenMPI developers speculate as to possible mechanisms by which the ptrace() attachment, signal handler, or timer registration and corresponding signal delivery could cause large amounts of time to be spent within the "progress" functions of the OpenMPI library with an apparent lack of any real progress? Any ideas/information would be greatly appreciated.


Hum; interesting. I can't think of any reason why that would be a problem offhand. The mca_btl_sm_component_progress() function is the shared memory progression function. opal_progress() and mca_bml_r2_progress() are likely mainly dispatching off to this function.
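To illustrate what I mean by "dispatching off", the progress path is essentially a busy loop over registered callbacks, something along these lines (a simplified sketch with made-up names, not the actual Open MPI source), which is why time accumulates in the progress functions even when each call finds nothing to do:

    /* Simplified sketch of the progress-dispatch pattern. */
    typedef int (*progress_cb_t)(void);

    static progress_cb_t callbacks[8];
    static int num_callbacks;

    int sketch_opal_progress(void)
    {
        int i, events = 0;
        for (i = 0; i < num_callbacks; ++i) {
            events += callbacks[i]();   /* e.g. the sm BTL polling its queues */
        }
        return events;                  /* 0 == no real progress on this pass */
    }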

Does OSS interfere with shared memory between processes in any way? (I'm not enough of a kernel guy to know what the ramifications of ptrace and whatnot are)

--
Jeff Squyres
Cisco Systems

_______________________________________________
