[OMPI devel] trunk hang (when remote orted has to spawn another orted?)

2012-05-08 Thread Eugene Loh
Here is another trunk hang. I get it if I use at least three remote nodes. E.g., with r26385:

% mpirun -H remoteA,remoteB,remoteC -n 2 hostname
[remoteA:20508] [[54625,0],1] ORTE_ERROR_LOG: Not found in file base/ess_base_fns.c at line 135
[remoteA:20508] [[54625,0],1] unable to get hostname

Re: [OMPI devel] trunk hang (when remote orted has to spawn another orted?)

2012-05-08 Thread Ralph Castain
Fixed - r26406

On May 7, 2012, at 10:35 PM, Eugene Loh wrote:

> Here is another trunk hang. I get it if I use at least three remote nodes.
> E.g., with r26385:
>
> % mpirun -H remoteA,remoteB,remoteC -n 2 hostname
> [remoteA:20508] [[54625,0],1] ORTE_ERROR_LOG: Not found in file
> base/ess_

[OMPI devel] The Architecture of Open Source Applications (vol 2)

2012-05-08 Thread Jeff Squyres (jsquyres)
I wrote a chapter about Open MPI in "The Architecture of Open Source Applications, volume 2", which was just made available in dead tree form today:

http://blogs.cisco.com/performance/the-architecture-of-open-source-applications-volume-ii/

All royalties from this book go to Amnesty Internationa

[OMPI devel] New MCA param: odls_base_exit_status_77_fatal

2012-05-08 Thread Jeff Squyres
This commit adds a new MCA param that people should set in their MTT testing environments:

MCA odls: parameter "odls_base_exit_status_77_fatal" (current value: <1>, data source: default value)
          Whether to kill an entire job if any process in that job exits nor