Crud, ok. I added this info to https://svn.open-mpi.org/trac/ompi/ticket/1253 ; hopefully we'll resolve it today.

I guess people didn't test the libevent-merge branch before we brought it to the trunk. :-(


On Mar 25, 2008, at 9:22 AM, Tim Prins wrote:
I was able to replicate the failure with a debug build by running mpirun
through a batch job. I then added the parameter you gave me, and it
worked fine with the parameter.

Thanks,

Tim

Jeff Squyres wrote:
We're chasing down a problem that we're having on OSX w.r.t. libevent,
too -- can you try running with:

   --mca opal_event_include select

and see if that fixes the problem for you?


On Mar 25, 2008, at 8:49 AM, Tim Prins wrote:
Hi everyone,

For the last couple nights ALL of our mtt runs have been failing
(although the failure is masked because mpirun is returning the wrong
error code) with:

[odin005.cs.indiana.edu:28167] [[46567,0],0] ORTE_ERROR_LOG: Error
in file
base/plm_base_launch_support.c at line 161
--------------------------------------------------------------------------
mpirun was unable to start the specified application as it encountered
an error.
More information may be available above.
--------------------------------------------------------------------------

This line is where we try to do an IOF push. It looks like it was
broken
somewhere between r17922 and r17926, which includes the libevent
merge.

I cannot replicate this with a debug build, so I thought I would throw
this out before I look any further.

Thanks,

Tim
_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel



_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel


--
Jeff Squyres
Cisco Systems

Reply via email to