Re: [OMPI devel] Non-zero exit status

2012-04-14 Thread TERRY DONTJE
On 4/13/2012 6:40 PM, Ralph Castain wrote: Did you have the param set? I found some missing code in the orted errmgr that contributed to it, but unless you had set the param in your test, there is no way it would abort no matter how many procs exit with non-zero status. Is mpirun sticking aro

Re: [OMPI devel] Non-zero exit status

2012-04-13 Thread Ralph Castain
Did you have the param set? I found some missing code in the orted errmgr that contributed to it, but unless you had set the param in your test, there is no way it would abort no matter how many procs exit with non-zero status. I'm guessing you have that param set in your test due to our earlier

Re: [OMPI devel] Non-zero exit status

2012-04-13 Thread TERRY DONTJE
I could see that if fewer than N processes exit with a non-zero exit code, ORTE may choose not to abort the job. However, if all N processes have exited or aborted, I expect everything to clean up and mpirun to exit. It does not do that at the moment, which I think is what is causing most of th
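
The behavior discussed in this thread can be summarized as a small decision rule. The sketch below is only an illustration of that rule, not ORTE code: the function name `job_should_end` and the flag `abort_on_nonzero` are hypothetical stand-ins (the thread does not name the actual parameter), and the real logic lives in the orted errmgr.

```python
from typing import Optional, Sequence


def job_should_end(exit_codes: Sequence[Optional[int]],
                   abort_on_nonzero: bool) -> bool:
    """Decide whether the launcher should tear the job down.

    exit_codes holds one entry per process: an int once the process has
    exited, or None while it is still running. abort_on_nonzero is a
    hypothetical stand-in for the unnamed param mentioned in the thread.
    """
    finished = [code for code in exit_codes if code is not None]

    # Expectation stated in the thread: once all N processes have exited
    # or aborted, everything should clean up and the launcher should exit.
    if len(finished) == len(exit_codes):
        return True

    # Otherwise a non-zero exit only ends the job when the param is set.
    return abort_on_nonzero and any(code != 0 for code in finished)
```

With the flag unset, `job_should_end([1, None, None], False)` is false because two processes are still running, while `job_should_end([1, 1, 1], False)` is true because nothing is left to wait for.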