Fixed by r25313. Thanks, george.
On Oct 11, 2011, at 16:54 , Ralph Castain wrote: >> The patches (25245, 25248) proposed so far as a solution to this problem >> should be removed, as they do not really solve the problem, instead they >> alleviate the symptoms. From here there are two possible fixes: >> >> 1. Put back the code dealing with the daemons leaving the job in the >> "default" version of the orted error manager. >> >> Here are the lines to be added in update_status in >> orte/mca/errmgr/default_orted/errmgr_default_orted.c: >> >> if (0 == orte_routed.num_routes() && >> 0 == opal_list_get_size(&orte_local_children)) { >> orte_quit(); >> } > > Thanks for looking at this more closely. I'll restore those lines, and see if > we are actually getting there. Could be the system I'm using behaves > differently.