Fixed by r25313.
Thanks,
george.
On Oct 11, 2011, at 16:54 , Ralph Castain wrote:
>> The patches (25245, 25248) proposed so far as a solution to this problem
>> should be removed, as they do not really solve the problem, instead they
>> alleviate the symptoms. From here there are two possible fixes:
>>
>> 1. Put back the code dealing with the daemons leaving the job in the
>> "default" version of the orted error manager.
>>
>> Here are the lines to be added in update_status in
>> orte/mca/errmgr/default_orted/errmgr_default_orted.c:
>>
>> if (0 == orte_routed.num_routes() &&
>> 0 == opal_list_get_size(&orte_local_children)) {
>> orte_quit();
>> }
>
> Thanks for looking at this more closely. I'll restore those lines, and see if
> we are actually getting there. Could be the system I'm using behaves
> differently.