I filed this as https://svn.open-mpi.org/trac/ompi/ticket/1233 so that
it would not be forgotten.
On Feb 18, 2008, at 10:53 AM, Tim Prins wrote:
Hi all,
This is a bit strange, so I thought I'd ping the group before digging
any further.
The onesided test 't_win' is failing for us (specifica
Looks like it works.
Aurelien
On 6 March 08, at 10:36, Ralph Castain wrote:
I believe I have at least helped reduce this with r17761. I added the
ability for procs to detect that their "lifeline" connection (either
the HNP for unity routed, or their local daemon for tree) has been lost and
We now use the errmgr.
Aurelien
On 6 March 08, at 13:38, Aurélien Bouteiller wrote:
Aside from what Josh said, we are working right now at UTK on orted/MPI
recovery (without killing/respawning all). So far we have had no use for
the errmgr, but I'm quite sure this would be the smartest place to put
I noticed that when btl_remove_procs is invoked on the openib BTL
(e.g., when you "mpirun --mca btl self,openib ...", an openib endpoint
will be removed because self's exclusivity will edge it out), the
openib remove_procs() function will not remove the corresponding
endpoint on mca_btl_ope