Re: [OMPI devel] OOB-TCP Retries

2008-10-30 Thread Leonardo Fialho
I´m not an expert in C neither Open MPI, but I´m a volunteer. Leonardo Ralph Castain escribió: Sorry for delayed response - had some things to finish, then had to stare at this code for awhile. Unfortunately, the OOB is a snarled can of hideous worms. It looks to me that the OOB continues to

Re: [OMPI devel] OOB-TCP Retries

2008-10-22 Thread Ralph Castain
Sorry for delayed response - had some things to finish, then had to stare at this code for awhile. Unfortunately, the OOB is a snarled can of hideous worms. It looks to me that the OOB continues to attempt to complete any pending message requests once it detects that retries have exceeded t

[OMPI devel] OOB-TCP Retries

2008-10-17 Thread Leonardo Fialho
Hi All, I´m doing some experiments and modifications in my heartbeat code witch uses the OOB-TCP communication channel. My modified orteds and orterun does not abort all processes when one orted die. The problem is: 1) I kill an orted, so another orted detect the fault when try to send a