On Wed, Jan 30, 2008 at 12:05:50PM -0500, George Bosilca wrote:

> What is the real issue behind this whole discussion?

Hanging connections. See

   https://svn.open-mpi.org/trac/ompi/ticket/1206

The multi-address peer tries to connect, but btl_tcp_proc_accept rejects
the connection because the source address matches none of the expected
addresses (there are fewer btl_endpoints than possible source addresses).
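
To illustrate the mismatch, here is a minimal sketch in C. It is not the
actual Open MPI code: struct endpoint, struct proc, proc_accept and
count_rejected are hypothetical names made up for this mail, and
addresses are compared as plain strings for brevity.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical, simplified model; NOT the real Open MPI structures. */
struct endpoint {
    const char *addr;            /* the one peer address this endpoint expects */
};

struct proc {
    struct endpoint endpoints[4];
    int num_endpoints;           /* may be smaller than the peer's address count */
};

/* Return 1 if some endpoint expects the source address, 0 otherwise. */
static int proc_accept(const struct proc *p, const char *src)
{
    assert(p != NULL && src != NULL);
    for (int i = 0; i < p->num_endpoints; i++) {
        if (strcmp(p->endpoints[i].addr, src) == 0) {
            return 1;            /* address matches: connection accepted */
        }
    }
    return 0;                    /* no match: connection refused */
}

/* Count how many of the peer's possible source addresses get refused. */
static int count_rejected(const struct proc *p, const char **srcs, int n)
{
    int rejected = 0;
    for (int i = 0; i < n; i++) {
        if (!proc_accept(p, srcs[i])) {
            rejected++;
        }
    }
    return rejected;
}
```

With one endpoint expecting 10.0.0.1 and a peer that may connect from
either 10.0.0.1 or 10.0.0.2, the second source address is refused. In
this model, that refusal corresponds to the case where the connecting
side is left hanging.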

r17331 and r17332 haven't fixed the issue. Don't code when leaving the
office ;) I'll have a look at it tomorrow.

Sorry for all the noise in the trunk.

> multiple IP addresses by interface the connection step will work. Now  
> I can see a benefit of having multiple socket over the same link (and  
> it's already implemented in Open MPI), but I don't see the interest of  
> using multiple IP in this case.

I have an easy-to-reproduce test case for #1206. If you like, we can step
through it in the debugger in a shared screen (screen -x) or VNC session.

Just mail me if you're interested. ;)



-- 
Cluster and Metacomputing Working Group
Friedrich-Schiller-Universität Jena, Germany

private: http://adi.thur.de
