On Wed, Jan 30, 2008 at 12:05:50PM -0500, George Bosilca wrote: > What is the real issue behind this whole discussion?
Hanging connections. See https://svn.open-mpi.org/trac/ompi/ticket/1206 The multi-address peer tries to connect, but btl_tcp_proc_accept denies due to not matching addresses. (less btl_endpoints than possible source addresses) r17331 and r17332 haven't fixed the issue. Don't code when leaving the office ;) I'll have a look at it tomorrow. Sorry for all the noise in the trunk. > multiple IP addresses by interface the connection step will work. Now > I can see a benefit of having multiple socket over the same link (and > it's already implemented in Open MPI), but I don't see the interest of > using multiple IP in this case. I have an easy to reproduce testcase for #1206. If you like, we can step through the debugger in a shared screen (screen -x) or VNC session. Just mail me if you're interested. ;) -- Cluster and Metacomputing Working Group Friedrich-Schiller-Universität Jena, Germany private: http://adi.thur.de