Classification: UNCLASSIFIED Caveats: NONE I have also had issues with this in the past. I know that this was a problem in 1.6.3, supposedly fixed in 1.7.4. For some reason, when attempting to set up inter process communication this way you are only able to connect once.
I still have issues as of 1.8.3, but as the actual error is of the form: ------------------------------------------------------------ A process or daemon was unable to complete a TCP connection to another process: Local host: xxxxxxxxx Remote host: xxxxxxxxx This is usually caused by a firewall on the remote host. Please check that any firewall (e.g., iptables) has been disabled and try again. ------------------------------------------------------------ [xxxxxxxxx:59057] [[9837,0],0]-[[9830,0],0] mca_oob_tcp_peer_recv_handler: invalid socket state(6) I suspect it may be a local configuration issue as opposed to an issue with the code itself. Andrew Burns Lockheed Martin Software Engineer 410-306-0409 ARL DSRC andrew.j.bur...@us.army.mil andrew.j.burns35....@mail.mil -----Original Message----- From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Sunday, February 01, 2015 11:54 PM To: Open MPI Users Subject: Re: [OMPI users] use accept/connect to merge a new intra-comm Which OMPI version? On Jan 25, 2015, at 5:41 AM, haozi <yidany...@163.com> wrote: Hi guys. I am interested in an example from OpenMPI, as attachment: singleton_client_server.c. So, I wrote another example. And some error happened. My example includes two servers and one client. First, server1 runs. Second, client runs. server1 and client merge an new intra-comm. Next, server2 runs. server2 open a port, server1 and client want to connect server2. At the moment, some error happen. client and server2 block, and server1 has errors as following: [datanode-2:06824] [[53818,0],0]:route_callback tried routing message from [[53818,1],0] to [[53822,1],0]:16, can't find route [0] func:/usr/lib/libopen-pal.so.0(opal_backtrace_print+0x30) [0xb769fbc0] [1] func:/usr/lib/openmpi/lib/openmpi/mca_rml_oob.so(+0x1bfd) [0xb748fbfd] [2] func:/usr/lib/openmpi/lib/openmpi/mca_oob_tcp.so(+0x6cfa) [0xb7484cfa] [3] func:/usr/lib/openmpi/lib/openmpi/mca_oob_tcp.so(+0x81c2) [0xb74861c2] [4] func:/usr/lib/libopen-pal.so.0(+0x1aca4) [0xb7688ca4] [5] func:/usr/lib/libopen-pal.so.0(opal_event_loop+0x25) [0xb7688ea5] [6] func:/usr/lib/libopen-pal.so.0(opal_event_dispatch+0x1b) [0xb7688ecb] [7] func:mpiexec() [0x804ac0a] [8] func:mpiexec() [0x8049f4f] [9] func:/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3) [0xb74c14d3] [10] func:mpiexec() [0x8049ea1] My code is attached. Regards. <client.c><server1.c><server2.c>_______________________________________________ users mailing list us...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users Link to this post: http://www.open-mpi.org/community/lists/users/2015/01/26233.php Classification: UNCLASSIFIED Caveats: NONE
smime.p7s
Description: S/MIME cryptographic signature