Classification: UNCLASSIFIED
Caveats: NONE

I have also run into this in the past. It was a known problem in
1.6.3 and was supposedly fixed in 1.7.4. For some reason,
when you set up inter-process communication this way, you are only
able to connect once.

I still have issues as of 1.8.3, but the actual error is of the form:

------------------------------------------------------------
A process or daemon was unable to complete a TCP connection
to another process:
  Local host:    xxxxxxxxx
  Remote host:   xxxxxxxxx
This is usually caused by a firewall on the remote host. Please
check that any firewall (e.g., iptables) has been disabled and
try again.
------------------------------------------------------------
[xxxxxxxxx:59057] [[9837,0],0]-[[9830,0],0] mca_oob_tcp_peer_recv_handler: invalid socket state(6)

Given that message, I suspect it may be a local configuration issue rather than
an issue with the code itself.

Andrew Burns
Lockheed Martin
Software Engineer
410-306-0409
ARL DSRC
andrew.j.bur...@us.army.mil
andrew.j.burns35....@mail.mil

-----Original Message-----
From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain
Sent: Sunday, February 01, 2015 11:54 PM
To: Open MPI Users
Subject: Re: [OMPI users] use accept/connect to merge a new intra-comm

Which OMPI version?


        On Jan 25, 2015, at 5:41 AM, haozi <yidany...@163.com> wrote:

        Hi guys.

        I am interested in an example from Open MPI, attached as
singleton_client_server.c.
        Based on it, I wrote another example, and ran into an error.
        My example includes two servers and one client.
        First, server1 runs. Second, the client runs. server1 and the client
merge into a new intra-comm. Next, server2 runs. server2 opens a
port, and server1 and the client try to connect to server2. At that point, an
error occurs: the client and server2 block, and server1 reports the following
errors:

        [datanode-2:06824] [[53818,0],0]:route_callback tried routing message from [[53818,1],0] to [[53822,1],0]:16, can't find route
        [0] func:/usr/lib/libopen-pal.so.0(opal_backtrace_print+0x30) [0xb769fbc0]
        [1] func:/usr/lib/openmpi/lib/openmpi/mca_rml_oob.so(+0x1bfd) [0xb748fbfd]
        [2] func:/usr/lib/openmpi/lib/openmpi/mca_oob_tcp.so(+0x6cfa) [0xb7484cfa]
        [3] func:/usr/lib/openmpi/lib/openmpi/mca_oob_tcp.so(+0x81c2) [0xb74861c2]
        [4] func:/usr/lib/libopen-pal.so.0(+0x1aca4) [0xb7688ca4]
        [5] func:/usr/lib/libopen-pal.so.0(opal_event_loop+0x25) [0xb7688ea5]
        [6] func:/usr/lib/libopen-pal.so.0(opal_event_dispatch+0x1b) [0xb7688ecb]
        [7] func:mpiexec() [0x804ac0a]
        [8] func:mpiexec() [0x8049f4f]
        [9] func:/lib/i386-linux-gnu/libc.so.6(__libc_start_main+0xf3) [0xb74c14d3]
        [10] func:mpiexec() [0x8049ea1]

        My code is attached.

        Regards.


        
<client.c><server1.c><server2.c>_______________________________________________
        users mailing list
        us...@open-mpi.org
        Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
        Link to this post: 
http://www.open-mpi.org/community/lists/users/2015/01/26233.php



