Re: [OMPI devel] Error in TCP BTL??
https://github.com/open-mpi/ompi/pull/5819 will ease the pain. I couldn't figure out what exactly trigger this, but apparently recent versions of OSX refuse to bind with port 0. George. On Mon, Oct 1, 2018 at 4:12 PM Jeff Squyres (jsquyres) via devel < devel@lists.open-mpi.org> wrote: > I get that 100% time in the runs on MacOS, too (with today's HEAD): > > -- > $ mpirun -np 4 --mca btl tcp,self ring_c > Process 0 sending 10 to 1, tag 201 (4 processes in ring) > [JSQUYRES-M-26UT][[5535,1],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect] > bind() failed: Invalid argument (22) > [JSQUYRES-M-26UT:85104] *** An error occurred in MPI_Send > [JSQUYRES-M-26UT:85104] *** reported by process [362741761,0] > [JSQUYRES-M-26UT:85104] *** on communicator MPI_COMM_WORLD > [JSQUYRES-M-26UT:85104] *** MPI_ERR_OTHER: known error not in list > [JSQUYRES-M-26UT:85104] *** MPI_ERRORS_ARE_FATAL (processes in this > communicator will now abort, > [JSQUYRES-M-26UT:85104] ***and potentially your MPI job) > -- > > > > On Oct 1, 2018, at 2:12 PM, Ralph H Castain wrote: > > > > I’m getting this error when trying to run a simple ring program on my > Mac: > > > > > [Ralphs-iMac-2.local][[21423,14],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect] > bind() failed: Invalid argument (22) > > > > Anyone recognize the problem? It causes the job to immediately abort. > This is with current head of master this morning - it was working when I > last used it, but it has been an unknown period of time. > > Ralph > > > > ___ > > devel mailing list > > devel@lists.open-mpi.org > > https://lists.open-mpi.org/mailman/listinfo/devel > > > -- > Jeff Squyres > jsquy...@cisco.com > > ___ > devel mailing list > devel@lists.open-mpi.org > https://lists.open-mpi.org/mailman/listinfo/devel ___ devel mailing list devel@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/devel
Re: [OMPI devel] Error in TCP BTL??
I get that 100% time in the runs on MacOS, too (with today's HEAD): -- $ mpirun -np 4 --mca btl tcp,self ring_c Process 0 sending 10 to 1, tag 201 (4 processes in ring) [JSQUYRES-M-26UT][[5535,1],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect] bind() failed: Invalid argument (22) [JSQUYRES-M-26UT:85104] *** An error occurred in MPI_Send [JSQUYRES-M-26UT:85104] *** reported by process [362741761,0] [JSQUYRES-M-26UT:85104] *** on communicator MPI_COMM_WORLD [JSQUYRES-M-26UT:85104] *** MPI_ERR_OTHER: known error not in list [JSQUYRES-M-26UT:85104] *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort, [JSQUYRES-M-26UT:85104] ***and potentially your MPI job) -- > On Oct 1, 2018, at 2:12 PM, Ralph H Castain wrote: > > I’m getting this error when trying to run a simple ring program on my Mac: > > [Ralphs-iMac-2.local][[21423,14],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect] > bind() failed: Invalid argument (22) > > Anyone recognize the problem? It causes the job to immediately abort. This is > with current head of master this morning - it was working when I last used > it, but it has been an unknown period of time. > Ralph > > ___ > devel mailing list > devel@lists.open-mpi.org > https://lists.open-mpi.org/mailman/listinfo/devel -- Jeff Squyres jsquy...@cisco.com ___ devel mailing list devel@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/devel
[OMPI devel] Error in TCP BTL??
I’m getting this error when trying to run a simple ring program on my Mac: [Ralphs-iMac-2.local][[21423,14],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect] bind() failed: Invalid argument (22) Anyone recognize the problem? It causes the job to immediately abort. This is with current head of master this morning - it was working when I last used it, but it has been an unknown period of time. Ralph ___ devel mailing list devel@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/devel