Re: [OMPI devel] Error in TCP BTL??

2018-10-01 Thread George Bosilca
https://github.com/open-mpi/ompi/pull/5819 will ease the pain. I couldn't
figure out what exactly trigger this, but apparently recent versions of OSX
refuse to bind with port 0.

  George.



On Mon, Oct 1, 2018 at 4:12 PM Jeff Squyres (jsquyres) via devel <
devel@lists.open-mpi.org> wrote:

> I get that 100% time in the runs on MacOS, too (with today's HEAD):
>
> --
> $ mpirun -np 4 --mca btl tcp,self ring_c
> Process 0 sending 10 to 1, tag 201 (4 processes in ring)
> [JSQUYRES-M-26UT][[5535,1],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect]
> bind() failed: Invalid argument (22)
> [JSQUYRES-M-26UT:85104] *** An error occurred in MPI_Send
> [JSQUYRES-M-26UT:85104] *** reported by process [362741761,0]
> [JSQUYRES-M-26UT:85104] *** on communicator MPI_COMM_WORLD
> [JSQUYRES-M-26UT:85104] *** MPI_ERR_OTHER: known error not in list
> [JSQUYRES-M-26UT:85104] *** MPI_ERRORS_ARE_FATAL (processes in this
> communicator will now abort,
> [JSQUYRES-M-26UT:85104] ***and potentially your MPI job)
> --
>
>
> > On Oct 1, 2018, at 2:12 PM, Ralph H Castain  wrote:
> >
> > I’m getting this error when trying to run a simple ring program on my
> Mac:
> >
> >
> [Ralphs-iMac-2.local][[21423,14],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect]
> bind() failed: Invalid argument (22)
> >
> > Anyone recognize the problem? It causes the job to immediately abort.
> This is with current head of master this morning - it was working when I
> last used it, but it has been an unknown period of time.
> > Ralph
> >
> > ___
> > devel mailing list
> > devel@lists.open-mpi.org
> > https://lists.open-mpi.org/mailman/listinfo/devel
>
>
> --
> Jeff Squyres
> jsquy...@cisco.com
>
> ___
> devel mailing list
> devel@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/devel
___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

Re: [OMPI devel] Error in TCP BTL??

2018-10-01 Thread Jeff Squyres (jsquyres) via devel
I get that 100% time in the runs on MacOS, too (with today's HEAD):

--
$ mpirun -np 4 --mca btl tcp,self ring_c
Process 0 sending 10 to 1, tag 201 (4 processes in ring)
[JSQUYRES-M-26UT][[5535,1],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect]
 bind() failed: Invalid argument (22)
[JSQUYRES-M-26UT:85104] *** An error occurred in MPI_Send
[JSQUYRES-M-26UT:85104] *** reported by process [362741761,0]
[JSQUYRES-M-26UT:85104] *** on communicator MPI_COMM_WORLD
[JSQUYRES-M-26UT:85104] *** MPI_ERR_OTHER: known error not in list
[JSQUYRES-M-26UT:85104] *** MPI_ERRORS_ARE_FATAL (processes in this 
communicator will now abort,
[JSQUYRES-M-26UT:85104] ***and potentially your MPI job)
--


> On Oct 1, 2018, at 2:12 PM, Ralph H Castain  wrote:
> 
> I’m getting this error when trying to run a simple ring program on my Mac:
> 
> [Ralphs-iMac-2.local][[21423,14],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect]
>  bind() failed: Invalid argument (22)
> 
> Anyone recognize the problem? It causes the job to immediately abort. This is 
> with current head of master this morning - it was working when I last used 
> it, but it has been an unknown period of time.
> Ralph
> 
> ___
> devel mailing list
> devel@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/devel


-- 
Jeff Squyres
jsquy...@cisco.com

___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel

[OMPI devel] Error in TCP BTL??

2018-10-01 Thread Ralph H Castain
I’m getting this error when trying to run a simple ring program on my Mac:

[Ralphs-iMac-2.local][[21423,14],0][btl_tcp_endpoint.c:742:mca_btl_tcp_endpoint_start_connect]
 bind() failed: Invalid argument (22)

Anyone recognize the problem? It causes the job to immediately abort. This is 
with current head of master this morning - it was working when I last used it, 
but it has been an unknown period of time.
Ralph

___
devel mailing list
devel@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/devel