[OMPI devel] PMIx in 2.x

2016-11-07 Thread Pieter Noordhuis
Hi folks, Question about PMIx in the 2.x tree: on 1.x I used to be able to start N individual jobs through mpirun with -np1 and have them gradually join a single intercommunicator through MPI_Comm_accept, MPI_Comm_connect, MPI_Intercomm_create, and MPI_Intercomm_merge. The port that one of the

Re: [OMPI devel] PMIx in 2.x

2016-11-08 Thread Pieter Noordhuis
ndezvous point. This option remains - it just uses PMIx calls to implement the rendezvous. On Nov 7, 2016, at 9:23 AM, Pieter Noordhuis mailto:piet...@fb.com>> wrote: Hi folks, Question about PMIx in the 2.x tree: on 1.x I used to be able to start N individual jobs throug

Re: [OMPI devel] PMIx in 2.x

2016-11-08 Thread Pieter Noordhuis
: Pieter Noordhuis Sent: Tuesday, November 8, 2016 10:29:02 AM To: OpenMPI Devel Subject: Re: [OMPI devel] PMIx in 2.x Ah, that's good to know. I'm trying to wire things up through orte-server on 2.x now and am at the point where I can do an MPI_Publish_name on one node and MPI_Look

Re: [OMPI devel] PMIx in 2.x

2016-11-08 Thread Pieter Noordhuis
t 10:30 AM, Pieter Noordhuis mailto:piet...@fb.com>> wrote: ... accidentally sent before finishing. This error happened because lookup was returning multiple published entries, N-1 of which had already disconnected. Should this case be handled more gracefully, or is it expected to fail li

Re: [OMPI devel] PMIx in 2.x

2016-11-09 Thread Pieter Noordhuis
November 8, 2016 2:30:52 PM To: OpenMPI Devel Subject: Re: [OMPI devel] PMIx in 2.x Yes, they should find one another. You do have to pass the server’s URI on the mpirun cmd line so it can find it. On Nov 8, 2016, at 12:40 PM, Pieter Noordhuis mailto:piet...@fb.com>> wrote: I'll open an