Hi folks,
Question about PMIx in the 2.x tree: on 1.x I used to be able to start N
individual jobs through mpirun with -np 1 and have them gradually join a single
intercommunicator through MPI_Comm_accept, MPI_Comm_connect,
MPI_Intercomm_create, and MPI_Intercomm_merge. The port that one of the
[...] rendezvous point. This option remains - it just uses PMIx calls to implement
the rendezvous.
On Nov 7, 2016, at 9:23 AM, Pieter Noordhuis
<piet...@fb.com> wrote:
Hi folks,
Question about PMIx in the 2.x tree: on 1.x I used to be able to start N
individual jobs through [...]
From: Pieter Noordhuis
Sent: Tuesday, November 8, 2016 10:29:02 AM
To: OpenMPI Devel
Subject: Re: [OMPI devel] PMIx in 2.x
Ah, that's good to know. I'm trying to wire things up through orte-server on
2.x now and am at the point where I can do an MPI_Publish_name on one node and
MPI_Look
t 10:30 AM, Pieter Noordhuis
<piet...@fb.com> wrote:
... accidentally sent before finishing.
This error happened because lookup was returning multiple published entries,
N-1 of which had already disconnected. Should this case be handled more
gracefully, or is it expected to fail li
November 8, 2016 2:30:52 PM
To: OpenMPI Devel
Subject: Re: [OMPI devel] PMIx in 2.x
Yes, they should find one another. You do have to pass the server’s URI on the
mpirun command line so each job can find the server.
On Nov 8, 2016, at 12:40 PM, Pieter Noordhuis
<piet...@fb.com> wrote:
I'll open an