Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-05 Thread marcin.krotkiewski
Well, I tried with 3.0.1, and it also hangs. I guess we will try to write to the R community about this.
m
On 06/04/2018 11:42 PM, Ben Menadue wrote:
Hi All, This looks very much like what I reported a couple of weeks ago with Rmpi and doMPI — the trace looks the same. But as far as I

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread Ben Menadue
Hi All, This looks very much like what I reported a couple of weeks ago with Rmpi and doMPI — the trace looks the same. But as far as I could see, doMPI does exactly what simple_spawn.c does — use MPI_Comm_spawn to create the workers and then MPI_Comm_disconnect them when you call

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread r...@open-mpi.org
Yes, that does sound like a bug - the #connects must equal the #disconnects.
> On Jun 4, 2018, at 1:17 PM, marcin.krotkiewski wrote:
>
> huh. This code also runs, but it also only displays 4 connect / disconnect
> messages. I should add that the test R script shows 4 connect, but 8
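
The rule stated above (connects must equal disconnects) can be checked mechanically once the connect and disconnect events have been pulled out of the verbose output. The following is only a minimal sketch in Python: the function name `unbalanced_ranks`, the `(pid, kind)` event tuples, and the PIDs are all hypothetical, and the step that extracts events from the `pmix_base_verbose` output is left out because the log format is not shown in this thread. The sample data is synthetic and merely mirrors the reported totals of 4 connects and 8 disconnects.

```python
from collections import Counter


def unbalanced_ranks(events):
    """Return {pid: (connects, disconnects)} for each pid whose counts differ.

    `events` is an iterable of (pid, kind) pairs, where kind is either
    "connect" or "disconnect". How these pairs are extracted from the
    verbose output is deliberately not covered here.
    """
    connects = Counter()
    disconnects = Counter()
    for pid, kind in events:
        if kind == "connect":
            connects[pid] += 1
        elif kind == "disconnect":
            disconnects[pid] += 1
        else:
            raise ValueError(f"unknown event kind: {kind!r}")
    pids = sorted(set(connects) | set(disconnects))
    return {
        pid: (connects[pid], disconnects[pid])
        for pid in pids
        if connects[pid] != disconnects[pid]
    }


# Synthetic data (hypothetical PIDs): four processes, each connecting once
# and disconnecting twice, i.e. 4 connects and 8 disconnects in total.
synthetic = []
for pid in (101, 102, 103, 104):
    synthetic += [(pid, "connect"), (pid, "disconnect"), (pid, "disconnect")]

print(unbalanced_ranks(synthetic))
# {101: (1, 2), 102: (1, 2), 103: (1, 2), 104: (1, 2)}
```

An empty result would mean every process is balanced; any entry points at a process that disconnects more (or less) often than it connects.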

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread Bennet Fauber
Marcin, if you are interested, I can send you the R examples I use to test things offline.
-- bennet
On Mon, Jun 4, 2018 at 4:17 PM, marcin.krotkiewski wrote:
> huh. This code also runs, but it also only displays 4 connect / disconnect
> messages. I should add that the test R script shows 4

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread marcin.krotkiewski
Huh. This code also runs, but it also displays only 4 connect / disconnect messages. I should add that the test R script shows 4 connect, but 8 disconnect messages. Looks like a bug to me, but where? I guess we will try to contact the R forums and ask there. Bennet: I tried to use doMPI +

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread r...@open-mpi.org
It might call disconnect more than once if it creates multiple communicators. Here’s another test case for that behavior:
(attachment: intercomm_create.c)
> On Jun 4, 2018, at 7:08 AM, Bennet Fauber wrote:
>
> Just out of curiosity, but would using Rmpi and/or doMPI help in any

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread Bennet Fauber
Just out of curiosity, but would using Rmpi and/or doMPI help in any way?
-- bennet
On Mon, Jun 4, 2018 at 10:00 AM, marcin.krotkiewski wrote:
> Thanks, Ralph!
>
> Your code finishes normally, I guess then the reason might be lying in R.
> Running the R code with -mca pmix_base_verbose 1 i see

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread marcin.krotkiewski
Thanks, Ralph! Your code finishes normally, so I guess the cause might lie in R. Running the R code with -mca pmix_base_verbose 1, I see that each rank calls ext2x:client disconnect twice (each PID prints the line twice) [...] 3 slaves are spawned successfully. 0 failed.

Re: [OMPI users] A hang in Rmpi at PMIx_Disconnect

2018-06-04 Thread r...@open-mpi.org
Try running the attached example dynamic code - if that works, then it likely is something to do with how R operates.
(attachment: simple_spawn.c)
> On Jun 4, 2018, at 3:43 AM, marcin.krotkiewski wrote:
>
> Hi,
>
> I have some problems running R + Rmpi with OpenMPI 3.1.0 +