Well, I tried with 3.0.1, and it also hangs. I guess we will try to
write to the R community about this.
m
On 06/04/2018 11:42 PM, Ben Menadue wrote:
Hi All,
This looks very much like what I reported a couple of weeks ago with
Rmpi and doMPI — the trace looks the same. But as far as I
Hi All,
This looks very much like what I reported a couple of weeks ago with Rmpi and
doMPI — the trace looks the same. But as far as I could see, doMPI does
exactly what simple_spawn.c does — use MPI_Comm_spawn to create the workers and
then MPI_Comm_disconnect them when you call
Yes, that does sound like a bug - the #connects must equal the #disconnects.
> On Jun 4, 2018, at 1:17 PM, marcin.krotkiewski
> wrote:
>
> huh. This code also runs, but it also only displays 4 connect / disconnect
> messages. I should add that the test R script shows 4 connect, but 8
>
Marcin,
If you are interested, I can send you the R examples I use to test
things offline.
-- bennet
On Mon, Jun 4, 2018 at 4:17 PM, marcin.krotkiewski
wrote:
> huh. This code also runs, but it also only displays 4 connect / disconnect
> messages. I should add that the test R script shows 4
huh. This code also runs, but it also only displays 4 connect /
disconnect messages. I should add that the test R script shows 4
connect, but 8 disconnect messages. Looks like a bug to me, but where? I
guess we will try to contact R forums and ask there.
Bennet: I tried to use doMPI +
It might call disconnect more than once if it creates multiple communicators.
Here’s another test case for that behavior:
intercomm_create.c
Description: Binary data
> On Jun 4, 2018, at 7:08 AM, Bennet Fauber wrote:
>
> Just out of curiosity, but would using Rmpi and/or doMPI help in any
Just out of curiosity, but would using Rmpi and/or doMPI help in any way?
-- bennet
On Mon, Jun 4, 2018 at 10:00 AM, marcin.krotkiewski
wrote:
> Thanks, Ralph!
>
> Your code finishes normally, I guess then the reason might be lying in R.
> Running the R code with -mca pmix_base_verbose 1 i see
Thanks, Ralph!
Your code finishes normally, I guess then the reason might be lying in
R. Running the R code with -mca pmix_base_verbose 1 i see that each rank
calls ext2x:client disconnect twice (each PID prints the line twice)
[...]
3 slaves are spawned successfully. 0 failed.
Try running the attached example dynamic code - if that works, then it likely
is something to do with how R operates.
simple_spawn.c
Description: Binary data
> On Jun 4, 2018, at 3:43 AM, marcin.krotkiewski
> wrote:
>
> Hi,
>
> I have some problems running R + Rmpi with OpenMPI 3.1.0 +