Hi Kay
I suppose you could try to implement the MPI support directly in your plugin -
does ARE have a launcher that starts its own daemons that fork/exec the user
procs? You’d also have to implement a PMIx server in those daemons to make it
all work, but you could model that after the one in OR
Ralph,
No it did not run.
The complete output (which I really should have included in the first
place) is below.
-Paul
$ mpirun -mca btl sm,self -np 2 examples/ring_c'
Error opening /devices/pci@0,0:reg: Permission denied
[pcp-d-3:26054] PMIX ERROR: ERROR in file
/export/home/phargrov/OMPI/openm
Ah, okay - that makes more sense. I’ll have to let Brice see if he can figure
out how to silence the hwloc error message as I can’t find where it came from.
The other errors are real and are the reason why the job was terminated.
The problem is that we are trying to establish a communication bet
Ralph,
The output from the requested run is attached.
-Paul
On Sat, Sep 19, 2015 at 9:46 PM, Ralph Castain wrote:
> Ah, okay - that makes more sense. I’ll have to let Brice see if he can
> figure out how to silence the hwloc error message as I can’t find where it
> came from. The other errors ar
Paul,
Can you please double check pmix1xx is compiled with -D_REENTRANT ?
We ran into similar issues in the past, and they only occurred with Solaris
Cheers,
Gilles
On Sunday, September 20, 2015, Paul Hargrove wrote:
> Ralph,
> The output from the requested run is attached.
> -Paul
>
> On Sat
Gilles,
Yes every $CC invocation in opal/mca/pmix/pmix1xx includes "-D_REENTRANT".
However, they don't include "-mt".
I believe we concluded (when we had problems previously) that "-mt" was the
proper flag (at compile and link) for multi-threaded with the Studio
compilers.
-Paul
On Sat, Sep 19,
Paul,
I do not remember it like that ...
at that time, the issue in ompi was that the global errno was uses instead
of the per thread errno.
though the man pages tells -mt should be used fir multithreaded apps, you
tried -D_REENTRANT on all your platforms, and it was enough to get the
expected re
Paul - can you please confirm that you gave mpirun a level of 10 for the
pmix_base_verbose param? This output isn’t what I would have expected from that
level - it looks more like the verbosity was set to 5, and so the error number
isn’t printed.
Thanks
Ralph
> On Sep 20, 2015, at 3:42 AM, Gi
Yes, it is definitely at 10.
Another attempt is attached.
-Paul
On Sun, Sep 20, 2015 at 8:19 AM, Ralph Castain wrote:
> Paul - can you please confirm that you gave mpirun a level of 10 for the
> pmix_base_verbose param? This output isn’t what I would have expected from
> that level - it looks mo
Argh - found a typo in the output line. Could you please try the attached patch and do it again? This might fix it, but if not it will provide me with some idea of the returned error.ThanksRalph
paul.diff
Description: Binary data
On Sep 20, 2015, at 12:40 PM, Paul Hargrove wro
Ralph,
Still failing with that patch, but with the addition of a fairly long pause
(64s) before the first error message appears, and again after the second
"server setup_fork" (64s again)
New output is attached.
-Paul
On Sun, Sep 20, 2015 at 2:15 PM, Ralph Castain wrote:
> Argh - found a typo
11 matches
Mail list logo