Any further info on this? I can't replicate it on my cluster, even using Slurm
2.6.2 with pmi2 enabled, using the current trunk.
On Oct 1, 2013, at 6:44 PM, Ralph Castain wrote:
Hmmm...working for me, though with an earlier version of Slurm.
It looks like you are seeing a failure in Job_Getid in the libpmi2 support. I
wonder if you have a problem in that library? Can you check the arguments going
to it? Perhaps the max value length is too big or something?
You might al
I am getting the following failure on two hosts, 1 proc per host. We are
running SLURM 2.6.2.
joshual@mir13 ~/ompi_1.7/openmpi-1.7.3rc1/examples
$mpirun -np 2 -bynode hostname
*** buffer overflow detected ***: mpirun terminated
=== Backtrace: =
/lib64/libc.so.6(__fortify_fail+0x37)[0