Re: [OMPI devel] 1.7.3rc2 is out

2013-10-02 Thread Ralph Castain
Any further info on this? I can't replicate it on my cluster, even using Slurm 2.6.2 with pmi2 enabled, using the current trunk.

On Oct 1, 2013, at 6:44 PM, Ralph Castain wrote:

> Hmmm...working for me, though with an earlier version of Slurm.
>
> It looks like you are seeing a failure in Job_

Re: [OMPI devel] 1.7.3rc2 is out

2013-10-01 Thread Ralph Castain
Hmmm...working for me, though with an earlier version of Slurm.

It looks like you are seeing a failure in Job_Getid in the libpmi2 support. I wonder if you have a problem in that library? Can you check the arguments going to it? Perhaps the max value length is too big or something? You might al

Re: [OMPI devel] 1.7.3rc2 is out

2013-10-01 Thread Joshua Ladd
I am getting the following failure on two hosts, 1 proc per host. We are running SLURM 2.6.2.

joshual@mir13 ~/ompi_1.7/openmpi-1.7.3rc1/examples $mpirun -np 2 -bynode hostname
*** buffer overflow detected ***: mpirun terminated
=== Backtrace: =
/lib64/libc.so.6(__fortify_fail+0x37)[0