On Mar 20, 2009, at 11:06 AM, Eugene Loh wrote:
> I'm still seeing a very low incidence of the sm segv during
startup (.
> 01% -- 23 tests out of ~160k), so let's ship 1.3.1 and roll in
> Eugene's new sm code for 1.3.2.
>
I wanted to join in the fun, but... no go. I'm running an
"MPI_Init()"
job on a single node with np=8. So far about 40K runs with no
failures. Am I missing a special ingredient?
I wish I knew what it was. :-(
The 160k runs are all my MTT runs. I run a large variety of different
configurations with different compilers, mpirun options, etc.
Although when I poked into this last week, I couldn't find any obvious
pattern as to what exactly was causing the failure.
--
Jeff Squyres
Cisco Systems