On Mar 20, 2009, at 11:06 AM, Eugene Loh wrote:

> I'm still seeing a very low incidence of the sm segv during startup (.
> 01% -- 23 tests out of ~160k), so let's ship 1.3.1 and roll in
> Eugene's new sm code for 1.3.2.
>
I wanted to join in the fun, but... no go. I'm running an "MPI_Init()"
job on a single node with np=8.  So far about 40K runs with no
failures.  Am I missing a special ingredient?


I wish I knew what it was.  :-(

The 160k runs are all my MTT runs. I run a large variety of different configurations with different compilers, mpirun options, etc. Although when I poked into this last week, I couldn't find any obvious pattern as to what exactly was causing the failure.

--
Jeff Squyres
Cisco Systems

Reply via email to