[OMPI devel] intermittent crash in mpirun upon non zero exit status

2014-06-09 Thread Gilles Gouaillardet
Folks, several mtt tests (ompi-trunk r31963) failed (SIGSEGV in mpirun) with a similar stack trace. For example, you can refer to : http://mtt.open-mpi.org/index.php?do_redir=2199 the issue is not related whatsoever to the init_thread_serialized test (other tests failed with similar symptoms) s

Re: [OMPI devel] intermittent crash in mpirun upon non zero exit status

2014-06-09 Thread Ralph Castain
I'm not sure that was created by the commit you cite, but it may have been exposed by it. Either way, the patch is correct - the TCP component will NULL the entry in the hash table, but that doesn't remove the key and so the hash_table lookup request will return "success" with a NULL pointer.

Re: [OMPI devel] intermittent crash in mpirun upon non zero exit status

2014-06-09 Thread Ralph Castain
Rats - sent too soon. Should have noted that I committed the fix and CMR'd it to 1.8.2 On Jun 9, 2014, at 10:47 AM, Ralph Castain wrote: > I'm not sure that was created by the commit you cite, but it may have been > exposed by it. Either way, the patch is correct - the TCP component will NULL