Re: [OMPI devel] [OMPI users] simple mpi hello world segfaults when coll ml not disabled

2015-06-26 Thread Jeff Squyres (jsquyres)
On Jun 25, 2015, at 10:48 PM, Gilles Gouaillardet wrote: > > as far as i understand, the behavior depends on how plugins are enumerated > and this is system dependent > (by default, Daniel got a crash, but i got none ...) > should we sort the plugins by name/library name so we do not fall into t

[OMPI devel] the bug in btl_openib_connect_sl.c

2015-06-26 Thread Алексей Рыжих
Hi everybody, I tried the functionality for 3D-torus cluster topology support and encountered the bug with error message like below: srvmpisnb02][[9011,1],3][ompi/mca/btl/openib/connect/btl_openib_connect_sl.c:239:get_pathrecord_info] error posting receive on QP [0x4f] errno says: Success [

Re: [OMPI devel] [OMPI users] simple mpi hello world segfaults when coll ml not disabled

2015-06-26 Thread Paul Hargrove
On Thu, Jun 25, 2015 at 10:48 PM, Gilles Gouaillardet wrote: > Paul, > > i assume you ran the test with Open MPI configured with --disable-dlopen, > right ? > > --disable-dlopen is like forcing coll_ml to be loaded first, hence the > crash, even with --mca coll ^ml > > without --disable-dlopen, a

Re: [OMPI devel] [OMPI users] simple mpi hello world segfaults when coll ml not disabled

2015-06-26 Thread Gilles Gouaillardet
Paul, i assume you ran the test with Open MPI configured with --disable-dlopen, right ? --disable-dlopen is like forcing coll_ml to be loaded first, hence the crash, even with --mca coll ^ml without --disable-dlopen, and with default coll_ml_priority=0, the crash only occurs if coll_ml is

Re: [OMPI devel] [OMPI users] simple mpi hello world segfaults when coll ml not disabled

2015-06-26 Thread Ralph Castain
Crud - thanks Paul! Mellanox is working on a fix (renaming the symbols in their proprietary library so they don't conflict). If they can release that soon, I'm hoping to avoid having to release a quick 1.8.7 to fix the problem from inside OMPI (i.e., removing one of the conflicting plugins). On

Re: [OMPI devel] === CREATE FAILURE (dev-1979-g13425e7) ===

2015-06-26 Thread Jeff Squyres (jsquyres)
No results were put back to the PR where we did the extensions work this afternoon. See attached. [cid:43E6B6DA-588A-4766-80B3-BD8762F683AD@cisco.com] On Jun 25, 2015, at 9:03 PM, Howard Pritchard mailto:hpprit...@gmail.com>> wrote: sorry, not true. look at the logs on IU. runs at 3:07 and

Re: [OMPI devel] === CREATE FAILURE (dev-1979-g13425e7) ===

2015-06-26 Thread Howard Pritchard
sorry, not true. look at the logs on IU. runs at 3:07 and 4:08 IU time. 2015-06-25 21:46 GMT-06:00 Jeff Squyres (jsquyres) : > Howard -- > > The LANL distcheck jenkins hasn't been running all day. > > > > On Jun 25, 2015, at 8:33 PM, Howard Pritchard > wrote: > > > > Hi folks, > > > > I'm confu