Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Jeff Squyres (jsquyres) via devel
On Jun 12, 2018, at 7:34 AM, Gabriel, Edgar wrote: > > Well, I am still confused. What is different on nixOS vs. other linux distros > that makes this error appear, Fair enough. I don't think I realized nixOS was a Linux distro. That being said, every time I think I understand linkers, I find

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Gabriel, Edgar
Well, I am still confused. What is different on nixOS vs. other linux distros that makes this error appear, and is it relevant enough for the backport or should we just go forward for 4.0? Is it again a RTLD_GLOBAL issue as it was back 2014? And last but not least, I raised on the github discuss

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Jeff Squyres (jsquyres) via devel
On Jun 12, 2018, at 7:21 AM, Gilles Gouaillardet wrote: > > I think this also depends on the linker (configuration ?) and possibly the > order the libraries are dlopen’ed. > > Note the issue was initially reported (as warnings only) from ompi_info, so > there is a possibility it we all missed

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Gilles Gouaillardet
I think this also depends on the linker (configuration ?) and possibly the order the libraries are dlopen’ed. Note the issue was initially reported (as warnings only) from ompi_info, so there is a possibility it we all missed it. That being said, the errors make perfect sense to me. fwiw, I inst

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Gabriel, Edgar
No, I do not use -disable-dlopen, this is the other thing that is confusing to me, how comes this error does not occur for anybody else. Thanks Edgar > -Original Message- > From: devel [mailto:devel-boun...@lists.open-mpi.org] On Behalf Of Jeff > Squyres (jsquyres) via devel > Sent: Tuesd

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Jeff Squyres (jsquyres) via devel
How is it that Edgar is not running into these issues? Edgar: are you compiling with --disable-dlopen, perchance? > On Jun 12, 2018, at 6:04 AM, Gilles Gouaillardet > wrote: > > Edgar, > > Regarding this specific problem, the issue is mca_fcoll_individual.so did not > depend on libmca_commo

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Gilles Gouaillardet
Edgar, Regarding this specific problem, the issue is mca_fcoll_individual.so did not depend on libmca_commom_ompio.so, the PR does address that (among other abstraction violations) What about following up in github ? Cheers, Gilles On Tuesday, June 12, 2018, Gabriel, Edgar wrote: > So , I a

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Gabriel, Edgar
So , I am still surprised to see this error message: if you look at lets say just one error message (and all others are the same): > > [orc-login2:107400] mca_base_component_repository_open: unable to open > > mca_fcoll_individual: .../lib/openmpi/mca_fcoll_individual.so: > > undefined symbol: mc

Re: [OMPI devel] Shared object dependencies

2018-06-12 Thread Gilles Gouaillardet
Tyson, thanks for taking the time to do some more tests. This is really a bug in Open MPI, and unlike what I thought earlier, there are still some abstraction violations here and there related to ompio. I filed https://github.com/open-mpi/ompi/pull/5263 in order to address them Meanwhil