On Jun 12, 2018, at 7:34 AM, Gabriel, Edgar wrote:
>
> Well, I am still confused. What is different on nixOS vs. other linux distros
> that makes this error appear,
Fair enough. I don't think I realized nixOS was a Linux distro.
That being said, every time I think I understand linkers, I find
Well, I am still confused. What is different on nixOS vs. other linux distros
that makes this error appear, and is it relevant enough for the backport or
should we just go forward for 4.0? Is it again a RTLD_GLOBAL issue as it was
back 2014? And last but not least, I raised on the github discuss
On Jun 12, 2018, at 7:21 AM, Gilles Gouaillardet
wrote:
>
> I think this also depends on the linker (configuration ?) and possibly the
> order the libraries are dlopen’ed.
>
> Note the issue was initially reported (as warnings only) from ompi_info, so
> there is a possibility it we all missed
I think this also depends on the linker (configuration ?) and possibly the
order the libraries are dlopen’ed.
Note the issue was initially reported (as warnings only) from ompi_info, so
there is a possibility it we all missed it.
That being said, the errors make perfect sense to me.
fwiw, I inst
No, I do not use -disable-dlopen, this is the other thing that is confusing to
me, how comes this error does not occur for anybody else.
Thanks
Edgar
> -Original Message-
> From: devel [mailto:devel-boun...@lists.open-mpi.org] On Behalf Of Jeff
> Squyres (jsquyres) via devel
> Sent: Tuesd
How is it that Edgar is not running into these issues?
Edgar: are you compiling with --disable-dlopen, perchance?
> On Jun 12, 2018, at 6:04 AM, Gilles Gouaillardet
> wrote:
>
> Edgar,
>
> Regarding this specific problem, the issue is mca_fcoll_individual.so did not
> depend on libmca_commo
Edgar,
Regarding this specific problem, the issue is mca_fcoll_individual.so did
not depend on libmca_commom_ompio.so,
the PR does address that (among other abstraction violations)
What about following up in github ?
Cheers,
Gilles
On Tuesday, June 12, 2018, Gabriel, Edgar wrote:
> So , I a
So , I am still surprised to see this error message: if you look at lets say
just one error message (and all others are the same):
> > [orc-login2:107400] mca_base_component_repository_open: unable to open
> > mca_fcoll_individual: .../lib/openmpi/mca_fcoll_individual.so:
> > undefined symbol: mc
Tyson,
thanks for taking the time to do some more tests.
This is really a bug in Open MPI, and unlike what I thought earlier,
there are still
some abstraction violations here and there related to ompio.
I filed https://github.com/open-mpi/ompi/pull/5263 in order to address them
Meanwhil