George, I think you misunderstand the difference between the two modules. PSM 
supports one type of fabric, and PSM2 supports a different one. They are not 
interchangeable.

I agree with your second point. If you have a way of resolving it, I would 
welcome hearing it. So far, the problems have been a lack of testing that fails 
to identify problems, frequently due to missed use-cases. Perhaps someone has a 
better suite of tests (or would volunteer to help develop them), and/or would 
offer up access to a broader range of environments?

As for why the distro would provide two rpm’s: you can argue that with them. 
:-)  However, I believe the issue is the same one we have with hcoll/ml and 
libnl/libnl3 conflicts - and I have yet to see someone propose a reliable fix 
for either of those. If you can create one, please chime in on the libnl/libnl3 
ticket as Gilles and Jeff have spent a lot of time and pain on it.


> On Sep 2, 2015, at 9:38 PM, Gilles Gouaillardet <gil...@rist.or.jp> wrote:
> 
> George,
> 
> about your third point :
> some libraries does stuff in the constructors, so "mtl = ^psm" might also not 
> work if OMPI was configure'd with --disable-dlopen.
> as far as i know, --disable-dlopen is quite popular (and --disable-shared 
> --enable-static is not so much)
> 
> Cheers,
> 
> Gilles
> 
> On 9/3/2015 1:31 PM, George Bosilca wrote:
>> I might have missed something here but:
>> 
>> 1. I bet that, and I'm certainly using a lower bound here, 99.9% of our 
>> users will not even notice the issue between PSM and PSM2.
>> 
>> 2. If there is anything that might negatively impact us as a community is 
>> the recurrent screwed-up with our own releases. For a production-quality 
>> software, releasing a new "stable" version every 3 weeks is not being 
>> reactive, is being obnoxious.
>> 
>> 3. Except if the distro builds OMPI statically, I see no reason to have 2 
>> build of OMPI due to conflicting symbols between two shared libraries that 
>> OMPI MCA load willingly. Why a simple "mtl = ^psm" in the OMPI system wide 
>> configuration file is not enough to solve the issue?
>> 
>>   George.
>> 
>> 
>> On Wed, Sep 2, 2015 at 11:47 PM, Ralph Castain <r...@open-mpi.org 
>> <mailto:r...@open-mpi.org>> wrote:
>> I’m afraid that won’t solve the problem - the distro will still feel the 
>> need to release -two- versions of OMPI, one with PSM and one with PSM2. 
>> Ordinarily, I wouldn’t care - but this creates user confusion and reflects 
>> on us as a community.
>> 
>> 
>> > On Sep 2, 2015, at 6:50 PM, Gilles Gouaillardet <gil...@rist.or.jp 
>> > <mailto:gil...@rist.or.jp>> wrote:
>> >
>> > Ralph,
>> >
>> > what about automatically *not* building PSM2 if PSM is built and PSM2 is 
>> > not explicitly required ?
>> > /* in order to be future proof, we could even do that only if we detect a 
>> > symbol conflict */
>> > we could abort if ompi is configure'd with both --with-psm and 
>> > --with-psm2, or simply do nothing
>> > (the end user might know what he/she is doing, and there will be nothing 
>> > to do on the ompi side
>> > when this gets fixed by the PSM folks)
>> >
>> > Cheers,
>> >
>> > Gilles
>> >
>> > On 9/3/2015 10:21 AM, Ralph Castain wrote:
>> >> Hi folks
>> >>
>> >> I regret to say that 1.10.0 is hitting an issue with at least one 
>> >> upstream distro. Apparently, there is a symbol conflict between the PSM 
>> >> and PSM2 libraries that precludes building both of those MTLs at the same 
>> >> time. This is leading the distro to push for release of two OMPI 1.10.0 
>> >> builds - one with PSM and the other with PSM2.
>> >>
>> >> IMO, this is a very undesirable situation. I agree with the distro that 
>> >> delaying release for some significant time as this would impact everyone 
>> >> else’s users. Therefore, assuming that the PSM team is unable to quickly 
>> >> resolve the problem in their libraries, my inclination is to release an 
>> >> immediate 1.10.1 with the PSM2 MTL removed.
>> >>
>> >> I’m soliciting input - any opinions?
>> >> Ralph
>> >>
>> >> _______________________________________________
>> >> devel mailing list
>> >> de...@open-mpi.org <mailto:de...@open-mpi.org>
>> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>> >> <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>> >> Link to this post: 
>> >> http://www.open-mpi.org/community/lists/devel/2015/09/17919.php 
>> >> <http://www.open-mpi.org/community/lists/devel/2015/09/17919.php>
>> >
>> > _______________________________________________
>> > devel mailing list
>> > de...@open-mpi.org <mailto:de...@open-mpi.org>
>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>> > <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>> > Link to this post: 
>> > http://www.open-mpi.org/community/lists/devel/2015/09/17920.php 
>> > <http://www.open-mpi.org/community/lists/devel/2015/09/17920.php>
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org <mailto:de...@open-mpi.org>
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>> <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/devel/2015/09/17921.php 
>> <http://www.open-mpi.org/community/lists/devel/2015/09/17921.php>
>> 
>> 
>> _______________________________________________
>> devel mailing list
>> de...@open-mpi.org <mailto:de...@open-mpi.org>
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel 
>> <http://www.open-mpi.org/mailman/listinfo.cgi/devel>
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/devel/2015/09/17923.php 
>> <http://www.open-mpi.org/community/lists/devel/2015/09/17923.php>
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2015/09/17924.php

Reply via email to