Paul, Here is a revised patch to be applied vs the 1.8.7-rc1 tarball
Could you please give it a try ? Cheers, Gilles On 7/11/2015 4:22 AM, Paul Hargrove wrote:
The timing on this is less than ideal for me.To accommodate work on some high-voltage switching equipment, our building will be without power over the weekend. The system I use to autogen will be OFF from around 3pm today until perhaps 3pm on Monday. I will also be busy with shutting down our group's systems gracefully today and bringing them back on Monday.The test platforms where I have reproduced the failures is NOT going to be off-line. So, I will be able to test only *tarballs* (but not patches to .m4 files) until probably Monday evening.Gilles,I think it reasonable to suspect the lib could hold a stub that returns ENOSYS for the deprecated function. I suspect that checking for ibv_create_xrc_rcv_qp+IBV_QPT_XRC should work for the rhel6.5 failure case described previously. That way the checks for the two flavors both look for a function in the lib and a constant in the header.-PaulOn Fri, Jul 10, 2015 at 8:21 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com <mailto:jsquy...@cisco.com>> wrote:Yes, I seem to recall that this issue came up before... ah, here it is: commit 04bec4475e5a962432b73dd6254f62bb263703ab Author: Jeff Squyres <jsquy...@cisco.com <mailto:jsquy...@cisco.com>> Date: Fri Jan 16 18:13:31 2015 -0800 openib: check more thoroughly for XRC Some systems have XRC symbols in their libibverbs libraries, but do not have the appropriate XRC bits in their devel headers (cough cough RHEL 6.5 libibverbs-rocee-*.x86-64.rpm cough cough). So expand the XRC config checks to ensure that we can actually find one of the XRC constants that we need to compile XRC code before ruling that we can actually build XRC support. > On Jul 10, 2015, at 10:33 AM, Gilles Gouaillardet <gilles.gouaillar...@gmail.com <mailto:gilles.gouaillar...@gmail.com>> wrote: > > Sorry about that, and thanks for reverting the commit. > > Paul mentioned a patch I sent to the ml, and that worked for him. > The commit was supposed to be a more robust version. > For example, in rhel7, the deprecated function have been removed, but the xrc domains is fine. > Currently, xrc is not supported as it should. > > It seems rhel 6.5 has the deprecated function, but it is not in the header files are missing it among other things. > > I will fix that and post a issue a pr so you can test it on rhel6.5 before I commit it. > > I noticed there is no infiniband/verbs.h on a lanl test cluster (the non cray one) > Is it possible to have it installed ? > > Cheers, > > Gilles > > On Friday, July 10, 2015, Jeff Squyres (jsquyres) <jsquy...@cisco.com <mailto:jsquy...@cisco.com>> wrote: > On Jul 10, 2015, at 2:12 AM, Paul Hargrove <phhargr...@lbl.gov <mailto:phhargr...@lbl.gov>> wrote: > > > > The only "new" (non-cosmetic) problem I observed was the failure to detect "ConnectX XRC support". > > It looks like Gilles and I iterated on that issue until we have something that works now. > > 'fraid not. :-( > > Per https://github.com/open-mpi/ompi-release/pull/384#issuecomment-120412836, the latest commit breaks on RHEL 6.5 systems that do not have MOFED installed. > > -- > Jeff Squyres > jsquy...@cisco.com <mailto:jsquy...@cisco.com> > For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/ > > _______________________________________________ > devel mailing list > de...@open-mpi.org <mailto:de...@open-mpi.org> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: http://www.open-mpi.org/community/lists/devel/2015/07/17618.php > _______________________________________________ > devel mailing list > de...@open-mpi.org <mailto:de...@open-mpi.org> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel > Link to this post: http://www.open-mpi.org/community/lists/devel/2015/07/17620.php -- Jeff Squyres jsquy...@cisco.com <mailto:jsquy...@cisco.com> For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/ _______________________________________________ devel mailing list de...@open-mpi.org <mailto:de...@open-mpi.org> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2015/07/17623.php -- Paul H. Hargrove phhargr...@lbl.gov <mailto:phhargr...@lbl.gov> Computer Languages & Systems Software (CLaSS) Group Computer Science Department Tel: +1-510-495-2352 Lawrence Berkeley National Laboratory Fax: +1-510-486-6900 _______________________________________________ devel mailing list de...@open-mpi.org Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel Link to this post: http://www.open-mpi.org/community/lists/devel/2015/07/17625.php
xrc.4.v1.8.patch.gz
Description: application/gzip