Paul,

Here is a revised patch to be applied vs the 1.8.7-rc1 tarball

Could you please give it a try ?

Cheers,

Gilles

On 7/11/2015 4:22 AM, Paul Hargrove wrote:
The timing on this is less than ideal for me.

To accommodate work on some high-voltage switching equipment, our building will be without power over the weekend. The system I use to autogen will be OFF from around 3pm today until perhaps 3pm on Monday. I will also be busy with shutting down our group's systems gracefully today and bringing them back on Monday.

The test platforms where I have reproduced the failures is NOT going to be off-line. So, I will be able to test only *tarballs* (but not patches to .m4 files) until probably Monday evening.

Gilles,

I think it reasonable to suspect the lib could hold a stub that returns ENOSYS for the deprecated function. I suspect that checking for ibv_create_xrc_rcv_qp+IBV_QPT_XRC should work for the rhel6.5 failure case described previously. That way the checks for the two flavors both look for a function in the lib and a constant in the header.


-Paul

On Fri, Jul 10, 2015 at 8:21 AM, Jeff Squyres (jsquyres) <jsquy...@cisco.com <mailto:jsquy...@cisco.com>> wrote:

    Yes, I seem to recall that this issue came up before... ah, here
    it is:

    commit 04bec4475e5a962432b73dd6254f62bb263703ab
    Author: Jeff Squyres <jsquy...@cisco.com <mailto:jsquy...@cisco.com>>
    Date:   Fri Jan 16 18:13:31 2015 -0800

        openib: check more thoroughly for XRC

        Some systems have XRC symbols in their libibverbs libraries,
    but do
        not have the appropriate XRC bits in their devel headers
    (cough cough
        RHEL 6.5 libibverbs-rocee-*.x86-64.rpm cough cough).

        So expand the XRC config checks to ensure that we can actually
    find
        one of the XRC constants that we need to compile XRC code before
        ruling that we can actually build XRC support.



    > On Jul 10, 2015, at 10:33 AM, Gilles Gouaillardet
    <gilles.gouaillar...@gmail.com
    <mailto:gilles.gouaillar...@gmail.com>> wrote:
    >
    > Sorry about that, and thanks for reverting the commit.
    >
    > Paul mentioned a patch I sent to the ml, and that worked for him.
    > The commit was supposed to be a more robust version.
    > For example, in rhel7, the deprecated function have been
    removed, but the xrc domains is fine.
    > Currently, xrc is not supported as it should.
    >
    > It seems rhel 6.5 has the deprecated function, but it is not in
    the header files are missing it among other things.
    >
    > I will fix that and post a issue a pr so you can test it on
    rhel6.5 before I commit it.
    >
    > I noticed there is no infiniband/verbs.h on a lanl test cluster
    (the non cray one)
    > Is it possible to have it installed ?
    >
    > Cheers,
    >
    > Gilles
    >
    > On Friday, July 10, 2015, Jeff Squyres (jsquyres)
    <jsquy...@cisco.com <mailto:jsquy...@cisco.com>> wrote:
    > On Jul 10, 2015, at 2:12 AM, Paul Hargrove <phhargr...@lbl.gov
    <mailto:phhargr...@lbl.gov>> wrote:
    > >
    > > The only "new" (non-cosmetic) problem I observed was the
    failure to detect "ConnectX XRC support".
    > > It looks like Gilles and I iterated on that issue until we
    have something that works now.
    >
    > 'fraid not.  :-(
    >
    > Per
    https://github.com/open-mpi/ompi-release/pull/384#issuecomment-120412836,
    the latest commit breaks on RHEL 6.5 systems that do not have
    MOFED installed.
    >
    > --
    > Jeff Squyres
    > jsquy...@cisco.com <mailto:jsquy...@cisco.com>
    > For corporate legal information go to:
    http://www.cisco.com/web/about/doing_business/legal/cri/
    >
    > _______________________________________________
    > devel mailing list
    > de...@open-mpi.org <mailto:de...@open-mpi.org>
    > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
    > Link to this post:
    http://www.open-mpi.org/community/lists/devel/2015/07/17618.php
    > _______________________________________________
    > devel mailing list
    > de...@open-mpi.org <mailto:de...@open-mpi.org>
    > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
    > Link to this post:
    http://www.open-mpi.org/community/lists/devel/2015/07/17620.php


    --
    Jeff Squyres
    jsquy...@cisco.com <mailto:jsquy...@cisco.com>
    For corporate legal information go to:
    http://www.cisco.com/web/about/doing_business/legal/cri/

    _______________________________________________
    devel mailing list
    de...@open-mpi.org <mailto:de...@open-mpi.org>
    Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
    Link to this post:
    http://www.open-mpi.org/community/lists/devel/2015/07/17623.php




--
Paul H. Hargrove phhargr...@lbl.gov <mailto:phhargr...@lbl.gov>
Computer Languages & Systems Software (CLaSS) Group
Computer Science Department               Tel: +1-510-495-2352
Lawrence Berkeley National Laboratory     Fax: +1-510-486-6900


_______________________________________________
devel mailing list
de...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
Link to this post: 
http://www.open-mpi.org/community/lists/devel/2015/07/17625.php

Attachment: xrc.4.v1.8.patch.gz
Description: application/gzip

Reply via email to