Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-29 Thread Mehdi Dogguy
Sure, but it is still an improvement over the current situation and is simple enough to minimize its impact. Of course, it should be considered a workaround until upstream releases a fixed version. On 29 October 2018 at 19:41:06 GMT+01:00, Brian Smith wrote: >Hi Mehdi, > > >On Mon, Oct

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-29 Thread Brian Smith
Hi Mehdi, On Mon, Oct 29, 2018 at 4:48 AM Mehdi Dogguy wrote: > > Sorry for not replying sooner. > > On 2018-10-20 17:54, Brian Smith wrote: > > The change is in psm2_hal.c. It is a brand new file. Reference the > > initialization loop at line 246. > > > > Indeed. The solution described in the

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-29 Thread Mehdi Dogguy
Sorry for not replying sooner. On 2018-10-20 17:54, Brian Smith wrote: The change is in psm2_hal.c. It is a brand new file. Reference the initialization loop at line 246. Indeed. The solution described in the GitHub issue looks fine. Why not upload it to Debian? It will solve a real

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-20 Thread Brian Smith
The change is in psm2_hal.c. It is a brand new file. Reference the initialization loop at line 246. /* Optimization note: The following code attempts to initialize two different times: First time assumes that the driver is already up, and so it attempts to initialize with the

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-20 Thread Mehdi Dogguy
On 2018-10-19 19:53, Brian Smith wrote: The problem occurs when the OFI psm2 provider invokes psm2_init() when there are no hfi1 devices present on the system. The call chain eventually invokes hfi1_wait_for_device() with a timeout of 0. That is interpreted as 15000ms. Actually, that part of
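
To illustrate the behaviour described above, here is a minimal sketch in C of how a timeout of 0 that is read as a 15000 ms default turns into a long wait when no device ever appears. This is not the libpsm2 source: the names effective_timeout_ms, simulated_wait_ms, device_probe_fn, no_devices and one_device are hypothetical, the polling interval is an assumption, and the wait is simulated by counting milliseconds rather than sleeping. Only the 0 to 15000 ms mapping comes from the thread.

```c
#include <assert.h>

/* From the thread: a timeout of 0 is interpreted as 15000 ms. */
#define DEFAULT_WAIT_MS 15000L

/* Hypothetical helper: map the caller's timeout to the wait actually used. */
static long effective_timeout_ms(long requested_ms)
{
    return requested_ms == 0 ? DEFAULT_WAIT_MS : requested_ms;
}

/* Hypothetical probe type: returns the number of devices currently visible. */
typedef int (*device_probe_fn)(void);

/*
 * Simulate a polling wait without sleeping. Returns the number of
 * milliseconds that would elapse before a device is found or the
 * effective timeout expires.
 */
static long simulated_wait_ms(device_probe_fn probe, long requested_ms,
                              long poll_interval_ms)
{
    assert(poll_interval_ms > 0);

    long budget = effective_timeout_ms(requested_ms);
    long elapsed = 0;

    while (elapsed < budget && probe() == 0) {
        elapsed += poll_interval_ms;
        if (elapsed > budget)
            elapsed = budget; /* never report more than the budget */
    }
    return elapsed;
}

/* Stub probes standing in for a system without and with a device. */
static int no_devices(void) { return 0; }
static int one_device(void) { return 1; }
```

Under these assumptions, simulated_wait_ms(no_devices, 0, 250) spends the full default budget, while simulated_wait_ms(one_device, 0, 250) returns immediately. That matches the symptom reported here: the cost is only paid on machines with no hfi1 device.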

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-19 Thread Brian Smith
Sorry for my late reply on this issue. The BTS notifications were sent to spam. I'm looking into adjusting the filters. -- Brian T. Smith System Fabric Works Senior Technical Staff bsm...@systemfabricworks.com GPG Key: 0xB3C2C7B73BA3CD7F

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-19 Thread Brian Smith
I have confirmed this behavior with libfabric1_1.6.1-5 and libpsm2_11.2.68-1. There is no existing workaround for the problem, other than downgrading libpsm2 to 10.3.58-2. The problem occurs when the OFI psm2 provider invokes psm2_init() when there are no hfi1 devices present on the system. The

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-16 Thread Lippuner, Jonas
http://jonaslippuner.com -----Original Message----- From: Mehdi Dogguy To: "Lippuner, Jonas" , 910...@bugs.debian.org Subject: Re: Bug#910485: Confirm issue with libpsm2-2/11.2.68-1 Date: Tue, 16 Oct 2018 10:10:04 +0200 Hi Jonas, On 2018-10-15 19:54, Lippuner, Jonas wrote: > I'm ha

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-16 Thread Mehdi Dogguy
Hi Jonas, On 2018-10-15 19:54, Lippuner, Jonas wrote: I'm having the same issue with libpsm2-2 version 11.2.68-1. Downgrading to 10.3.58-2 fixes it for me. Can you please explain how you experienced the bug? I've understood Drew's case, but maybe yours is slightly different. -- Mehdi

Bug#910485: Confirm issue with libpsm2-2/11.2.68-1

2018-10-15 Thread Lippuner, Jonas
I'm having the same issue with libpsm2-2 version 11.2.68-1. Downgrading to 10.3.58-2 fixes it for me. -- Jonas Lippuner, PhD Scientist Computational Physics and Methods, CCS-2 Center for Theoretical Astrophysics Los Alamos National Laboratory jlippu...@lanl.gov 505-667-1646