Re: [ewg] OFA EWG Meeting: Monday, Sept 10th, 2018 - Minutes

2018-09-13 Thread Kalderon, Michal
Hi,

Sorry for not attending this week.
Cavium update -

  *   Completed RHEL 7.4 backports
  *   Working on SLES 12.3
  *   Still testing on all distros

Thanks,
Michal

From: ewg  On Behalf Of Davis, Arlin R
Sent: Thursday, September 13, 2018 1:03 AM
To: ewg@lists.openfabrics.org
Subject: [ewg] OFA EWG Meeting: Monday, Sept 10th, 2018 - Minutes




Attendees            Company
Steve Wise           Chelsio
Arlin Davis          Intel
Bob Woodruff         Intel
Tatyana Nikolova     Intel - i40iw
Vladimir Sokolovsky  Mellanox
Llolsten Kaonga      SW Forge





Minutes:



  *   OFED-4.17 daily build (Sept 6th):
https://downloads.openfabrics.org/downloads/OFED/ofed-4.17-daily/OFED-4.17-20180906-0743.tgz

o   Mellanox

§  All core backports done (RH7.4, 7.5, SLES12.3, 15), still testing

§  iSER target and NVMeoF target backports still in progress

§  NFS-RDMA status unknown, Arlin to check with Jeff Becker.

o   Chelsio

§  Still working on SLES12.3, hope to finish up this week.

o   Broadcom

§  Finished all backports, one patch is still under review in linux-rdma.

o   Cavium

§  Still working on RH7.4? Is this correct?

o   Intel

§  i40iw on 4.17 fails build, Tatyana will send Vlad a patch this week.

§  qib (True Scale) 7.5 backport issues, qib may be limited to kernel.org-4.17
similar to i40iw. Arlin will verify.

o   VMware

§  Build testing done for RHEL 7.4/7.5 and SLES 12.3/15; functional tests to be
done by end of this week.

§  Interoperability and backward compatibility tests next week.

o   Huawei

§  Software Forge will reach out to maintainers and previous contacts.

o   RC/GA Timeline

§  Vlad asked everyone to update release notes in preparation for RC1, Arlin
will drive the effort

§  Everyone agreed to be ready for RC1 by next meeting (Sept 24)



Please correct as needed.



Regards,



Arlin
___
ewg mailing list
ewg@lists.openfabrics.org
https://lists.openfabrics.org/mailman/listinfo/ewg


Re: [ewg] OFA EWG Meeting Minutes

2018-08-14 Thread Kalderon, Michal
Thanks Rupert,

Sorry we could not attend. Update from Cavium:


  *   Backports for RHEL7.5 were sufficient for SLES15 as well
  *   Working on RHEL 7.4
  *   We have some issues with the FastLinQ binary FW installation (currently
      working around this by manually copying the binary into /lib/firmware).
      We are working with Vlad on resolving the issue.
  *   Testing on RHEL 7.5 / SLES in the meantime looks OK.
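
The manual workaround mentioned above is a plain file copy. A minimal Python
sketch follows, assuming a flat copy into /lib/firmware is sufficient. The
file name is hypothetical, since the thread does not name the binary, and the
temporary directories only stand in for real paths so the example runs without
root.

```python
import shutil
import tempfile
from pathlib import Path

def install_firmware(src, firmware_dir="/lib/firmware"):
    """Copy one firmware binary into firmware_dir and return the new path."""
    src = Path(src)
    dest_dir = Path(firmware_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    dest = dest_dir / src.name
    shutil.copy2(src, dest)  # copy2 also preserves timestamps and mode bits
    return dest

# Demonstration against temporary directories (hypothetical file name).
with tempfile.TemporaryDirectory() as tmp:
    fw = Path(tmp) / "example_fw.bin"
    fw.write_bytes(b"\x00\x01")
    installed = install_firmware(fw, firmware_dir=Path(tmp) / "lib" / "firmware")
    print(installed.name, installed.read_bytes() == b"\x00\x01")
```

On a real system the default firmware_dir would be used and the copy would
need root privileges.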


Thanks,
Michal

From: ewg  On Behalf Of Rupert Dance - SFI
Sent: Tuesday, August 14, 2018 7:56 PM
To: ewg@lists.openfabrics.org
Subject: [ewg] OFA EWG Meeting Minutes


Hi All,

I have attached the OFA EWG meeting minutes. Please let me know if I missed 
anything or there are some corrections needed.

Thanks

Rupert


[ewg] ofed-4.17 perftest

2018-08-13 Thread Kalderon, Michal
Hi Vlad,

Can you please update to the latest perftest?
The current version, 4.2-0.5, has a bug (fail to register MR) that was fixed
in commit:
https://github.com/linux-rdma/perftest/commit/198473181e0365f97c5840b8fd406ff52af6335b

Thanks,
Michal


[ewg] netdev_notifiers

2018-07-23 Thread Kalderon, Michal
Hi Vlad, Arlin,

We had an issue with loading our net driver and found that the following is 
missing from compat/netdevice.h:

#ifdef HAVE_REGISTER_NETDEVICE_NOTIFIER_RH
#define register_netdevice_notifier register_netdevice_notifier_rh
#define unregister_netdevice_notifier unregister_netdevice_notifier_rh
#endif

This leads to a crash when getting a netdev event.

In addition, I wanted to check if this was fixed (git pull) but got the 
following message:
Fatal: unable to connect to git.openfabrics.org:
git.openfabrics.org[0: 69.55.231.76]: errno=Connection refused
(It has been like this since yesterday.)

thanks,
Michal


Re: [ewg] OFED-4.17 daily build status - 20180502-1140

2018-06-04 Thread Kalderon, Michal
Thanks Vlad,

I was referring to the FW binary file, which isn't part of the kernel but of
the linux-firmware git repository.

Thanks
Michal

Sent from my iPhone

On 4 Jun 2018, at 16:53, Vladimir Sokolovsky <v...@dev.mellanox.co.il> wrote:

On 06/04/2018 12:32 PM, Kalderon, Michal wrote:
Hi Vlad, Arlin,

Thanks for the daily build.
We currently have two issues with qedr:

  1.  We need to add newer FW.
  2.  Even though iWARP support exists in 4.17, and the code seems to be part
      of the compat-rdma that I extracted from your build, our iWARP devices
      are not detected. We need to investigate this issue.

Arlin – do we need to open Bugzilla entries for these two issues?
Vlad, are the development git trees up to date?

Thanks,
Michal


Hi Michal,
If your FW is a part of the compat-rdma package and it was updated since 
linux-4.17-rc1 then the next build of OFED will include newer FW.

Regards,
Vladimir



From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Vladimir 
Sokolovsky
Sent: Friday, May 18, 2018 10:46 PM
To: Davis, Arlin R <arlin.r.da...@intel.com>; ewg@lists.openfabrics.org
Subject: Re: [ewg] OFED-4.17 daily build status - 20180502-1140

On 05/18/2018 08:38 PM, Davis, Arlin R wrote:

Hello Vlad,

Thanks for the first OFED 4.17 daily build. It seems to build and install fine 
on our servers with 4.17 RC5 kernel but we cannot run anything.

Here is a quick status for reference:

Console message: infiniband mlx5_0: ib_register_mad_agent: QP 0 not supported

Hi Arlin,
Please try to update HCA firmware to the latest version:
http://www.mellanox.com/downloads/firmware/fw-ConnectX4-rel-12_22_1002-MCX455A-ECA_Ax-UEFI-14.15.19-FlexBoot-3.5.403.bin.zip

Regards,
Vladimir




hca_id: mlx5_0
        transport:                      InfiniBand (0)
        fw_ver:                         12.14.2036
        node_guid:                      7cfe:9003:00d4:10d0
        sys_image_guid:                 7cfe:9003:00d4:10d0
        vendor_id:                      0x02c9
        vendor_part_id:                 4115
        hw_ver:                         0x0
        board_id:                       MT_2180110032
        phys_port_cnt:                  1
                port:   1
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                4096 (5)
                        active_mtu:             1024 (3)
                        sm_lid:                 0
                        port_lid:               0
                        port_lmc:               0x00
                        link_layer:             Ethernet


# ib_write_bw -a -d mlx5_0

Couldn't allocate MR
failed to create mr
Failed to create MR
Couldn't create IB resources

After a reboot, it hangs. We are investigating.
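
The firmware suggestion in this exchange amounts to comparing the adapter's
reported fw_ver with the release named in the firmware link. A minimal Python
sketch of that comparison follows. It assumes the release string 12.22.1002
can be read off the file name (rel-12_22_1002) and that versions compare
numerically field by field; the helper names are illustrative and not part of
any OFED tool.

```python
import re

def parse_fw_ver(devinfo_text):
    """Return the fw_ver value from ibv_devinfo-style text as a tuple of ints."""
    match = re.search(r"fw_ver:\s*(\d+(?:\.\d+)*)", devinfo_text)
    if match is None:
        raise ValueError("no fw_ver field found")
    return tuple(int(part) for part in match.group(1).split("."))

def needs_fw_update(devinfo_text, target="12.22.1002"):
    """True if the reported firmware is older than the target release."""
    # target is an assumption taken from the "rel-12_22_1002" file name
    target_ver = tuple(int(part) for part in target.split("."))
    return parse_fw_ver(devinfo_text) < target_ver

sample = """hca_id: mlx5_0
    fw_ver: 12.14.2036
"""
print(needs_fw_update(sample))  # prints True: 12.14.2036 is older than 12.22.1002
```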

Thanks,

Arlin







Re: [ewg] OFED-4.17 daily build status - 20180502-1140

2018-06-04 Thread Kalderon, Michal
Hi Vlad, Arlin,

Thanks for the daily build.
We currently have two issues with qedr:

  1.  We need to add newer FW.
  2.  Even though iWARP support exists in 4.17, and the code seems to be part
      of the compat-rdma that I extracted from your build, our iWARP devices
      are not detected. We need to investigate this issue.

Arlin - do we need to open Bugzilla entries for these two issues?
Vlad, are the development git trees up to date?

Thanks,
Michal


From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Vladimir 
Sokolovsky
Sent: Friday, May 18, 2018 10:46 PM
To: Davis, Arlin R ; ewg@lists.openfabrics.org
Subject: Re: [ewg] OFED-4.17 daily build status - 20180502-1140

On 05/18/2018 08:38 PM, Davis, Arlin R wrote:

Hello Vlad,

Thanks for the first OFED 4.17 daily build. It seems to build and install fine 
on our servers with 4.17 RC5 kernel but we cannot run anything.

Here is a quick status for reference:

Console message: infiniband mlx5_0: ib_register_mad_agent: QP 0 not supported

Hi Arlin,
Please try to update HCA firmware to the latest version:
http://www.mellanox.com/downloads/firmware/fw-ConnectX4-rel-12_22_1002-MCX455A-ECA_Ax-UEFI-14.15.19-FlexBoot-3.5.403.bin.zip

Regards,
Vladimir




hca_id: mlx5_0
        transport:                      InfiniBand (0)
        fw_ver:                         12.14.2036
        node_guid:                      7cfe:9003:00d4:10d0
        sys_image_guid:                 7cfe:9003:00d4:10d0
        vendor_id:                      0x02c9
        vendor_part_id:                 4115
        hw_ver:                         0x0
        board_id:                       MT_2180110032
        phys_port_cnt:                  1
                port:   1
                        state:                  PORT_ACTIVE (4)
                        max_mtu:                4096 (5)
                        active_mtu:             1024 (3)
                        sm_lid:                 0
                        port_lid:               0
                        port_lmc:               0x00
                        link_layer:             Ethernet


# ib_write_bw -a -d mlx5_0

Couldn't allocate MR
failed to create mr
Failed to create MR
Couldn't create IB resources

After a reboot, it hangs. We are investigating.

Thanks,

Arlin







[ewg] CEPH and rdma-core

2018-03-22 Thread Kalderon, Michal
Hi,
 
Hope someone here can help. 

I'm trying to run Ceph over OFED (specifically 4.8.2, though I'm not sure it
matters). I've root-caused an invalid src gid index to a header mismatch.
Is anyone aware of a Ceph version that is compatible with OFED / rdma-core?
If not, what's the process? Does Ceph need to be recompiled against the
rdma-core headers?

Thanks,
Michal


Re: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-03-07 Thread Kalderon, Michal
Thanks for the quick response.

Perhaps it should be documented in a more generic way, as this is not only
related to openibd but to all drivers that are part of the initrd.

For future versions, maybe it should be part of the build script, with
dracut -f --add-drivers for every driver in compat-rdma?
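
One way to read this suggestion is as a post-install step that rebuilds the
initramfs with the OFED-built modules added explicitly. The Python sketch below
only assembles the command line and does not run it. The module list is a
hypothetical example (qed, qede, qedr from this thread); how a real build
script would enumerate the compat-rdma drivers is not specified here.

```python
import shlex

def dracut_rebuild_command(drivers):
    """Build a dracut invocation that force-rebuilds the initramfs and adds
    the given kernel modules (passed as one space-separated argument)."""
    if not drivers:
        return ["dracut", "-f"]
    return ["dracut", "-f", "--add-drivers", " ".join(drivers)]

cmd = dracut_rebuild_command(["qed", "qede", "qedr"])
print(shlex.join(cmd))  # dracut -f --add-drivers 'qed qede qedr'
```

Keeping the command as an argument list means it could later be handed to
subprocess.run without shell quoting concerns.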

Thanks,
Michal


> -----Original Message-----
> From: Vladimir Sokolovsky [mailto:v...@dev.mellanox.co.il]
> Sent: Wednesday, March 07, 2018 4:49 PM
> To: Kalderon, Michal <michal.kalde...@cavium.com>; Davis, Arlin R
> <arlin.r.da...@intel.com>; ewg@lists.openfabrics.org; Vladimir Sokolovsky
> <v...@mellanox.com>
> Cc: Rahman, Ameen <ameen.rah...@cavium.com>; Elior, Ariel
> <ariel.el...@cavium.com>; Tayar, Tomer <tomer.ta...@cavium.com>
> Subject: Re: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US
> Pacific Time (12pm EST) - Minutes
> 
> Hi Michal,
> 
> This issue is documented already in the OFED_release_notes.txt:
> 
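> ==============================================================================
> 3. Known Issues
> ==============================================================================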
> ...
> 
> 22. Bug 2640 - openibd fail to start when system is coming up:
>      The inbox kernel modules being loaded from initrd
>      So, need to rebuild the initrd by:
>      # dracut -f -v
> 
> 
> Regards,
> 
> Vladimir
> 
> 
> On 03/07/2018 04:44 PM, Kalderon, Michal wrote:
> > Hi Vlad and All,
> >
> > During Software Forge testing of the package containing the fixes, we hit
> > a different issue, which I believe could affect all vendors.
> >
> > The problem occurs when drivers that come inbox are part of the distro's
> > initramfs/initrd.
> > In our case qed/qede are inbox, but qedr is not. This means that qed/qede
> > are in the initrd and are loaded automatically with an older version than
> > the one installed by OFED. Then, when qedr is probed, there is a mismatch,
> > as the OFED qedr version is newer and incompatible.
> > If qedr were inbox as well, we probably wouldn't even have noticed that
> > the OFED drivers aren't loaded (they are loaded only after an rmmod of all
> > qed drivers followed by a modprobe).
> >
> > In our out-of-box installation scripts, we update the initramfs with
> > the newly compiled drivers. I'd expect to see the same at the end of
> > an OFED build (dracut -f / update initrd, mkinitrd, etc., for example).
> >
> > We could just document that the user needs to rebuild the ramfs after
> > OFED installation.
> > I'm not sure how this didn't come up until now. I've looked a bit at our
> > setups and noticed that many have omit-drivers in the lsinitrd output, or
> > older drivers that were installed, etc., so this is easy to miss. In
> > addition, the only reason we hit this is that the inbox and OFED drivers
> > are mismatched.
> >
> > Thanks,
> > Michal
> >
> >
> >> -----Original Message-----
> >> From: Davis, Arlin R [mailto:arlin.r.da...@intel.com]
> >> Sent: Monday, March 05, 2018 4:34 AM
> >> To: Kalderon, Michal <michal.kalde...@cavium.com>;
> >> ewg@lists.openfabrics.org
> >> Cc: Woodruff, Robert J <robert.j.woodr...@intel.com>; Vladimir
> >> Sokolovsky <v...@mellanox.com>; Amrani, Ram
> <ram.amr...@cavium.com>;
> >> Rahman, Ameen <ameen.rah...@cavium.com>; Brendan Myers
> >> <brendan.my...@soft-forge.com>; Vladimir Sokolovsky
> >> <v...@mellanox.com>
> >> Subject: RE: OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US
> >> Pacific Time (12pm EST) - Minutes
> >>
> >> Hello Michal,
> >>
> >> Great progress, kudos to the team for a quick resolution. I concur,
> >> let's test before we roll RC3.
> >>
> >> Vlad, please pull these fixes into new daily build so Brendan can test.
> >>
> >> Arlin
> >>
> >>
> >>> Hi Arlin,
> >>>
> >>> We've been working with Brendan on this and were able to reproduce
> >>> on our setups fix, and test locally.
> >>> There are three commits (2 fix the issue and 1 fix was exposed that
> >>> our data collection had an issue)
> >>> 2 out of the 3 fixes have already been upstream in official linux 
> >>> revisions.
> >>> one of the fixes can't go through next as is as the code varies quite a 
> >>> bit.
> >>>
> >>> Brendan will only be able to fully verify the fix Monday / Tuesday.
> >>>
> >>> The commits that need to be pulled are in 

Re: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-03-07 Thread Kalderon, Michal
Hi Vlad and All, 

During Software Forge testing of the package containing the fixes, we hit a
different issue, which I believe could affect all vendors.

The problem occurs when drivers that come inbox are part of the distro's
initramfs/initrd.
In our case qed/qede are inbox, but qedr is not. This means that qed/qede are
in the initrd and are loaded automatically with an older version than the one
installed by OFED. Then, when qedr is probed, there is a mismatch, as the OFED
qedr version is newer and incompatible.
If qedr were inbox as well, we probably wouldn't even have noticed that the
OFED drivers aren't loaded (they are loaded only after an rmmod of all qed
drivers followed by a modprobe).

In our out-of-box installation scripts, we update the initramfs with the newly
compiled drivers. I'd expect to see the same at the end of an OFED build
(dracut -f / update initrd, mkinitrd, etc., for example).

We could just document that the user needs to rebuild the ramfs after OFED
installation.
I'm not sure how this didn't come up until now. I've looked a bit at our setups
and noticed that many have omit-drivers in the lsinitrd output, or older
drivers that were installed, etc., so this is easy to miss. In addition, the
only reason we hit this is that the inbox and OFED drivers are mismatched.
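
The mismatch described above can in principle be spotted by comparing the
version of each module currently loaded with the version of the module file on
disk. A minimal Python sketch follows, under these assumptions: the drivers
export a version under /sys/module/<name>/version, `modinfo -F version`
resolves to the OFED-installed file, and a simple string inequality is enough
to flag a problem. The version strings in the demonstration are made up.

```python
import subprocess
from pathlib import Path

def loaded_version(module, sysfs_root="/sys/module"):
    """Version string of the module currently loaded, or None if unavailable."""
    path = Path(sysfs_root) / module / "version"
    return path.read_text().strip() if path.is_file() else None

def installed_version(module):
    """Version string of the module file on disk, via `modinfo -F version`."""
    try:
        result = subprocess.run(["modinfo", "-F", "version", module],
                                capture_output=True, text=True)
    except FileNotFoundError:  # modinfo not available on this system
        return None
    if result.returncode != 0:
        return None
    return result.stdout.strip() or None

def find_mismatches(modules, loaded=loaded_version, installed=installed_version):
    """Map each module whose loaded and on-disk versions differ to both values."""
    mismatches = {}
    for name in modules:
        running, on_disk = loaded(name), installed(name)
        if running and on_disk and running != on_disk:
            mismatches[name] = (running, on_disk)
    return mismatches

# Demonstration with made-up version strings instead of a live system.
fake_loaded = {"qed": "8.10.0", "qede": "8.10.0"}
fake_installed = {"qed": "8.33.0", "qede": "8.33.0", "qedr": "8.33.0"}
print(find_mismatches(["qed", "qede", "qedr"],
                      loaded=fake_loaded.get, installed=fake_installed.get))
```

The lookups are passed in as parameters so the comparison can be exercised
without touching the running kernel; a module that is not loaded at all (qedr
in the demonstration) is skipped rather than reported.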

Thanks,
Michal


> -----Original Message-----
> From: Davis, Arlin R [mailto:arlin.r.da...@intel.com]
> Sent: Monday, March 05, 2018 4:34 AM
> To: Kalderon, Michal <michal.kalde...@cavium.com>;
> ewg@lists.openfabrics.org
> Cc: Woodruff, Robert J <robert.j.woodr...@intel.com>; Vladimir Sokolovsky
> <v...@mellanox.com>; Amrani, Ram <ram.amr...@cavium.com>; Rahman,
> Ameen <ameen.rah...@cavium.com>; Brendan Myers
> <brendan.my...@soft-forge.com>; Vladimir Sokolovsky
> <v...@mellanox.com>
> Subject: RE: OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific
> Time (12pm EST) - Minutes
> 
> Hello Michal,
> 
> Great progress, kudos to the team for a quick resolution. I concur, let's test
> before we roll RC3.
> 
> Vlad, please pull these fixes into new daily build so Brendan can test.
> 
> Arlin
> 
> 
> > Hi Arlin,
> >
> > We've been working with Brendan on this and were able to reproduce on
> > our setups fix, and test locally.
> > There are three commits (2 fix the issue and 1 fix was exposed that
> > our data collection had an issue)
> > 2 out of the 3 fixes have already been upstream in official linux revisions.
> > one of the fixes can't go through next as is as the code varies quite a bit.
> >
> > Brendan will only be able to fully verify the fix Monday / Tuesday.
> >
> > The commits that need to be pulled are in my github:
> >
> > https://github.com/mkalderon/ofed-compat-rdma/commit/f20134d8f4736c6ce30975bb920cf64c2ec4248d
> > https://github.com/mkalderon/ofed-compat-rdma/commit/171235eb14bf2a7bccd28650470c44807ea644e4
> > https://github.com/mkalderon/ofed-compat-rdma/commit/4c5949ba5d075d814e30dc18bd4cdd71b45c972f
> >
> > I would prefer Brendan gave this a test before rc-3. But I understand
> > we're on a tight timeframe.
> >
> > thanks,
> > Michal
> >
> > 
> > From: Davis, Arlin R <arlin.r.da...@intel.com>
> > Sent: Friday, March 2, 2018 9:50 PM
> > To: ewg@lists.openfabrics.org
> > Cc: Kalderon, Michal; Woodruff, Robert J; Vladimir Sokolovsky; Amrani,
> > Ram; Rahman, Ameen
> > Subject: RE: OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US
> > Pacific Time (12pm EST) - Minutes
> >
> > Quick update on RC3..
> >
> > Broadcom has all critical bugs fixed and included in a new daily build.
> > Thanks!
> > http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180228-1121.tgz
> >
> >
> > Our final blocking item is a critical "perftest hang" issue on a
> > Cavium
> > QL45412 RoCE adapter.
> >
> > Bug 2674<http://bugs.openfabrics.org/bugzilla/show_bug.cgi?id=2674>
> > "Unable to complete RDMA applications (perftest)".
> >
> > Michal, can we please get an ETA for the fix or a "won't fix"
> > disposition so we can push forward with RC3?
> >
> > Regards,
> >
> > Arlin
> >
> > From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of
> > Davis, Arlin R
> > Sent: Monday, February 26, 2018 1:04 PM
> > To: ewg@lists.openfabrics.org
> > Subject: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US
> > Pacific Time (12pm EST) - Minutes
> >
> > Attendees:
> >
> > Rupert Dance         SW Forge
> &g

Re: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-03-05 Thread Kalderon, Michal
Thanks Vlad

Sent from my iPhone

On 5 Mar 2018, at 16:43, Vladimir Sokolovsky <v...@dev.mellanox.co.il> wrote:

Merged + build: OFED-4.8-2-20180305-0641.tgz


Regards,

Vladimir


> On 03/05/2018 04:33 AM, Davis, Arlin R wrote:
> Hello Michal,
> 
> Great progress, kudos to the team for a quick resolution. I concur, let's 
> test before we roll RC3.
> 
> Vlad, please pull these fixes into new daily build so Brendan can test.
> 
> Arlin
> 
>  
>> Hi Arlin,
>> 
>> We've been working with Brendan on this and were able to reproduce on
>> our setups fix, and test locally.
>> There are three commits (2 fix the issue and 1 fix was exposed that our data
>> collection had an issue)
>> 2 out of the 3 fixes have already been upstream in official linux revisions.
>> one of the fixes can't go through next as is as the code varies quite a bit.
>> 
>> Brendan will only be able to fully verify the fix Monday / Tuesday.
>> 
>> The commits that need to be pulled are in my github:
>> 
>> https://github.com/mkalderon/ofed-compat-rdma/commit/f20134d8f4736c6ce30975bb920cf64c2ec4248d
>> https://github.com/mkalderon/ofed-compat-rdma/commit/171235eb14bf2a7bccd28650470c44807ea644e4
>> https://github.com/mkalderon/ofed-compat-rdma/commit/4c5949ba5d075d814e30dc18bd4cdd71b45c972f
>> 
>> I would prefer Brendan gave this a test before rc-3. But I understand we're
>> on a tight timeframe.
>> 
>> thanks,
>> Michal
>> 
>> 
>> From: Davis, Arlin R <arlin.r.da...@intel.com>
>> Sent: Friday, March 2, 2018 9:50 PM
>> To: ewg@lists.openfabrics.org
>> Cc: Kalderon, Michal; Woodruff, Robert J; Vladimir Sokolovsky; Amrani, Ram;
>> Rahman, Ameen
>> Subject: RE: OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific
>> Time (12pm EST) - Minutes
>> 
>> Quick update on RC3..
>> 
>> Broadcom has all critical bugs fixed and included in a new daily build.
>> Thanks!
>> http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180228-1121.tgz
>> 
>> 
>> Our final blocking item is a critical "perftest hang" issue on a Cavium
>> QL45412 RoCE adapter.
>> 
>> Bug 2674<http://bugs.openfabrics.org/bugzilla/show_bug.cgi?id=2674>
>> "Unable to complete RDMA applications (perftest)".
>> 
>> Michal, can we please get an ETA for the fix or a "won't fix" disposition so
>> we can push forward with RC3?
>> 
>> Regards,
>> 
>> Arlin
>> 
>> From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Davis,
>> Arlin R
>> Sent: Monday, February 26, 2018 1:04 PM
>> To: ewg@lists.openfabrics.org
>> Subject: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific
>> Time (12pm EST) - Minutes
>> 
>> Attendees:
>> 
>> Rupert Dance         SW Forge
>> Pradeep Kankipati    Broadcom
>> Robert Woodruff      Intel
>> Arlin Davis          Intel
>> Michal Kalderon      Cavium
>> Vladimir Sokolovsky  Mellanox
>> 
>> 
>> Minutes:
>> 
>> 
>> 
>> ·Opens
>> 
>> o   Broadcom's RC1 validation testing uncovered new critical bug. Fix is in 
>> the
>> works, would like to get fix into 4.8-2
>> 
>> §  Broadcom will open new bug with details. (FIO stress test caused hang)
>> 
>> 
>> 
>> ·   OFED 4.8-2 RC2 status:
>> http://downloads.openfabrics.org/OFED/ofed-4.8-2/OFED-4.8-2-rc2.tgz
>>
>> o   Release Notes:
>> http://downloads.openfabrics.org/OFED/release_notes/OFED_4.8-2-rc2-release_notes
>> 
>> o   Test Status:
>> 
>> §  Intel - RC2 build/validation (mlx4/5) RH 7.1, 7.2, 7.3, 7.4 SLES 12.1, 
>> 12.2,
>> 12.3 - Passed
>> 
>> §  VMware - RC2 validation complete - Passed
>> 
>> §  IWG interop results - new sightings for Cavium (perftest) and Broadcom
>> (FW update?).
>> 
>> ·Rupert will work with Cavium/Broadcom to get OFED inbox driver
>> versions passing.
>> 
>> ·Note: for PF 33 RoCE interop, we prefer to use OFED inbox instead of
>> out-of-box drivers.
>> 
>> o   Bugs:
>> 
>> §  All - please open new bugs for any new sighting
>> 
>> 
>> 
>> ·OFED 4.8-2 GA --  Not ready
>> 
>> o   RC3 needed for new Broadcom bug and to get PF33 RoCE interop tests
>> passing with OFED inbox drivers.
>> 
>> 
>> 
>> ·OFED next
>> 
>> o   No discussion, OFED 4.8-2 going to RC3.
>> 
>> 
>> 
>> Regards,
>> 
>> 
>> 
>> Arlin
>> 

Re: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-03-04 Thread Kalderon, Michal
Hi Arlin, 

We've been working with Brendan on this and were able to reproduce, fix, and
test the issue locally on our setups.
There are three commits: two fix the issue, and one fixes a problem that this
exposed in our data collection.
Two of the three fixes are already upstream in official Linux releases.
One of the fixes can't go through next as is, since the code varies quite a
bit.

Brendan will only be able to fully verify the fix Monday / Tuesday. 

The commits that need to be pulled are in my github: 

https://github.com/mkalderon/ofed-compat-rdma/commit/f20134d8f4736c6ce30975bb920cf64c2ec4248d
https://github.com/mkalderon/ofed-compat-rdma/commit/171235eb14bf2a7bccd28650470c44807ea644e4
https://github.com/mkalderon/ofed-compat-rdma/commit/4c5949ba5d075d814e30dc18bd4cdd71b45c972f

I would prefer that Brendan test this before RC3, but I understand we're on
a tight timeframe.

thanks,
Michal
 

From: Davis, Arlin R <arlin.r.da...@intel.com>
Sent: Friday, March 2, 2018 9:50 PM
To: ewg@lists.openfabrics.org
Cc: Kalderon, Michal; Woodruff, Robert J; Vladimir Sokolovsky; Amrani, Ram; 
Rahman, Ameen
Subject: RE: OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Quick update on RC3….

Broadcom has all critical bugs fixed and included in a new daily build. Thanks!
http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180228-1121.tgz


Our final blocking item is a critical “perftest hang” issue on a Cavium QL45412 
RoCE adapter.

Bug 2674<http://bugs.openfabrics.org/bugzilla/show_bug.cgi?id=2674> “Unable to 
complete RDMA applications (perftest)”.

Michal, can we please get an ETA for the fix or a “won’t fix” disposition so we 
can push forward with RC3?

Regards,

Arlin

From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Davis, Arlin R
Sent: Monday, February 26, 2018 1:04 PM
To: ewg@lists.openfabrics.org
Subject: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Attendees:

Rupert Dance         SW Forge
Pradeep Kankipati    Broadcom
Robert Woodruff      Intel
Arlin Davis          Intel
Michal Kalderon      Cavium
Vladimir Sokolovsky  Mellanox


Minutes:



·   Opens

o   Broadcom’s RC1 validation testing uncovered new critical bug. Fix is in the
works, would like to get fix into 4.8-2

§  Broadcom will open new bug with details. (FIO stress test caused hang)



·   OFED 4.8-2 RC2 status:
http://downloads.openfabrics.org/OFED/ofed-4.8-2/OFED-4.8-2-rc2.tgz

o   Release Notes:
http://downloads.openfabrics.org/OFED/release_notes/OFED_4.8-2-rc2-release_notes

o   Test Status:

§  Intel – RC2 build/validation (mlx4/5) RH 7.1, 7.2, 7.3, 7.4 SLES 12.1, 12.2,
12.3 – Passed

§  VMware – RC2 validation complete - Passed

§  IWG interop results – new sightings for Cavium (perftest) and Broadcom (FW
update?).

·   Rupert will work with Cavium/Broadcom to get OFED inbox driver
versions passing.

·   Note: for PF 33 RoCE interop, we prefer to use OFED inbox instead of
out-of-box drivers.

o   Bugs:

§  All - please open new bugs for any new sighting



·   OFED 4.8-2 GA --  Not ready

o   RC3 needed for new Broadcom bug and to get PF33 RoCE interop tests passing
with OFED inbox drivers.



·   OFED next

o   No discussion, OFED 4.8-2 going to RC3.



Regards,



Arlin



Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-02-06 Thread Kalderon, Michal
Hi Arlin,

Sorry for the delay, had some logistic stuff to work out.

Hope this is sufficient:
Vlad,

Please pull the following fix:
https://github.com/mkalderon/ofed-compat-rdma/commit/01d945c12286b1ad8960ffa74b64fcd256c873e7

Let me know if you would prefer that I email a patch.
I updated the bug but left it assigned to me, as I'm not sure who I'm supposed
to assign it to at this point.

Thanks,
Michal

From: Pradeep Kankipati [mailto:pradeep.kankip...@broadcom.com]
Sent: Wednesday, February 07, 2018 9:27 AM
To: Davis, Arlin R <arlin.r.da...@intel.com>; Kalderon, Michal 
<michal.kalde...@cavium.com>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>; 
ewg@lists.openfabrics.org; Vladimir Sokolovsky <v...@mellanox.com>; Woodruff, 
Robert J <robert.j.woodr...@intel.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Hi Arlin,

Sorry, just coming back today from sick leave. Let me look into this.

Thanks,
Pradeep
--

From: Davis, Arlin R [mailto:arlin.r.da...@intel.com]
Sent: Wednesday, February 7, 2018 1:18 AM
To: 'Kalderon, Michal'; 'Pradeep Kankipati'
Cc: 'Srikakulam, Venkata'; 'ewg@lists.openfabrics.org'; Vladimir Sokolovsky;
Woodruff, Robert J
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Michal and Pradeep,

OFA Interop Working Group is anxiously waiting for RC1 for Interop testing 
(scheduled to start this week).
Is it possible to get Vlad some patches soon so we can move to RC1 by end of 
the week?

Thanks,
Arlin


From: Davis, Arlin R
Sent: Thursday, February 01, 2018 11:25 AM
To: Kalderon, Michal <michal.kalde...@cavium.com>; 'ewg@lists.openfabrics.org'
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>; Pradeep Kankipati
<pradeep.kankip...@broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Michal, thanks for the update. This is what we have so far:

Bug 2662<http://bugs.openfabrics.org/show_bug.cgi?id=2662>
    Chelsio: Cannot set mtu greater than 1500 on SLES12Sp3 – Fixed/Closed, Thanks!
Bug 2668<http://bugs.openfabrics.org/show_bug.cgi?id=2668>
    Broadcom: Cannot change MTU to greater than default – No ETA on Fix
Bug 2669<http://bugs.openfabrics.org/show_bug.cgi?id=2669>
    Cavium: Cannot set mtu greater than 1500 on SLES12Sp3 – ETA for fix, next week

We will need fixes before moving to RC1.

-arlin


From: Kalderon, Michal [mailto:michal.kalde...@cavium.com]
Sent: Thursday, February 01, 2018 7:08 AM
To: Davis, Arlin R <arlin.r.da...@intel.com>; Schmidt, William R
<william.r.schm...@intel.com>; 'ewg@lists.openfabrics.org'
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>; Nikolova, Tatyana E
<tatyana.e.nikol...@intel.com>; Pradeep Kankipati
<pradeep.kankip...@broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Opened bugzilla: http://bugs.openfabrics.org/show_bug.cgi?id=2669

Will provide a patch to qedr next week.

Thanks,
Michal

From: Davis, Arlin R [mailto:arlin.r.da...@intel.com]
Sent: Wednesday, January 31, 2018 8:56 PM
To: Kalderon, Michal <michal.kalde...@cavium.com>; Schmidt, William R
<william.r.schm...@intel.com>; 'ewg@lists.openfabrics.org'
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>; Nikolova, Tatyana E
<tatyana.e.nikol...@intel.com>; Pradeep Kankipati
<pradeep.kankip...@broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

True, thanks for catching this. Please open a qedr bug so we can track as 
critical/blocking.

Tatyana and Pradeep, do Intel and/or Broadcom drivers need similar changes?

Anyone else?


From: Kalderon, Michal [mailto:michal.kalde...@cavium.com]
Sent: Wednesday, January 31, 2018 10:29 AM
To: Davis, Arlin R <arlin.r.da...@intel

[ewg] changes to compat-rdma

2018-02-06 Thread Kalderon, Michal
Hi Vlad,

When do changes need to be provided as a patch in linux-next-cherry-pick, and
when can they be made directly as a fix to the driver files in linux-4.8 or in
compat-rdma?

Thanks,
Michal


Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-02-01 Thread Kalderon, Michal
Opened bugzilla: http://bugs.openfabrics.org/show_bug.cgi?id=2669

Will provide a patch to qedr next week.

Thanks,
Michal

From: Davis, Arlin R [mailto:arlin.r.da...@intel.com]
Sent: Wednesday, January 31, 2018 8:56 PM
To: Kalderon, Michal <michal.kalde...@cavium.com>; Schmidt, William R 
<william.r.schm...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>; Nikolova, Tatyana E 
<tatyana.e.nikol...@intel.com>; Pradeep Kankipati 
<pradeep.kankip...@broadcom.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

True, thanks for catching this. Please open a qedr bug so we can track as 
critical/blocking.

Tatyana and Pradeep, do Intel and/or Broadcom drivers need similar changes?

Anyone else?


From: Kalderon, Michal [mailto:michal.kalde...@cavium.com]
Sent: Wednesday, January 31, 2018 10:29 AM
To: Davis, Arlin R <arlin.r.da...@intel.com>; Schmidt, William R 
<william.r.schm...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

We assumed this was a generic issue since it happened on several adapters. I see 
now from the bug resolution that Steve made changes specific to cxgb3.
We need to make similar changes in qede.

Thanks,
Michal


From: Davis, Arlin R [mailto:arlin.r.da...@intel.com]
Sent: Wednesday, January 31, 2018 8:10 PM
To: Kalderon, Michal <michal.kalde...@cavium.com>; Schmidt, William R 
<william.r.schm...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes


Are you using the latest daily builds? Steve Wise reported this and forwarded 
patches to Vlad on Jan 19th.



Fix went into build: 
http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180122-1411.tgz


Bug: http://bugs.openfabrics.org/show_bug.cgi?id=2662

Please let us know if you still have issues with latest builds.

-arlin


From: Kalderon, Michal [mailto:michal.kalde...@cavium.com]
Sent: Wednesday, January 31, 2018 9:15 AM
To: Schmidt, William R <william.r.schm...@intel.com>; Davis, Arlin R 
<arlin.r.da...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Hi

We're seeing an issue on SLES12SP3 with modifying mtu.


linux-p4eo:~ # ifconfig eth7 mtu 9000

SIOCSIFMTU: Invalid argument

Dmesg: eth7: Invalid MTU 9000 requested, hw max 1500

We've seen this with other vendors' devices as well.
Chelsio reported an issue in the past regarding MTU changes,
but I didn't see any related bugs open or discussions.

Has this been discussed since?
Attaching the relevant email.

Thanks,
Michal


From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Schmidt, 
William R
Sent: Monday, January 29, 2018 10:15 PM
To: Davis, Arlin R <arlin.r.da...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific 
Time (12pm EST) - Minutes

Yes please. The omitted commits are listed in the OFED bugs.

From: Davis, Arlin R
Sent: Monday, January 29, 2018 2:06 PM
To: Schmidt, William R <william.r.schm...@intel.com>; 
'ewg@lists.openfabrics.org' <ewg@lists.openfabrics.org>
Cc: v...@dev.mellanox.co.il; Woodruff, Robert J <robert.j.woodr...@intel.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Bill, thanks for the update. Do you need Vlad's help getting these fixes into 
OFED 4.8 compat-rdma?

From: Schmidt, William R
Sent: Monday, January 29, 2018 10:37 AM
To: Davis, Arlin R <arlin.r.da...@intel.com>; 
'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org<

Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-01-31 Thread Kalderon, Michal
We assumed this was a generic issue since it happened on several adapters. I see 
now from the bug resolution that Steve made changes specific to cxgb3.
We need to make similar changes in qede.

Thanks,
Michal


From: Davis, Arlin R [mailto:arlin.r.da...@intel.com]
Sent: Wednesday, January 31, 2018 8:10 PM
To: Kalderon, Michal <michal.kalde...@cavium.com>; Schmidt, William R 
<william.r.schm...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes


Are you using the latest daily builds? Steve Wise reported this and forwarded 
patches to Vlad on Jan 19th.



Fix went into build: 
http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180122-1411.tgz


Bug: http://bugs.openfabrics.org/show_bug.cgi?id=2662

Please let us know if you still have issues with latest builds.

-arlin


From: Kalderon, Michal [mailto:michal.kalde...@cavium.com]
Sent: Wednesday, January 31, 2018 9:15 AM
To: Schmidt, William R <william.r.schm...@intel.com>; Davis, Arlin R 
<arlin.r.da...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Cc: Srikakulam, Venkata <venkata.srikaku...@cavium.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Hi

We're seeing an issue on SLES12SP3 with modifying mtu.


linux-p4eo:~ # ifconfig eth7 mtu 9000

SIOCSIFMTU: Invalid argument

Dmesg: eth7: Invalid MTU 9000 requested, hw max 1500

We've seen this with other vendors' devices as well.
Chelsio reported an issue in the past regarding MTU changes,
but I didn't see any related bugs open or discussions.

Has this been discussed since?
Attaching the relevant email.

Thanks,
Michal


From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Schmidt, 
William R
Sent: Monday, January 29, 2018 10:15 PM
To: Davis, Arlin R <arlin.r.da...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific 
Time (12pm EST) - Minutes

Yes please. The omitted commits are listed in the OFED bugs.

From: Davis, Arlin R
Sent: Monday, January 29, 2018 2:06 PM
To: Schmidt, William R <william.r.schm...@intel.com>; 
'ewg@lists.openfabrics.org' <ewg@lists.openfabrics.org>
Cc: v...@dev.mellanox.co.il; Woodruff, Robert J <robert.j.woodr...@intel.com>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Bill, thanks for the update. Do you need Vlad's help getting these fixes into 
OFED 4.8 compat-rdma?

From: Schmidt, William R
Sent: Monday, January 29, 2018 10:37 AM
To: Davis, Arlin R <arlin.r.da...@intel.com>; 'ewg@lists.openfabrics.org' 
<ewg@lists.openfabrics.org>
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes


>>2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 - 
>>Need update from Bill Schmidt (Intel)

Bug 2664 <http://bugs.openfabrics.org/show_bug.cgi?id=2664> - Bonding doesn't 
work on RHEL 7.4

The bonding driver from RHEL 7.4 has integrated commit: 
https://github.com/torvalds/linux/commit/b5bf0f5b16b9c316c34df9f31d4be8729eb86845

This requires the ipoib driver to return the correct speed and duplex mode. The 
commit adding this feature to the ipoib driver:

https://github.com/torvalds/linux/commit/0d7e2d2166f6b0b7d1959ca858052a15feb574cc
 was added in the 4.12 kernel, so it is missing from OFED 4.8 compat-rdma.

As a consequence, the bonding driver cannot retrieve the required data and fails.

Bug 2665 <http://bugs.openfabrics.org/show_bug.cgi?id=2665> - Bonding causes 
kernel panic on SLES 12.1 and SLES 12.2

The bonding driver from SLES 12.1/12.2 is missing the kernel panic fix commit: 
https://github.com/torvalds/linux/commit/1533e77315220dc1d5ec3bd6d9fe32e2aa0a74c0

added in Linux kernel 4.8. This makes it incompatible with the ipoib drivers from 
OFED 4.8 compat-rdma.
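
For Bug 2664, one way to see what the bonding driver has to work with is to read the speed and duplex values the kernel exposes for the IPoIB interface. The following is only a minimal sketch, assuming the standard `/sys/class/net/<iface>/speed` and `duplex` attributes are present. The helper name `link_settings_ok` and the `SYSFS_NET` override are hypothetical, and the check only approximates whatever validation the bonding driver itself performs.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if the interface reports a usable
# speed and duplex. SYSFS_NET can be overridden so the function can be
# exercised without real hardware (assumption for illustration only).
link_settings_ok() {
    dir="${SYSFS_NET:-/sys/class/net}/$1"
    # Reading fails if the driver does not provide link settings at all.
    speed=$(cat "$dir/speed" 2>/dev/null) || return 1
    duplex=$(cat "$dir/duplex" 2>/dev/null) || return 1
    # An unknown speed shows up as -1, an unknown duplex as "unknown".
    [ -n "$speed" ] && [ "$speed" -gt 0 ] 2>/dev/null && [ "$duplex" != "unknown" ]
}

# Check every interface named on the command line, e.g. ./check.sh ib0 ib1
for dev in "$@"; do
    if link_settings_ok "$dev"; then
        echo "$dev: speed/duplex available"
    else
        echo "$dev: speed/duplex unavailable"
    fi
done
```

If an IPoIB interface reports "unavailable" here, that would be consistent with the missing ipoib commit described above, though it does not prove it.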


From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Davis, Arlin R
Sent: Monday, January 29, 2018 11:57 AM
To: 'ewg@lists.openfabrics.org' <ewg@lists.openfabrics.org>
Subject: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Attendees  Company
Pradeep Kankipati Broa

Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

2018-01-31 Thread Kalderon, Michal
Hi

We're seeing an issue on SLES12SP3 with modifying mtu.


linux-p4eo:~ # ifconfig eth7 mtu 9000

SIOCSIFMTU: Invalid argument

Dmesg: eth7: Invalid MTU 9000 requested, hw max 1500

We've seen this with other vendors' devices as well.
Chelsio reported an issue in the past regarding MTU changes,
but I didn't see any related bugs open or discussions.

Has this been discussed since?
Attaching the relevant email.

Thanks,
Michal
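
The kernel log line in the report already states the limit that was enforced, so a requested MTU can be checked against it before retrying. The following is a minimal sketch that parses that log line. The helper names `hw_max_from_dmesg` and `mtu_in_range` are hypothetical, and the lower bound of 68 is an assumption (the conventional Ethernet minimum), not something stated in the thread.

```shell
#!/bin/sh
# Hypothetical helper: pull the advertised hardware maximum out of a
# kernel log line of the form quoted in the report.
hw_max_from_dmesg() {
    printf '%s\n' "$1" |
        sed -n 's/.*Invalid MTU [0-9]* requested, hw max \([0-9]*\).*/\1/p'
}

# Hypothetical helper: succeed only if min <= requested <= max.
mtu_in_range() {
    req=$1; min=$2; max=$3
    [ "$req" -ge "$min" ] && [ "$req" -le "$max" ]
}

line='eth7: Invalid MTU 9000 requested, hw max 1500'
max=$(hw_max_from_dmesg "$line")

# 68 is an assumed minimum MTU, used here for illustration only.
if mtu_in_range 9000 68 "$max"; then
    echo "MTU 9000 accepted"
else
    echo "MTU 9000 rejected: hw max is $max"
fi
```

With the log line from the report, the sketch takes the rejection branch, since 9000 exceeds the reported maximum of 1500.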


From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Schmidt, 
William R
Sent: Monday, January 29, 2018 10:15 PM
To: Davis, Arlin R; 'ewg@lists.openfabrics.org'
Subject: Re: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific 
Time (12pm EST) - Minutes

Yes please. The omitted commits are listed in the OFED bugs.

From: Davis, Arlin R
Sent: Monday, January 29, 2018 2:06 PM
To: Schmidt, William R; 'ewg@lists.openfabrics.org'
Cc: v...@dev.mellanox.co.il; Woodruff, Robert J
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Bill, thanks for the update. Do you need Vlad's help getting these fixes into 
OFED 4.8 compat-rdma?

From: Schmidt, William R
Sent: Monday, January 29, 2018 10:37 AM
To: Davis, Arlin R; 'ewg@lists.openfabrics.org'
Subject: RE: OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes


>>2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 - 
>>Need update from Bill Schmidt (Intel)

Bug 2664 - Bonding doesn't 
work on RHEL 7.4

The bonding driver from RHEL 7.4 has integrated commit: 
https://github.com/torvalds/linux/commit/b5bf0f5b16b9c316c34df9f31d4be8729eb86845

This requires the ipoib driver to return the correct speed and duplex mode. The 
commit adding this feature to the ipoib driver:

https://github.com/torvalds/linux/commit/0d7e2d2166f6b0b7d1959ca858052a15feb574cc
 was added in the 4.12 kernel, so it is missing from OFED 4.8 compat-rdma.

As a consequence, the bonding driver cannot retrieve the required data and fails.

Bug 2665 - Bonding causes 
kernel panic on SLES 12.1 and SLES 12.2

The bonding driver from SLES 12.1/12.2 is missing the kernel panic fix commit: 
https://github.com/torvalds/linux/commit/1533e77315220dc1d5ec3bd6d9fe32e2aa0a74c0

added in Linux kernel 4.8. This makes it incompatible with the ipoib drivers from 
OFED 4.8 compat-rdma.


From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Davis, Arlin R
Sent: Monday, January 29, 2018 11:57 AM
To: 'ewg@lists.openfabrics.org'
Subject: [ewg] OFA EWG Meeting: Monday, Jan 29, 2017, 09:00 AM US Pacific Time 
(12pm EST) - Minutes

Attendees            Company
Pradeep Kankipati    Broadcom
Steve Wise           Chelsio
Robert Woodruff      Intel
Arlin Davis          Intel
Vladimir Sokolovsky  Mellanox
Ariel Elior          Cavium
Michal Kalderon      Cavium
Michael Rice         HPE



Minutes:



  *   Opens - none



  *   OFED 4.8-2:  
http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180124-0818.tgz



Status:  RH7.4 and SLES12.3 backports added.

Updated packages:  rdma_core-v16, perftest 4.1-0.2, libfabric 1.5.3

Installation changes:  --without-depcheck docs, vmw_pvrdma moved out of 
tech preview

Test Status:  Intel - build on RH 7.0, 7.1, 7.2, 7.3, 7.4 and SLES 12, 12.1, 
12.2, 12.3 - Passed



Known issues to be resolved before RC1:



1. Bug #2663 - (P1) rping fails, iwpmd hitting segfault on SLES12.3 -

*   Chelsio validation team hits bug, engineering team cannot reproduce.
*   Steve (Chelsio) needs help from Tatyana's (Intel) team to reproduce 
and isolate.

2. New bugs:  True Scale qib bonding issue on RH 7.4, SLES 12.1 and 12.2 - Need 
update from Bill Schmidt (Intel)



  *   OFED 4.8-2 RC1 schedule: (2 blocking bugs)



Plan is to clean up bugs this week and push hard for RC1 by Friday.

The GA plan is to go from RC1 to GA, with 1-2 weeks of RC1 validation 
and a GA target of Feb 16th.

Board approved OFED 4.8-2 so we can move to GA as soon as EWG is ready.



Regards,



Arlin


--- Begin Message ---
Happy New Year everyone! Since Jan 1st and 15th fall on US holidays, we will 
need to update status and make progress via the ewg email list. 

Please provide status where applicable.

- Board Approval for OFED 4.8-2 GA:
Status:  Approved by board.

- Problem with OFED