Hi Arlin,

We've been working with Brendan on this and were able to reproduce the issue on our setups, fix it, and test the fix locally. There are three commits: two fix the issue itself, and one fixes a problem this exposed in our data collection. Two of the three fixes are already upstream in official Linux revisions. One of the fixes can't go through next as is, because the code differs quite a bit.
Brendan will only be able to fully verify the fix on Monday or Tuesday. The commits that need to be pulled are in my GitHub:

https://github.com/mkalderon/ofed-compat-rdma/commit/f20134d8f4736c6ce30975bb920cf64c2ec4248d
https://github.com/mkalderon/ofed-compat-rdma/commit/171235eb14bf2a7bccd28650470c44807ea644e4
https://github.com/mkalderon/ofed-compat-rdma/commit/4c5949ba5d075d814e30dc18bd4cdd71b45c972f

I would prefer that Brendan test this before rc-3, but I understand we're on a tight timeframe.

Thanks,
Michal

________________________________________
From: Davis, Arlin R <arlin.r.da...@intel.com>
Sent: Friday, March 2, 2018 9:50 PM
To: ewg@lists.openfabrics.org
Cc: Kalderon, Michal; Woodruff, Robert J; Vladimir Sokolovsky; Amrani, Ram; Rahman, Ameen
Subject: RE: OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Quick update on RC3….

Broadcom has all critical bugs fixed and included in a new daily build. Thanks!
http://downloads.openfabrics.org/OFED/ofed-4.8-2-daily/OFED-4.8-2-20180228-1121.tgz

Our final blocking item is a critical “perftest hang” issue on a Cavium QL45412 RoCE adapter.
Bug 2674 <http://bugs.openfabrics.org/bugzilla/show_bug.cgi?id=2674> “Unable to complete RDMA applications (perftest)”.

Michal, can we please get an ETA for the fix or a “won’t fix” disposition so we can push forward with RC3?

Regards,
Arlin

From: ewg [mailto:ewg-boun...@lists.openfabrics.org] On Behalf Of Davis, Arlin R
Sent: Monday, February 26, 2018 1:04 PM
To: ewg@lists.openfabrics.org
Subject: [ewg] OFA EWG Meeting: Monday, Feb 26, 2017, 09:00 AM US Pacific Time (12pm EST) - Minutes

Attendees:
Rupert Dance          SW Forge
Pradeep Kankipati     Broadcom
Robert Woodruff       Intel
Arlin Davis           Intel
Michal Kalderon       Cavium
Vladimir Sokolovsky   Mellanox

Minutes:

· Opens
  o Broadcom’s RC1 validation testing uncovered new critical bug. Fix is in the works, would like to get fix into 4.8-2
    § Broadcom will open new bug with details. (FIO stress test caused hang)
· OFED 4.8-2 RC2 status: http://downloads.openfabrics.org/OFED/ofed-4.8-2/OFED-4.8-2-rc2.tgz
  o Release Notes: http://downloads.openfabrics.org/OFED/release_notes/OFED_4.8-2-rc2-release_notes
  o Test Status:
    § Intel – RC2 build/validation (mlx4/5) RH 7.1, 7.2, 7.3, 7.4 SLES 12.1, 12.2, 12.3 – Passed
    § VMware – RC2 validation complete - Passed
    § IWG interop results – new sightings for Cavium (perftest) and Broadcom (FW update?).
      · Rupert will work with Cavium/Broadcom to get OFED inbox driver versions passing.
      · Note: for PF 33 RoCE interop, we prefer to use OFED inbox instead of out-of-box drivers.
  o Bugs:
    § All - please open new bugs for any new sighting
· OFED 4.8-2 GA -- Not ready
  o RC3 needed for new Broadcom bug and to get PF33 RoCE interop tests passing with OFED inbox drivers.
· OFED next
  o No discussion, OFED 4.8-2 going to RC3.

Regards,
Arlin

_______________________________________________
ewg mailing list
ewg@lists.openfabrics.org
http://lists.openfabrics.org/mailman/listinfo/ewg