Thank you, Kai-Heng Feng. Really appreciate it.

Currently I'm under a lot of pressure at work. But I will try this in the
next days, to see if it fixes the problem for us. My network still have the
same condition and my previous kernel versions are still breaking. So, it
should be easy to reproduce.
Will write back reporting as soon as I can.

Thank you again,
Paulo


On Tue, Jul 2, 2019, 03:15 Kai-Heng Feng <kai.heng.f...@canonical.com>
wrote:

> Latest kernels in Xenial, Bionic, Cosmic and Disco have the following
> commit:
> commit 3a498606bb04af603a46ebde8296040b2de350d1
> Author: Sanjeev Bansal <sanjeevb.ban...@broadcom.com>
> Date:   Mon Jul 16 11:13:32 2018 +0530
>
>     tg3: Add higher cpu clock for 5762.
>
>     This patch has fix for TX timeout while running bi-directional
>     traffic with 100 Mbps using 5762.
>
>     Signed-off-by: Sanjeev Bansal <sanjeevb.ban...@broadcom.com>
>     Signed-off-by: Siva Reddy Kallam <siva.kal...@broadcom.com>
>     Reviewed-by: Michael Chan <michael.c...@broadcom.com>
>     Signed-off-by: David S. Miller <da...@davemloft.net>
>
> ** Changed in: linux (Ubuntu)
>        Status: Triaged => Fix Released
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1447664
>
> Title:
>   14e4:1687 broadcom tg3 network driver disconnects under high load
>
> Status in linux package in Ubuntu:
>   Fix Released
> Status in linux package in Debian:
>   New
>
> Bug description:
>   The tg3 broadcom network driver that binds with chipset 5762 goes
> offline and unable to recover (even with tg3 watchdog timeout) when network
> transmit is under high load.  Call trace:
>   https://launchpadlibrarian.net/204185480/dmesg
>
>   When this happens, only a reboot would be able to fix it.  Sometimes,
>   however, bringing the interface offline and online (via ifconfig)
>   would recover networking.  I've also tested with the latest tg3 driver
>   (dec 2014 version) and networking is still problematic.  I have also
>   disabled TSO, GSO etc... with ethtool and the bug still surfaces.
>   This bug may be related to the integrated Firmware.
>
>   Here is the procedure to replicate the issue because it is hard to
>   replicate it under moderate network load.
>
>   1. Bootup a machine with a broadcom 5762 NIC (ie. HP DeskElite 705)
> using a Ubuntu/Kubunu Live CD 14.04-15.04.
>   2. from another machine: start 5 sessions, repetitively copy (scp with
> public key authentication) a 70 meg file back and forth to the tg3 machine
> in each session. (not sure if this is necessary)
>   3. create a 1GB file on the tg3 machine, with something like dd
> if=/dev/urandom of=/my/test/file bs=1024 count=$((1024*1000))
>   4. from another machine: repetitively scp copy that 1GB file from the
> tg3 machine. This can be done with something like:
>
>   while [ 0 ]; do
>      scp -i /my/scp/private.key u...@ip.of.tg3:/my/test/file /tmp
>   done;
>
>   Networking will mostly goes offline in about 10-30 minutes.
>
>   WORKAROUND: Add udev rule to make the changes permanent in
> /etc/udev/rules.d/80-tg3-fix.rules :
>   ACTION=="add", SUBSYSTEM=="net", ATTRS{vendor}=="0x14e4",
> ATTRS{device}=="0x1687", RUN+="/sbin/ethtool -K %k highdma off"
>
>   ProblemType: Bug
>   DistroRelease: Ubuntu 15.04
>   Package: linux-image-3.19.0-15-generic 3.19.0-15.15
>   ProcVersionSignature: Ubuntu 3.19.0-15.15-generic 3.19.3
>   Uname: Linux 3.19.0-15-generic x86_64
>   ApportVersion: 2.17.2-0ubuntu1
>   Architecture: amd64
>   AudioDevicesInUse:
>    USER        PID ACCESS COMMAND
>    /dev/snd/controlC1:  kubuntu    3748 F.... pulseaudio
>    /dev/snd/controlC0:  kubuntu    3748 F.... pulseaudio
>   CasperVersion: 1.360
>   Date: Thu Apr 23 11:16:24 2015
>   IwConfig:
>    eth0      no wireless extensions.
>
>    lo        no wireless extensions.
>   LiveMediaBuild: Kubuntu 15.04 "Vivid Vervet" - Release amd64 (20150422)
>   MachineType: Hewlett-Packard HP EliteDesk 705 G1 MT
>   ProcEnviron:
>    LANGUAGE=
>    TERM=xterm
>    PATH=(custom, no user)
>    LANG=en_US.UTF-8
>    SHELL=/bin/bash
>   ProcFB: 0 radeondrmfb
>   ProcKernelCmdLine: BOOT_IMAGE=/casper/vmlinuz.efi
> file=/cdrom/preseed/hostname.seed boot=casper maybe-ubiquity quiet splash
> ---
>   PulseList:
>    Error: command ['pacmd', 'list'] failed with exit code 1: Home
> directory not accessible: Permission denied
>    No PulseAudio daemon running, or not running as session daemon.
>   RelatedPackageVersions:
>    linux-restricted-modules-3.19.0-15-generic N/A
>    linux-backports-modules-3.19.0-15-generic  N/A
>    linux-firmware                             1.143
>   RfKill:
>
>   SourcePackage: linux
>   UdevLog: Error: [Errno 2] No such file or directory: '/var/log/udev'
>   UpgradeStatus: No upgrade log present (probably fresh install)
>   dmi.bios.date: 10/22/2014
>   dmi.bios.vendor: Hewlett-Packard
>   dmi.bios.version: L06 v02.15
>   dmi.board.asset.tag: 2UA5041TG4
>   dmi.board.name: 2215
>   dmi.board.vendor: Hewlett-Packard
>   dmi.chassis.asset.tag: 2UA5041TG4
>   dmi.chassis.type: 6
>   dmi.chassis.vendor: Hewlett-Packard
>   dmi.modalias:
> dmi:bvnHewlett-Packard:bvrL06v02.15:bd10/22/2014:svnHewlett-Packard:pnHPEliteDesk705G1MT:pvr:rvnHewlett-Packard:rn2215:rvr:cvnHewlett-Packard:ct6:cvr:
>   dmi.product.name: HP EliteDesk 705 G1 MT
>   dmi.sys.vendor: Hewlett-Packard
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1447664/+subscriptions
>

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1447664

Title:
  14e4:1687 broadcom tg3 network driver disconnects under high load

Status in linux package in Ubuntu:
  Fix Released
Status in linux package in Debian:
  New

Bug description:
  The tg3 broadcom network driver that binds with chipset 5762 goes offline and 
unable to recover (even with tg3 watchdog timeout) when network transmit is 
under high load.  Call trace:
  https://launchpadlibrarian.net/204185480/dmesg

  When this happens, only a reboot would be able to fix it.  Sometimes,
  however, bringing the interface offline and online (via ifconfig)
  would recover networking.  I've also tested with the latest tg3 driver
  (dec 2014 version) and networking is still problematic.  I have also
  disabled TSO, GSO etc... with ethtool and the bug still surfaces.
  This bug may be related to the integrated Firmware.

  Here is the procedure to replicate the issue because it is hard to
  replicate it under moderate network load.

  1. Bootup a machine with a broadcom 5762 NIC (ie. HP DeskElite 705) using a 
Ubuntu/Kubunu Live CD 14.04-15.04.
  2. from another machine: start 5 sessions, repetitively copy (scp with public 
key authentication) a 70 meg file back and forth to the tg3 machine in each 
session. (not sure if this is necessary)
  3. create a 1GB file on the tg3 machine, with something like dd 
if=/dev/urandom of=/my/test/file bs=1024 count=$((1024*1000))
  4. from another machine: repetitively scp copy that 1GB file from the tg3 
machine. This can be done with something like:

  while [ 0 ]; do
     scp -i /my/scp/private.key u...@ip.of.tg3:/my/test/file /tmp
  done;

  Networking will mostly goes offline in about 10-30 minutes.

  WORKAROUND: Add udev rule to make the changes permanent in 
/etc/udev/rules.d/80-tg3-fix.rules :
  ACTION=="add", SUBSYSTEM=="net", ATTRS{vendor}=="0x14e4", 
ATTRS{device}=="0x1687", RUN+="/sbin/ethtool -K %k highdma off"

  ProblemType: Bug
  DistroRelease: Ubuntu 15.04
  Package: linux-image-3.19.0-15-generic 3.19.0-15.15
  ProcVersionSignature: Ubuntu 3.19.0-15.15-generic 3.19.3
  Uname: Linux 3.19.0-15-generic x86_64
  ApportVersion: 2.17.2-0ubuntu1
  Architecture: amd64
  AudioDevicesInUse:
   USER        PID ACCESS COMMAND
   /dev/snd/controlC1:  kubuntu    3748 F.... pulseaudio
   /dev/snd/controlC0:  kubuntu    3748 F.... pulseaudio
  CasperVersion: 1.360
  Date: Thu Apr 23 11:16:24 2015
  IwConfig:
   eth0      no wireless extensions.

   lo        no wireless extensions.
  LiveMediaBuild: Kubuntu 15.04 "Vivid Vervet" - Release amd64 (20150422)
  MachineType: Hewlett-Packard HP EliteDesk 705 G1 MT
  ProcEnviron:
   LANGUAGE=
   TERM=xterm
   PATH=(custom, no user)
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 radeondrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/casper/vmlinuz.efi 
file=/cdrom/preseed/hostname.seed boot=casper maybe-ubiquity quiet splash ---
  PulseList:
   Error: command ['pacmd', 'list'] failed with exit code 1: Home directory not 
accessible: Permission denied
   No PulseAudio daemon running, or not running as session daemon.
  RelatedPackageVersions:
   linux-restricted-modules-3.19.0-15-generic N/A
   linux-backports-modules-3.19.0-15-generic  N/A
   linux-firmware                             1.143
  RfKill:

  SourcePackage: linux
  UdevLog: Error: [Errno 2] No such file or directory: '/var/log/udev'
  UpgradeStatus: No upgrade log present (probably fresh install)
  dmi.bios.date: 10/22/2014
  dmi.bios.vendor: Hewlett-Packard
  dmi.bios.version: L06 v02.15
  dmi.board.asset.tag: 2UA5041TG4
  dmi.board.name: 2215
  dmi.board.vendor: Hewlett-Packard
  dmi.chassis.asset.tag: 2UA5041TG4
  dmi.chassis.type: 6
  dmi.chassis.vendor: Hewlett-Packard
  dmi.modalias: 
dmi:bvnHewlett-Packard:bvrL06v02.15:bd10/22/2014:svnHewlett-Packard:pnHPEliteDesk705G1MT:pvr:rvnHewlett-Packard:rn2215:rvr:cvnHewlett-Packard:ct6:cvr:
  dmi.product.name: HP EliteDesk 705 G1 MT
  dmi.sys.vendor: Hewlett-Packard

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1447664/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to