Re: [Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
> > There is one additional upstream commit required to fully fix this, > please see bug 1713553. > Ah, nice! Thanks for pointing it out, i had not found it myself. I solved my problems temporarily by adding i40e 2.0.30 driver as dkms to my current system and will follow the other bug (the i40e 2.1.26 leaked memory at an alarming rate, but that's a story for a different bugreport maybe). -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Fix Released Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
> To me it looks like this issue isn't completely solved since I'm running a > more > recent kernel than the "fix" was commited to. There is one additional upstream commit required to fully fix this, please see bug 1713553. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Fix Released Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
I forgot to mention in previous comment, that this happens within the hour (usually just a few minutes) of adding the server to production loads. I can provide more information and test patches if necessary. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Fix Released Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
I'm running Xenial with kernel 4.4.0-92.115 on a Dell R330 with intel X710 NIC. Under load it fails with message: "TX driver issue detected, PF reset issued" as original report says. To me it looks like this issue isn't completely solved since I'm running a more recent kernel than the "fix" was commited to. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Fix Released Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
** Tags removed: verification-needed-xenial ** Tags added: verification-done-xenial -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Fix Released Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
This bug was fixed in the package linux - 4.4.0-89.112 --- linux (4.4.0-89.112) xenial; urgency=low * CVE-2017-7533 - dentry name snapshots linux (4.4.0-88.111) xenial; urgency=low * linux: 4.4.0-88.111 -proposed tracker (LP: #1705270) * [Xenial] nvme: Quirks for PM1725 controllers (LP: #1704435) - nvme: Quirks for PM1725 controllers * Upgrade Redpine WLAN/BT driver to ver. 1.2 (production release) (LP: #1697829) - SAUCE: Redpine: Upgrade to ver. 1.2 production release * ubuntu/rsi driver has several issues as picked up by static analysis (LP: #1694733) - SAUCE: Redpine: Upgrade to ver. 1.2 production release * Redpine vendor driver - Switching to AP mode causes kernel panic (LP: #1700941) - SAUCE: Redpine: Upgrade to ver. 1.2 production release * CVE-2017-10810 - drm/virtio: don't leak bo on drm_gem_object_init failure * Ath10k to read different board data file if specify in SMBIOS (LP: #1666742) - ath10k: search SMBIOS for OEM board file extension * make snap-pkg support (LP: #1700747) - SAUCE: make snap-pkg support * ISST-LTE: Briggs:Stratton:UbuntuKVM: ics_opal_set_affinity on host kernel log using Intel X710 (i40e driver) (LP: #1703663) - i40e: use valid online CPU on q_vector initialization * Update snapcraft.yaml (LP: #1700480) - snapcraft.yaml: various improvements * Xenial update to 4.4.76 stable release (LP: #1702863) - ipv6: release dst on error in ip6_dst_lookup_tail - net: don't call strlen on non-terminated string in dev_set_alias() - decnet: dn_rtmsg: Improve input length sanitization in dnrmg_receive_user_skb - net: Zero ifla_vf_info in rtnl_fill_vfinfo() - af_unix: Add sockaddr length checks before accessing sa_family in bind and connect handlers - Fix an intermittent pr_emerg warning about lo becoming free. - net: caif: Fix a sleep-in-atomic bug in cfpkt_create_pfx - igmp: acquire pmc lock for ip_mc_clear_src() - igmp: add a missing spin_lock_init() - ipv6: fix calling in6_ifa_hold incorrectly for dad work - net/mlx5: Wait for FW readiness before initializing command interface - decnet: always not take dst->__refcnt when inserting dst into hash table - net: 8021q: Fix one possible panic caused by BUG_ON in free_netdev - sfc: provide dummy definitions of vswitch functions - ipv6: Do not leak throw route references - rtnetlink: add IFLA_GROUP to ifla_policy - netfilter: xt_TCPMSS: add more sanity tests on tcph->doff - netfilter: synproxy: fix conntrackd interaction - NFSv4: fix a reference leak caused WARNING messages - drm/ast: Handle configuration without P2A bridge - mm, swap_cgroup: reschedule when neeed in swap_cgroup_swapoff() - MIPS: Avoid accidental raw backtrace - MIPS: pm-cps: Drop manual cache-line alignment of ready_count - MIPS: Fix IRQ tracing & lockdep when rescheduling - ALSA: hda - Fix endless loop of codec configure - ALSA: hda - set input_path bitmap to zero after moving it to new place - drm/vmwgfx: Free hash table allocated by cmdbuf managed res mgr - usb: gadget: f_fs: Fix possibe deadlock - sysctl: enable strict writes - mm: numa: avoid waiting on freed migrated pages - KVM: x86: fix fixing of hypercalls - scsi: sd: Fix wrong DPOFUA disable in sd_read_cache_type - scsi: lpfc: Set elsiocb contexts to NULL after freeing it - qla2xxx: Fix erroneous invalid handle message - ARM: dts: BCM5301X: Correct GIC_PPI interrupt flags - net: mvneta: Fix for_each_present_cpu usage - MIPS: ath79: fix regression in PCI window initialization - net: korina: Fix NAPI versus resources freeing - MIPS: ralink: MT7688 pinmux fixes - MIPS: ralink: fix USB frequency scaling - MIPS: ralink: Fix invalid assignment of SoC type - MIPS: ralink: fix MT7628 pinmux typos - MIPS: ralink: fix MT7628 wled_an pinmux gpio - mtd: bcm47xxpart: limit scanned flash area on BCM47XX (MIPS) only - bgmac: fix a missing check for build_skb - mtd: bcm47xxpart: don't fail because of bit-flips - bgmac: Fix reversed test of build_skb() return value. - net: bgmac: Fix SOF bit checking - net: bgmac: Start transmit queue in bgmac_open - net: bgmac: Remove superflous netif_carrier_on() - powerpc/eeh: Enable IO path on permanent error - gianfar: Do not reuse pages from emergency reserve - Btrfs: fix truncate down when no_holes feature is enabled - virtio_console: fix a crash in config_work_handler - swiotlb-xen: update dev_addr after swapping pages - xen-netfront: Fix Rx stall during network stress and OOM - scsi: virtio_scsi: Reject commands when virtqueue is broken - platform/x86: ideapad-laptop: handle ACPI event 1 - amd-xgbe: Check xgbe_init() return code - net: dsa: Check return value of phy_connect_direct() - drm/amdgpu: check ring being ready before using
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed- xenial' to 'verification-done-xenial'. If the problem still exists, change the tag 'verification-needed-xenial' to 'verification-failed- xenial'. If verification is not done by 5 working days from today, this fix will be dropped from the source code, and this bug will be closed. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you! ** Tags added: verification-needed-xenial -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Fix Committed Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
** Changed in: linux (Ubuntu Xenial) Status: Confirmed => Fix Committed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Fix Committed Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
Status changed to 'Confirmed' because the bug affects multiple users. ** Changed in: linux (Ubuntu Xenial) Status: New => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: Confirmed Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
** Also affects: linux (Ubuntu Xenial) Importance: Undecided Status: New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Status in linux source package in Xenial: New Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp
[Kernel-packages] [Bug 1700834] Re: Intel i40e PF reset under load
** Changed in: linux (Ubuntu) Status: Incomplete => Confirmed -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1700834 Title: Intel i40e PF reset under load Status in linux package in Ubuntu: Confirmed Bug description: SRU Justification: Impact: Using an Intel i40e network device, under heavy traffic load with TSO enabled, the device will spontaneously reset itself and issue errors similar to the following: Jun 14 14:09:51 hostname kernel: [4253913.851053] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:53 hostname kernel: [4253915.476283] i40e :05:00.1: TX driver issue detected, PF reset issued Jun 14 14:09:54 hostname kernel: [4253917.411264] i40e :05:00.1: TX driver issue detected, PF reset issued This causes a full reset of the PF, which causes an interruption in traffic flow. In this case, these errors arise from a bug in the i40e device driver introduced by commit: commit 584a837e26408c66e87df87a022faa6a54c2b020 Author: Alexander DuyckDate: Wed Feb 17 11:02:50 2016 -0800 i40e/i40evf: Rewrite logic for 8 descriptor per packet check This patch was added to the Xenial kernel beginning with version 4.4.0-8.23. This bug does not manifest on any other Ubuntu kernel series. Fix: This error is resolved upstream by: commit 3f3f7cb875c0f621485644d4fd7453b0d37f00e4 Author: Alexander Duyck Date: Wed Mar 30 16:15:37 2016 -0700 i40e/i40evf: Limit TSO to 7 descriptors for payload instead of 8 per packet This fix was never backported into the Xenial 4.4 kernel series. Testcase: In this case, the issue occurs at a customer site using i40e based Intel network cards with SR-IOV enabled. Under heavy load, the card will reset itself as described. The customer has tested the 3f3f7cb875c patch in their environment and confirmed that it resolves the issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1700834/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp