I have a significant memory leak after upgrading from previous 4.10
series HWE kernels to the new 4.13 HWE series for Ubuntu 16.04 server
with Ethernet controller Intel X710 for 10GbE SFP+

# dmesg | grep i40e
[    1.625565] i40e: Intel(R) Ethernet Connection XL710 Network Driver - 
version 2.1.14-k
[    1.625565] i40e: Copyright (c) 2013 - 2014 Intel Corporation.
[    1.688509] i40e 0000:02:00.0: fw 5.40.47690 api 1.5 nvm 5.40 0x80002d35 
18.0.17
[    1.959126] i40e 0000:02:00.0: MAC address: 3c:fd:fe:1a:1d:e0
[    2.060021] i40e 0000:02:00.0: PCI-Express: Speed 8.0GT/s Width x4
[    2.060091] i40e 0000:02:00.0: PCI-Express bandwidth available for this 
device may be insufficient for optimal performance.
[    2.060096] i40e 0000:02:00.0: Please move the device to a different PCI-e 
link with more lanes and/or higher transfer rate.
[    2.085931] i40e 0000:02:00.0: Features: PF-id[0] VFs: 64 VSIs: 66 QP: 8 RSS 
FD_ATR FD_SB NTUPLE DCB VxLAN Geneve PTP VEPA
[    2.140793] i40e 0000:02:00.1: fw 5.40.47690 api 1.5 nvm 5.40 0x80002d35 
18.0.17
[    2.422817] i40e 0000:02:00.1: MAC address: 3c:fd:fe:1a:1d:e2
[    2.442684] i40e 0000:02:00.1: PCI-Express: Speed 8.0GT/s Width x4
[    2.442696] i40e 0000:02:00.1: PCI-Express bandwidth available for this 
device may be insufficient for optimal performance.
[    2.442715] i40e 0000:02:00.1: Please move the device to a different PCI-e 
link with more lanes and/or higher transfer rate.
[    2.443043] i40e 0000:02:00.1: Features: PF-id[1] VFs: 64 VSIs: 66 QP: 8 RSS 
FD_ATR FD_SB NTUPLE DCB VxLAN Geneve PTP VEPA
[    2.480205] i40e 0000:02:00.0 enp2s0f0: renamed from eth1
[    2.512183] i40e 0000:02:00.1 enp2s0f1: renamed from eth0
[    5.800514] i40e 0000:02:00.0 enp2s0f0: NIC Link is Up, 10 Gbps Full Duplex, 
Flow Control: None

** Attachment added: "201802_nperf_memory-week.png"
   
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1748408/+attachment/5063316/+files/201802_nperf_memory-week.png

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1748408

Title:
  Servers going OOM after updating kernel from 4.10 to 4.13

Status in linux package in Ubuntu:
  Triaged
Status in linux source package in Artful:
  Triaged

Bug description:
  We are seeing this on multiple servers after upgrading from previous
  4.10 series HWE kernels to the new 4.13 HWE series. With the new
  kernel, free memory is continously decreasing at a high rate and the
  servers start swapping and finally OOMing services within days. With
  the 4.10 kernel, decrease of free memory is slower and stabilizes
  after a while.

  Latest kernel tested is linux-image-4.13.0-32-generic but the issue
  also affects older kernels from that series, tested back to linux-
  image-4.13.0-19-generic. No issue with linux-image-4.10.0-42-generic.

  The servers are running as OpenStack controller nodes using either
  Ocata or Pike UCA plus ceph. See attached graph for the memory
  behaviour.

  ProblemType: Bug
  DistroRelease: Ubuntu 16.04
  Package: linux-image-4.13.0-32-generic 4.13.0-32.35~16.04.1
  ProcVersionSignature: Ubuntu 4.13.0-32.35~16.04.1-generic 4.13.13
  Uname: Linux 4.13.0-32-generic x86_64
  ApportVersion: 2.20.1-0ubuntu2.15
  Architecture: amd64
  Date: Fri Feb  9 09:45:50 2018
  ProcEnviron:
   LANGUAGE=en_US:
   TERM=screen
   PATH=(custom, no user)
   LANG=en_US.utf8
   SHELL=/bin/bash
  SourcePackage: linux-hwe
  UpgradeStatus: No upgrade log present (probably fresh install)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1748408/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to