Public bug reported:

Jeff,

We have analyzed this issue and found that 3 patches required to fix this
problem are missing. Without them, creating more than 3 vports (depending
on the customer's configuration) will result in a hang for FC or NVMe/FC
adapters.

Please pull these changes into the next hardware release.

These are the missing commits:
959239d [scsi] scsi: core: avoid pre-allocating big SGL for data
5418f2f [scsi] scsi: core: avoid pre-allocating big SGL for protection information
250f285 [nvme] scsi: lib/sg_pool.c: improve APIs for allocating sg pool

Laurie
---------------------------------------------
CONFIGURATION DETAILS         
Host OS with Support Pack  - Ubuntu 18.04.3 - HWE ()
Guest/VM OS Details        - 
System(s) Under Test       - IBM x3650 M4
Adapter(s) Under Test      - Prism 1-port
IPL Name                   - 
Active Profile ID          -
Network Configuration      - 
SAN Configuration          - 
OneCapture file attached   - 


BUG REPRODUCTION DETAILS      
Test Case ID or ATID - 
Reproducibility of Bug     - Always
Last Known Working Build   - 
Time to Reproduce Bug      - 5 mins
Steps To Reproduce Bug     - 

Create 126 vports on the Prism adapter and observe that the server runs out
of memory and hangs.
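
For reference, vports are normally created through the FC transport's sysfs
interface by writing a "wwpn:wwnn" pair to
/sys/class/fc_host/hostN/vport_create. The sketch below is not the script
used in this test. The base WWPN/WWNN values and the helper names are
hypothetical, chosen only for illustration, and real WWNs must be valid and
unique on the fabric.

```python
"""Sketch: request N vports on an FC host via sysfs (needs root on real hardware)."""
from pathlib import Path

# Hypothetical base names, not taken from the bug report.
BASE_WWPN = 0x2000000000000000
BASE_WWNN = 0x1000000000000000


def vport_spec(index, base_wwpn=BASE_WWPN, base_wwnn=BASE_WWNN):
    """Return the 'wwpn:wwnn' string for the index-th vport (16 hex digits each)."""
    return f"{base_wwpn + index:016x}:{base_wwnn + index:016x}"


def create_vports(host, count, sysfs_root="/sys/class/fc_host"):
    """Write one 'wwpn:wwnn' spec per vport to <sysfs_root>/<host>/vport_create."""
    target = Path(sysfs_root) / host / "vport_create"
    for i in range(1, count + 1):
        spec = vport_spec(i)
        target.write_text(spec)  # one write per vport creation request
        print(f"requested vport {i}: {spec}")
```

Under these assumptions, calling create_vports("host1", 126) as root would
issue the 126 creation requests described above. The host name must match
the physical port's entry under /sys/class/fc_host.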

root@ubuntu18043:~# free -h
              total        used        free      shared  buff/cache   available
Mem:            15G        1.7G         13G        1.3M        299M         13G
Swap:          2.0G          0B        2.0G

root@ubuntu18043:~# free -h
              total        used        free      shared  buff/cache   available
Mem:            15G        2.1G         12G        1.3M        299M         12G
Swap:          2.0G          0B        2.0G

root@ubuntu18043:~# free -h
              total        used        free      shared  buff/cache   available
Mem:            15G        6.1G        8.7G        1.5M        301M        8.7G
Swap:          2.0G          0B        2.0G

root@ubuntu18043:~# free -h
              total        used        free      shared  buff/cache   available
Mem:            15G         13G        1.4G        1.6M        303M        1.4G
Swap:          2.0G          0B        2.0G

root@ubuntu18043:~# free -h
              total        used        free      shared  buff/cache   available
Mem:            15G         14G        523M        1.7M        303M        528M
Swap:          2.0G          0B        2.0G

root@ubuntu18043:~# free -h
              total        used        free      shared  buff/cache   available
Mem:            15G         14G        147M         96K         48M        5.0M
Swap:          2.0G         38M        2.0G

root@ubuntu18043:~# free -h
              total        used        free      shared  buff/cache   available
Mem:            15G         14G        145M         96K         49M        3.6M
Swap:          2.0G         38M        2.0G

Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.166689] scsi host112: Emulex
LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0
Logical Link Speed: 8000 Mbps PCI resettable
Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.233464] lpfc 0000:11:00.0:
0:(111):1825 Vport Created.
Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.234071] scsi host1: vport-1:0-110
created via shost1 channel 0
Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.698396] scsi host113: Emulex
LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0
Logical Link Speed: 8000 Mbps PCI resettable
Nov 28 05:37:30 ubuntu18043 kernel: [ 1523.862582] lpfc 0000:11:00.0:
0:(112):1825 Vport Created.
Nov 28 05:37:30 ubuntu18043 kernel: [ 1523.863142] scsi host1: vport-1:0-111
created via shost1 channel 0
Nov 28 05:37:31 ubuntu18043 kernel: [ 1525.097335] scsi host114: Emulex
LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0
Logical Link Speed: 8000 Mbps PCI resettable

After creating 111 vports, the server ran out of memory.

Because the server was hung, no crash dump could be collected.
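
A rough per-vport estimate can be derived from the "used" column of the
free -h samples above. This is only a sketch: it assumes, which the report
does not state, that the first sample was taken before any vports existed,
and the free -h values are themselves rounded. Under those assumptions the
growth works out to something on the order of 100 MiB per vport.

```python
"""Sketch: estimate memory growth per vport from `free -h` 'used' values."""

# free -h on this release reports sizes in powers of 1024.
UNITS = {"B": 1, "K": 1024, "M": 1024**2, "G": 1024**3}


def parse_size(text):
    """Convert a `free -h` value such as '1.7G' or '523M' to bytes."""
    return float(text[:-1]) * UNITS[text[-1]]


# 'used' values from the first and last samples in the report.
first_used = parse_size("1.7G")
last_used = parse_size("14G")
vports = 111  # from the report; assumes the first sample had no vports yet

growth = last_used - first_used
per_vport_mib = growth / vports / 1024**2
print(f"growth: {growth / 1024**3:.1f} GiB, about {per_vport_mib:.0f} MiB per vport")
```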

** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1858840

Title:
  Vport creation leads to out of memory and server hung on Ubuntu
  18.04.3 on Broadcom FC HBAs

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1858840/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs
