Public bug reported: Jeff,
We have analyzed this issue and see that there are 3 missing patches required to fix this problem. Otherwise vport creation >3 (depending on the customers configuration) will result in a hang for FC or NVMe/FC adapters. Please pull these changes into the next hardware release. These are the missing commits: 959239d [scsi] scsi: core: avoid pre-allocating big SGL for data 5418f2f [scsi] scsi: core: avoid pre-allocating big SGL for protection information 250f285 [nvme] scsi: lib/sg_pool.c: improve APIs for allocating sg pool Laurie --------------------------------------------- CONFIGURATION DETAILS Host OS with Support Pack - Ubuntu 18.04.3 - HWE () Guest/VM OS Details - System(s) Under Test - IBM x3650 M4 Adapter(s) Under Test - Prism 1-port IPL Name - Active Profile ID - Network Configuration - SAN Configuration - OneCapture file attached - BUG REPRODUCTION DETAILS Test Case ID or ATID - Reproducibility of Bug - Always Last Known Working Build - Time to Reproduce Bug - 5 mins Steps To Reproduce Bug - Create 126 vports on Prism adapter, observe that server goes out of memory and hung. root@ubuntu18043:~# free -h total used free shared buff/cache available Mem: 15G 1.7G 13G 1.3M 299M 13G Swap: 2.0G 0B 2.0G root@ubuntu18043:~# free -h total used free shared buff/cache available Mem: 15G 2.1G 12G 1.3M 299M 12G Swap: 2.0G 0B 2.0G root@ubuntu18043:~# free -h total used free shared buff/cache available Mem: 15G 6.1G 8.7G 1.5M 301M 8.7G Swap: 2.0G 0B 2.0G root@ubuntu18043:~# free -h total used free shared buff/cache available Mem: 15G 13G 1.4G 1.6M 303M 1.4G Swap: 2.0G 0B 2.0G root@ubuntu18043:~# free -h total used free shared buff/cache available Mem: 15G 14G 523M 1.7M 303M 528M Swap: 2.0G 0B 2.0G root@ubuntu18043:~# free -h total used free shared buff/cache available Mem: 15G 14G 147M 96K 48M 5.0M Swap: 2.0G 38M 2.0G root@ubuntu18043:~# free -h total used free shared buff/cache available Mem: 15G 14G 145M 96K 49M 3.6M Swap: 2.0G 38M 2.0G Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.166689] scsi host112: Emulex LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0 Logical Link Speed: 8000 Mbps PCI resettable Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.233464] lpfc 0000:11:00.0: 0:(111):1825 Vport Created. Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.234071] scsi host1: vport-1:0-110 created via shost1 channel 0 Nov 28 05:37:29 ubuntu18043 kernel: [ 1523.698396] scsi host113: Emulex LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0 Logical Link Speed: 8000 Mbps PCI resettable Nov 28 05:37:30 ubuntu18043 kernel: [ 1523.862582] lpfc 0000:11:00.0: 0:(112):1825 Vport Created. Nov 28 05:37:30 ubuntu18043 kernel: [ 1523.863142] scsi host1: vport-1:0-111 created via shost1 channel 0 Nov 28 05:37:31 ubuntu18043 kernel: [ 1525.097335] scsi host114: Emulex LPe36000 32Gb PCIe Fibre Channel Adapter on PCI bus 11 device 00 irq 26 port 0 Logical Link Speed: 8000 Mbps PCI resettable After creating 111 vports, server ran out of memory. As server is in hung state, no Crash dump got collected. ** Affects: linux (Ubuntu) Importance: Undecided Status: New -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1858840 Title: Vport creation leads to out of memory and server hung on Ubuntu 18.04.3 on Broadcom FC HBAs To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1858840/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs