Public bug reported: We get kernel panic when we install a bond interface with two of Mellanox mlx5 NIC's and try to unload the bonding module.
scenario: 1. network interfaces configuration # cat /etc/network/interfaces # This file describes the network interfaces available on your system # and how to activate them. For more information, see interfaces(5). # The loopback network interface auto lo iface lo inet loopback # The primary network interface auto eno1 iface eno1 inet dhcp #ens1f0 auto ens1f0 iface ens1f0 inet manual bond-master bond1 auto ens1f1 iface ens1f1 inet manual bond-master bond1 auto bond1 iface bond1 inet static address 27.65.194.1 netmask 255.255.255.0 bond-slaves ens1f0 ens1f1 bond-mode 1 bond-primary ens1f0 bond-miimon 100 iface bond1 inet6 static address 907c:c828:4d05:5bf8:0000:0000:0000:0002/127 # cat /etc/modprobe.d/bonding.conf options bonding mode=1 2. ifup bond1 3. modprobe -r bonding 4. OOPS Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.443796] Oops: 0000 [#1] SMP Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.444686] Modules linked in: mlx5_ib mlx5_core bonding mlx4_ib ib_core mlx4_en mlx4_core nfsv3 nfs fscache xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat libcrc32c nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ipmi_ssif intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate joydev input_leds intel_rapl_perf serio_raw lpc_ich hpilo ipmi_si ioatdma ipmi_devintf dca ipmi_msghandler shpchp mac_hid acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.469445] x_tables autofs4 hid_generic psmouse usbhid hid pata_acpi tg3 hpsa ptp scsi_transport_sas devlink pps_core wmi fjes [last unloaded: mlx5_core] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.473672] CPU: 23 PID: 4846 Comm: ifenslave Not tainted 4.10.0-9-generic #11-Ubuntu Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.475894] Hardware name: HP ProLiant DL380p Gen8, BIOS P70 07/01/2015 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.478038] task: ffff9b8394e31680 task.stack: ffffb2ed054f4000 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.533408] RIP: 0010:mlx5_lag_netdev_event+0x1e6/0x230 [mlx5_core] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.590069] RSP: 0018:ffffb2ed054f7bd0 EFLAGS: 00010202 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.646302] RAX: 0000000000000002 RBX: ffff9b7f825f6000 RCX: 0000000000000000 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.701966] RDX: 0000000000000000 RSI: 0000000400000400 RDI: ffff9b7f840a00b0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.756395] RBP: ffffb2ed054f7c18 R08: ffffffffc02fb000 R09: ffff9b7fa3117ea8 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.810250] R10: 0000000000000000 R11: 000000000051a84e R12: 0000000000000001 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.863569] R13: 0000000000000004 R14: ffff9b7fa3117ea8 R15: ffffffff8992b108 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.916725] FS: 00007fc6cca0e700(0000) GS:ffff9b83af0c0000(0000) knlGS:0000000000000000 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.020509] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.072342] CR2: 0000000000000002 CR3: 0000000817013000 CR4: 00000000001406e0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.127206] Call Trace: Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.180602] notifier_call_chain+0x4a/0x70 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.235310] raw_notifier_call_chain+0x16/0x20 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.287923] call_netdevice_notifiers_info+0x35/0x60 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.342951] netdev_upper_dev_unlink+0x72/0xb0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.395322] bond_upper_dev_unlink.isra.42+0x18/0x40 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.446520] __bond_release_one+0x170/0x550 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.499303] ? netdev_info+0x6c/0x90 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.550677] bond_release+0x10/0x20 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.602044] bond_option_slaves_set+0xe6/0x130 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.653333] __bond_opt_set+0xe2/0x3a0 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.703257] bond_opt_tryset_rtnl+0x56/0xa0 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.751799] bonding_sysfs_store_option+0x35/0x70 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.799933] dev_attr_store+0x18/0x30 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.846332] sysfs_kf_write+0x37/0x40 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.890905] kernfs_fop_write+0x11d/0x1b0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.935415] __vfs_write+0x18/0x40 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.976508] vfs_write+0xb5/0x1a0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.018692] SyS_write+0x55/0xc0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.056645] entry_SYSCALL_64_fastpath+0x1e/0xad Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.096526] RIP: 0033:0x7fc6cc52bd20 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.136659] RSP: 002b:00007ffc13c78d18 EFLAGS: 00000246 ORIG_RAX: 0000000000000001 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.219432] RAX: ffffffffffffffda RBX: 00007fc6cc7f5b58 RCX: 00007fc6cc52bd20 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.262617] RDX: 0000000000000008 RSI: 000056541f282ea0 RDI: 0000000000000001 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.306153] RBP: 00007fc6cc7f5b00 R08: 00007fc6cc7f5c78 R09: 000056541f04b8a8 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.349657] R10: 000056541f282ea0 R11: 0000000000000246 R12: 00007fc6cc7f5b58 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.392097] R13: 0000000000002010 R14: 00007fc6cc7f5b58 R15: 000000000000270f Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.431871] Code: 39 be 68 ff ff ff 74 5b 4d 39 be 78 ff ff ff 74 48 83 45 c0 01 eb cb 8b 45 c4 85 c0 0f 84 42 ff ff ff 48 8b 45 b8 48 85 c0 74 03 <44> 8b 28 83 7d c0 02 75 21 83 7d c4 03 75 1b 41 8d 45 fc 83 f8 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.558999] RIP: mlx5_lag_netdev_event+0x1e6/0x230 [mlx5_core] RSP: ffffb2ed054f7bd0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.645369] CR2: 0000000000000002 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.687127] ---[ end trace 92901adbd279c621 ]--- Mar 12 16:45:13 qa-h-vrt-038 systemd[1]: Reloading. we already fix the issue and we are going to send it upstream, I will send it also to canonical kernel team mailing list. Thanks, Talat ** Affects: linux (Ubuntu) Importance: Undecided Status: Incomplete -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1676786 Title: [Zesty] mlx5_core Kernel oops with bonding mode 1 and 6 Status in linux package in Ubuntu: Incomplete Bug description: We get kernel panic when we install a bond interface with two of Mellanox mlx5 NIC's and try to unload the bonding module. scenario: 1. network interfaces configuration # cat /etc/network/interfaces # This file describes the network interfaces available on your system # and how to activate them. For more information, see interfaces(5). # The loopback network interface auto lo iface lo inet loopback # The primary network interface auto eno1 iface eno1 inet dhcp #ens1f0 auto ens1f0 iface ens1f0 inet manual bond-master bond1 auto ens1f1 iface ens1f1 inet manual bond-master bond1 auto bond1 iface bond1 inet static address 27.65.194.1 netmask 255.255.255.0 bond-slaves ens1f0 ens1f1 bond-mode 1 bond-primary ens1f0 bond-miimon 100 iface bond1 inet6 static address 907c:c828:4d05:5bf8:0000:0000:0000:0002/127 # cat /etc/modprobe.d/bonding.conf options bonding mode=1 2. ifup bond1 3. modprobe -r bonding 4. OOPS Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.443796] Oops: 0000 [#1] SMP Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.444686] Modules linked in: mlx5_ib mlx5_core bonding mlx4_ib ib_core mlx4_en mlx4_core nfsv3 nfs fscache xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat libcrc32c nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ipmi_ssif intel_rapl sb_edac edac_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate joydev input_leds intel_rapl_perf serio_raw lpc_ich hpilo ipmi_si ioatdma ipmi_devintf dca ipmi_msghandler shpchp mac_hid acpi_power_meter nfsd auth_rpcgss nfs_acl lockd grace sunrpc ip_tables Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.469445] x_tables autofs4 hid_generic psmouse usbhid hid pata_acpi tg3 hpsa ptp scsi_transport_sas devlink pps_core wmi fjes [last unloaded: mlx5_core] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.473672] CPU: 23 PID: 4846 Comm: ifenslave Not tainted 4.10.0-9-generic #11-Ubuntu Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.475894] Hardware name: HP ProLiant DL380p Gen8, BIOS P70 07/01/2015 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.478038] task: ffff9b8394e31680 task.stack: ffffb2ed054f4000 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.533408] RIP: 0010:mlx5_lag_netdev_event+0x1e6/0x230 [mlx5_core] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.590069] RSP: 0018:ffffb2ed054f7bd0 EFLAGS: 00010202 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.646302] RAX: 0000000000000002 RBX: ffff9b7f825f6000 RCX: 0000000000000000 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.701966] RDX: 0000000000000000 RSI: 0000000400000400 RDI: ffff9b7f840a00b0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.756395] RBP: ffffb2ed054f7c18 R08: ffffffffc02fb000 R09: ffff9b7fa3117ea8 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.810250] R10: 0000000000000000 R11: 000000000051a84e R12: 0000000000000001 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.863569] R13: 0000000000000004 R14: ffff9b7fa3117ea8 R15: ffffffff8992b108 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 540.916725] FS: 00007fc6cca0e700(0000) GS:ffff9b83af0c0000(0000) knlGS:0000000000000000 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.020509] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.072342] CR2: 0000000000000002 CR3: 0000000817013000 CR4: 00000000001406e0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.127206] Call Trace: Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.180602] notifier_call_chain+0x4a/0x70 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.235310] raw_notifier_call_chain+0x16/0x20 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.287923] call_netdevice_notifiers_info+0x35/0x60 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.342951] netdev_upper_dev_unlink+0x72/0xb0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.395322] bond_upper_dev_unlink.isra.42+0x18/0x40 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.446520] __bond_release_one+0x170/0x550 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.499303] ? netdev_info+0x6c/0x90 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.550677] bond_release+0x10/0x20 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.602044] bond_option_slaves_set+0xe6/0x130 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.653333] __bond_opt_set+0xe2/0x3a0 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.703257] bond_opt_tryset_rtnl+0x56/0xa0 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.751799] bonding_sysfs_store_option+0x35/0x70 [bonding] Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.799933] dev_attr_store+0x18/0x30 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.846332] sysfs_kf_write+0x37/0x40 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.890905] kernfs_fop_write+0x11d/0x1b0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.935415] __vfs_write+0x18/0x40 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 541.976508] vfs_write+0xb5/0x1a0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.018692] SyS_write+0x55/0xc0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.056645] entry_SYSCALL_64_fastpath+0x1e/0xad Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.096526] RIP: 0033:0x7fc6cc52bd20 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.136659] RSP: 002b:00007ffc13c78d18 EFLAGS: 00000246 ORIG_RAX: 0000000000000001 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.219432] RAX: ffffffffffffffda RBX: 00007fc6cc7f5b58 RCX: 00007fc6cc52bd20 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.262617] RDX: 0000000000000008 RSI: 000056541f282ea0 RDI: 0000000000000001 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.306153] RBP: 00007fc6cc7f5b00 R08: 00007fc6cc7f5c78 R09: 000056541f04b8a8 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.349657] R10: 000056541f282ea0 R11: 0000000000000246 R12: 00007fc6cc7f5b58 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.392097] R13: 0000000000002010 R14: 00007fc6cc7f5b58 R15: 000000000000270f Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.431871] Code: 39 be 68 ff ff ff 74 5b 4d 39 be 78 ff ff ff 74 48 83 45 c0 01 eb cb 8b 45 c4 85 c0 0f 84 42 ff ff ff 48 8b 45 b8 48 85 c0 74 03 <44> 8b 28 83 7d c0 02 75 21 83 7d c4 03 75 1b 41 8d 45 fc 83 f8 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.558999] RIP: mlx5_lag_netdev_event+0x1e6/0x230 [mlx5_core] RSP: ffffb2ed054f7bd0 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.645369] CR2: 0000000000000002 Mar 12 16:44:32 qa-h-vrt-038 kernel: [ 542.687127] ---[ end trace 92901adbd279c621 ]--- Mar 12 16:45:13 qa-h-vrt-038 systemd[1]: Reloading. we already fix the issue and we are going to send it upstream, I will send it also to canonical kernel team mailing list. Thanks, Talat To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1676786/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp