Hello Jon, I tested this with 4.18.10 and it passes the fine-tuned test I have. I'll run it for a while to be sure, but looks good.
Thanks, -- Juhamatti to 1. marrask. 2018 klo 2.15 Jon Maloy ([email protected]) kirjoitti: > > Hi Juhamatti, > The second is probably the one that was fixed by commit d3092b2efca1c > ("tipc: fix unsafe rcu locking when accessing publication list") which was > delivered to 4.18 around a month ago. Not sure how far back this has been > applied, but if you look at the very latest 4.15 upstream you may be lucky. > > Unfortunately I don't recognize the first one, -it seems to be a timer > interrupt deleting a subscription object that is still locked while sending > an event. What I can say, is that the whole topology server was redesigned in > Linux 4.17, among other things because this kind of problems never seemed to > end. > > Do you have any possibility to upgrade your kernel to 4.19 and try again? > There has been a few bug fixes added during 4.18/4.19 that you may benefit > from. Otherwise I may have to look closer at this one, and see what can be > done. > > Regards > ///jon > > > > -----Original Message----- > > From: [email protected] <[email protected]> > > Sent: 31-Oct-18 10:23 > > To: [email protected] > > Cc: [email protected] > > Subject: Re: [tipc-discussion] TIPC scalability viewpoints > > > > Hello again, > > > > Here is another issue found in the same setup, looks a bit different. > > Please see below. > > > > BR, > > -- > > Juhamatti > > > > [ 6276.191737] Unable to remove publication from failed node [ 6276.191737] > > (type=18888, lower=43818, node=0x100136c, ref=936542876, > > key=936542877) [ 6276.205086] BUG: unable to handle kernel NULL pointer > > dereference at 0000000000000088 [ 6276.213016] IP: > > __call_rcu.constprop.69+0x25/0x210 > > [ 6276.217837] PGD 0 P4D 0 > > [ 6276.220405] Oops: 0002 [#1] SMP NOPTI [ 6276.224103] Modules linked in: > > tipc ip6_udp_tunnel udp_tunnel xt_CHECKSUM iptable_mangle > > ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 > > nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack xt_tcpudp bridge stp llc > > veth iptable_filter ipmi_ssif > > nls_iso8859_1 intel_rapl pnd2_edac x86_pkg_temp_thermal > > intel_powerclamp coretemp kvm_intel asix kvm usbnet input_leds irqbypass > > joydev intel_cstate mii intel_rapl_perf mac_hid ipmi_si ipmi_devintf shpchp > > ipmi_msghandler sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core > > iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables > > autofs4 > > btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy > > async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath > > linear hid_generic usbhid hid crct10dif_pclmul crc32_pclmul [ 6276.295019] > > ghash_clmulni_intel pcbc ast ttm aesni_intel drm_kms_helper aes_x86_64 > > syscopyarea crypto_simd glue_helper sysfillrect sysimgblt cryptd ixgbe > > fb_sys_fops igb i40e drm ahci dca i2c_algo_bit mdio libahci ptp i2c_ismt > > pps_core [ 6276.316114] CPU: 15 PID: 100 Comm: ksoftirqd/15 Not tainted > > 4.15.0-38-generic #41-Ubuntu [ 6276.324259] Hardware name: Supermicro > > Super Server/A2SDV-16C-TLN5F, BIOS 1.0b 12/12/2017 [ 6276.332413] RIP: > > 0010:__call_rcu.constprop.69+0x25/0x210 > > [ 6276.337773] RSP: 0018:ffffb215c34cbbd0 EFLAGS: 00010246 [ 6276.343033] > > RAX: 0000000000000076 RBX: ffff9a5046cdd110 RCX: 0000000000000001 [ > > 6276.350213] RDX: ffffffffbe4a9200 RSI: 0000000000000080 RDI: > > 0000000000000000 [ 6276.357402] RBP: ffffb215c34cbc08 R08: > > 0000000000000000 R09: 0000000000001550 [ 6276.364595] R10: > > 00000000000012df R11: 00000000ffffffff R12: ffff9a4fe3bbfc80 [ 6276.371778] > > R13: ffff9a5221a54740 R14: ffffffffbe4a9200 R15: 0000000000000080 [ > > 6276.378963] FS: 0000000000000000(0000) GS:ffff9a561f5c0000(0000) > > knlGS:0000000000000000 > > [ 6276.387096] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ > > 6276.392878] CR2: 0000000000000088 CR3: 00000006f4818000 CR4: > > 00000000003406e0 [ 6276.400057] Call Trace: > > [ 6276.402544] kfree_call_rcu+0x1a/0x20 [ 6276.406256] > > tipc_publ_notify+0xae/0x190 [tipc] [ 6276.410840] > > tipc_node_write_unlock+0xf3/0x100 [tipc] [ 6276.415979] > > tipc_node_link_down+0x99/0x140 [tipc] [ 6276.420858] > > tipc_node_timeout+0x126/0x130 [tipc] [ 6276.425644] ? > > tipc_node_calculate_timer.isra.13+0x60/0x60 [tipc] [ 6276.431795] > > call_timer_fn+0x30/0x130 [ 6276.435487] run_timer_softirq+0x3fb/0x450 [ > > 6276.439614] ? __switch_to_asm+0x34/0x70 [ 6276.443573] ? > > __switch_to+0xad/0x500 [ 6276.447262] ? __switch_to_asm+0x40/0x70 [ > > 6276.451219] ? __switch_to_asm+0x34/0x70 [ 6276.455178] ? > > __switch_to_asm+0x40/0x70 [ 6276.459141] __do_softirq+0xe4/0x2bb [ > > 6276.462750] run_ksoftirqd+0x22/0x60 [ 6276.466364] > > smpboot_thread_fn+0xfc/0x170 [ 6276.470397] kthread+0x121/0x140 [ > > 6276.473645] ? sort_range+0x30/0x30 [ 6276.477153] ? > > kthread_create_worker_on_cpu+0x70/0x70 > > [ 6276.482237] ret_from_fork+0x35/0x40 > > [ 6276.485835] Code: b7 f8 ff 0f 1f 00 0f 1f 44 00 00 55 48 89 e5 41 > > 57 41 56 41 55 41 54 49 89 ff 53 49 89 d6 48 83 ec 10 83 e7 07 0f 85 dd 00 > > 00 00 > > <49> 89 77 08 49 c7 07 00 00 00 00 9c 58 0f 1f 44 00 00 49 > > 89 c5 > > [ 6276.504818] RIP: __call_rcu.constprop.69+0x25/0x210 RSP: > > ffffb215c34cbbd0 [ 6276.511642] CR2: 0000000000000088 [ 6276.515000] ---[ > > end trace f418a9afa145bb5d ]--- [ 6276.528130] Kernel panic - not syncing: > > Fatal exception in interrupt [ 6276.534610] Kernel Offset: 0x3c000000 from > > 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff) > > [ 6276.553092] ---[ end Kernel panic - not syncing: Fatal exception in > > interrupt > > [ 6276.560285] WARNING: CPU: 15 PID: 100 at > > /build/linux-CuEUJv/linux-4.15.0/kernel/sched/core.c:1192 > > set_task_cpu+0x158/0x160 > > [ 6276.571459] Modules linked in: tipc ip6_udp_tunnel udp_tunnel > > xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 > > iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat > > nf_conntrack xt_tcpudp bridge stp llc veth iptable_filter ipmi_ssif > > nls_iso8859_1 intel_rapl pnd2_edac x86_pkg_temp_thermal > > intel_powerclamp coretemp kvm_intel asix kvm usbnet input_leds irqbypass > > joydev intel_cstate mii intel_rapl_perf mac_hid ipmi_si ipmi_devintf shpchp > > ipmi_msghandler sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core > > iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables > > autofs4 > > btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy > > async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath > > linear hid_generic usbhid hid crct10dif_pclmul crc32_pclmul [ 6276.642267] > > ghash_clmulni_intel pcbc ast ttm aesni_intel drm_kms_helper aes_x86_64 > > syscopyarea crypto_simd glue_helper sysfillrect sysimgblt cryptd ixgbe > > fb_sys_fops igb i40e drm ahci dca i2c_algo_bit mdio libahci ptp i2c_ismt > > pps_core > > [ 6276.663306] CPU: 15 PID: 100 Comm: ksoftirqd/15 Tainted: G D > > 4.15.0-38-generic #41-Ubuntu > > [ 6276.672734] Hardware name: Supermicro Super Server/A2SDV-16C-TLN5F, > > BIOS 1.0b 12/12/2017 [ 6276.683335] RIP: 0010:set_task_cpu+0x158/0x160 [ > > 6276.690288] RSP: 0018:ffff9a561f5c3dc8 EFLAGS: 00010006 [ 6276.698025] > > RAX: 0000000000000200 RBX: ffff9a50a7565900 RCX: 0000000000000000 [ > > 6276.707673] RDX: 0000000000000000 RSI: 0000000000000000 RDI: > > ffff9a50a7565900 [ 6276.717305] RBP: ffff9a561f5c3de8 R08: > > 0000000000000000 R09: 000000000000ffff [ 6276.726926] R10: ffff9a561f5c3c90 > > R11: 0000000000000000 R12: ffff9a50a756642c [ 6276.736527] R13: > > 0000000000000000 R14: 0000000000000000 R15: 0000000000022880 [ > > 6276.746122] FS: 0000000000000000(0000) GS:ffff9a561f5c0000(0000) > > knlGS:0000000000000000 > > [ 6276.756700] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ > > 6276.764891] CR2: 0000000000000088 CR3: 00000006f4818000 CR4: > > 00000000003406e0 [ 6276.774418] Call Trace: > > [ 6276.779188] <IRQ> > > [ 6276.783446] try_to_wake_up+0x159/0x480 [ 6276.789458] > > default_wake_function+0x12/0x20 [ 6276.795834] > > autoremove_wake_function+0x12/0x40 > > [ 6276.802444] __wake_up_common+0x73/0x130 [ 6276.808426] > > __wake_up_common_lock+0x80/0xc0 [ 6276.814730] > > __wake_up+0x13/0x20 [ 6276.819987] > > wake_up_klogd_work_func+0x40/0x60 [ 6276.826444] > > irq_work_run_list+0x52/0x80 [ 6276.832376] irq_work_run+0x2c/0x40 [ > > 6276.837876] flush_smp_call_function_queue+0x66/0xf0 > > [ 6276.844845] generic_smp_call_function_single_interrupt+0x13/0x30 > > [ 6276.852952] smp_call_function_interrupt+0x3e/0xd0 > > [ 6276.859743] call_function_interrupt+0x84/0x90 [ 6276.866171] </IRQ> [ > > 6276.870234] RIP: 0010:panic+0x1fe/0x244 [ 6276.876021] RSP: > > 0018:ffffb215c34cb918 EFLAGS: 00000286 ORIG_RAX: > > ffffffffffffff03 > > [ 6276.885572] RAX: 0000000000000041 RBX: 0000000000000000 RCX: > > 0000000000000006 [ 6276.894699] RDX: 0000000000000000 RSI: > > 0000000000000092 RDI: ffff9a561f5d6490 [ 6276.903823] RBP: > > ffffb215c34cb990 R08: 0000000000000000 R09: 000000000000157e [ > > 6276.912957] R10: 0000000000000008 R11: 00000000ffffffff R12: > > 0000000000000000 [ 6276.922071] R13: 0000000000000000 R14: > > 0000000000000009 R15: 0000000000000002 [ 6276.931172] ? > > panic+0x1f7/0x244 [ 6276.936358] oops_end+0xb6/0xd0 [ 6276.941463] > > no_context+0x17c/0x440 [ 6276.946934] ? __const_udelay+0x41/0x50 [ > > 6276.952753] __bad_area_nosemaphore+0xf3/0x1b0 [ 6276.959186] > > bad_area_nosemaphore+0x14/0x20 [ 6276.965378] > > __do_page_fault+0xd4/0x4d0 [ 6276.971226] ? up+0x32/0x50 [ 6276.976035] > > do_page_fault+0x2e/0xe0 [ 6276.981632] page_fault+0x25/0x50 [ > > 6276.986954] RIP: 0010:__call_rcu.constprop.69+0x25/0x210 > > [ 6276.994290] RSP: 0018:ffffb215c34cbbd0 EFLAGS: 00010246 [ 6277.001543] > > RAX: 0000000000000076 RBX: ffff9a5046cdd110 RCX: 0000000000000001 [ > > 6277.010723] RDX: ffffffffbe4a9200 RSI: 0000000000000080 RDI: > > 0000000000000000 [ 6277.019901] RBP: ffffb215c34cbc08 R08: > > 0000000000000000 R09: 0000000000001550 [ 6277.029015] R10: > > 00000000000012df R11: 00000000ffffffff R12: ffff9a4fe3bbfc80 [ 6277.038070] > > R13: ffff9a5221a54740 R14: ffffffffbe4a9200 R15: 0000000000000080 [ > > 6277.047129] kfree_call_rcu+0x1a/0x20 [ 6277.052708] > > tipc_publ_notify+0xae/0x190 [tipc] [ 6277.059161] > > tipc_node_write_unlock+0xf3/0x100 [tipc] [ 6277.066123] > > tipc_node_link_down+0x99/0x140 [tipc] [ 6277.072792] > > tipc_node_timeout+0x126/0x130 [tipc] [ 6277.079348] ? > > tipc_node_calculate_timer.isra.13+0x60/0x60 [tipc] [ 6277.087259] > > call_timer_fn+0x30/0x130 [ 6277.092670] run_timer_softirq+0x3fb/0x450 [ > > 6277.098438] ? __switch_to_asm+0x34/0x70 [ 6277.103961] ? > > __switch_to+0xad/0x500 [ 6277.109137] ? __switch_to_asm+0x40/0x70 [ > > 6277.114493] ? __switch_to_asm+0x34/0x70 [ 6277.119765] ? > > __switch_to_asm+0x40/0x70 [ 6277.125006] __do_softirq+0xe4/0x2bb [ > > 6277.129861] run_ksoftirqd+0x22/0x60 [ 6277.134699] > > smpboot_thread_fn+0xfc/0x170 [ 6277.139965] kthread+0x121/0x140 [ > > 6277.144439] ? sort_range+0x30/0x30 [ 6277.149163] ? > > kthread_create_worker_on_cpu+0x70/0x70 > > [ 6277.155452] ret_from_fork+0x35/0x40 > > [ 6277.160258] Code: 32 ff ff ff 80 8b 64 08 00 00 04 e9 4e ff ff ff 0f 0b > > e9 f2 fe > > ff ff f7 43 5c fd ff ff ff 0f 84 fc fe ff ff 0f 0b e9 > > f5 fe ff ff <0f> 0b e9 ff fe ff ff 90 0f 1f 44 00 00 55 48 89 e5 41 56 > > 41 55 > > [ 6277.181837] ---[ end trace f418a9afa145bb5e ]--- [ 6277.187806] > > ------------[ > > cut here ]------------ [ 6277.193765] sched: Unexpected reschedule of > > offline > > CPU#0! > > [ 6277.200584] WARNING: CPU: 15 PID: 100 at > > /build/linux-CuEUJv/linux-4.15.0/arch/x86/kernel/smp.c:128 > > native_smp_send_reschedule+0x3a/0x40 > > [ 6277.214247] Modules linked in: tipc ip6_udp_tunnel udp_tunnel > > xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 > > iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat > > nf_conntrack xt_tcpudp bridge stp llc veth iptable_filter ipmi_ssif > > nls_iso8859_1 intel_rapl pnd2_edac x86_pkg_temp_thermal > > intel_powerclamp coretemp kvm_intel asix kvm usbnet input_leds irqbypass > > joydev intel_cstate mii intel_rapl_perf mac_hid ipmi_si ipmi_devintf shpchp > > ipmi_msghandler sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core > > iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables > > autofs4 > > btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy > > async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath > > linear hid_generic usbhid hid crct10dif_pclmul crc32_pclmul [ 6277.291150] > > ghash_clmulni_intel pcbc ast ttm aesni_intel drm_kms_helper aes_x86_64 > > syscopyarea crypto_simd glue_helper sysfillrect sysimgblt cryptd ixgbe > > fb_sys_fops igb i40e drm ahci dca i2c_algo_bit mdio libahci ptp i2c_ismt > > pps_core > > [ 6277.315672] CPU: 15 PID: 100 Comm: ksoftirqd/15 Tainted: G D W > > 4.15.0-38-generic #41-Ubuntu > > [ 6277.326902] Hardware name: Supermicro Super Server/A2SDV-16C-TLN5F, > > BIOS 1.0b 12/12/2017 [ 6277.336859] RIP: > > 0010:native_smp_send_reschedule+0x3a/0x40 > > [ 6277.344224] RSP: 0018:ffff9a561f5c3de8 EFLAGS: 00010086 [ 6277.351330] > > RAX: 0000000000000000 RBX: ffff9a50a7565900 RCX: 0000000000000006 [ > > 6277.360358] RDX: 0000000000000007 RSI: 0000000000000092 RDI: > > ffff9a561f5d6490 [ 6277.369378] RBP: ffff9a561f5c3de8 R08: > > 0000000000000000 R09: 00000000000015cd [ 6277.378405] R10: > > ffff9a561f5c3dc8 R11: 00000000ffffffff R12: ffff9a50a756642c [ 6277.387428] > > R13: 0000000000000004 R14: 0000000000000046 R15: 0000000000022880 [ > > 6277.396489] FS: 0000000000000000(0000) GS:ffff9a561f5c0000(0000) > > knlGS:0000000000000000 > > [ 6277.406505] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ > > 6277.414171] CR2: 0000000000000088 CR3: 00000006f4818000 CR4: > > 00000000003406e0 [ 6277.423241] Call Trace: > > [ 6277.427619] <IRQ> > > [ 6277.431550] try_to_wake_up+0x3c6/0x480 [ 6277.437319] > > default_wake_function+0x12/0x20 [ 6277.443503] > > autoremove_wake_function+0x12/0x40 > > [ 6277.449960] __wake_up_common+0x73/0x130 [ 6277.455811] > > __wake_up_common_lock+0x80/0xc0 [ 6277.462004] > > __wake_up+0x13/0x20 [ 6277.467146] > > wake_up_klogd_work_func+0x40/0x60 [ 6277.473510] > > irq_work_run_list+0x52/0x80 [ 6277.479353] irq_work_run+0x2c/0x40 [ > > 6277.484753] flush_smp_call_function_queue+0x66/0xf0 > > [ 6277.491642] generic_smp_call_function_single_interrupt+0x13/0x30 > > [ 6277.499674] smp_call_function_interrupt+0x3e/0xd0 > > [ 6277.506405] call_function_interrupt+0x84/0x90 [ 6277.512787] </IRQ> [ > > 6277.516805] RIP: 0010:panic+0x1fe/0x244 [ 6277.522558] RSP: > > 0018:ffffb215c34cb918 EFLAGS: 00000286 ORIG_RAX: > > ffffffffffffff03 > > [ 6277.532076] RAX: 0000000000000041 RBX: 0000000000000000 RCX: > > 0000000000000006 [ 6277.541160] RDX: 0000000000000000 RSI: > > 0000000000000092 RDI: ffff9a561f5d6490 [ 6277.550232] RBP: > > ffffb215c34cb990 R08: 0000000000000000 R09: 000000000000157e [ > > 6277.559296] R10: 0000000000000008 R11: 00000000ffffffff R12: > > 0000000000000000 [ 6277.568356] R13: 0000000000000000 R14: > > 0000000000000009 R15: 0000000000000002 [ 6277.577422] ? > > panic+0x1f7/0x244 [ 6277.582578] oops_end+0xb6/0xd0 [ 6277.587644] > > no_context+0x17c/0x440 [ 6277.593041] ? __const_udelay+0x41/0x50 [ > > 6277.598779] __bad_area_nosemaphore+0xf3/0x1b0 [ 6277.605124] > > bad_area_nosemaphore+0x14/0x20 [ 6277.611212] > > __do_page_fault+0xd4/0x4d0 [ 6277.616958] ? up+0x32/0x50 [ 6277.621667] > > do_page_fault+0x2e/0xe0 [ 6277.627168] page_fault+0x25/0x50 [ > > 6277.632417] RIP: 0010:__call_rcu.constprop.69+0x25/0x210 > > [ 6277.639666] RSP: 0018:ffffb215c34cbbd0 EFLAGS: 00010246 [ 6277.646845] > > RAX: 0000000000000076 RBX: ffff9a5046cdd110 RCX: 0000000000000001 [ > > 6277.655951] RDX: ffffffffbe4a9200 RSI: 0000000000000080 RDI: > > 0000000000000000 [ 6277.665065] RBP: ffffb215c34cbc08 R08: > > 0000000000000000 R09: 0000000000001550 [ 6277.674176] R10: > > 00000000000012df R11: 00000000ffffffff R12: ffff9a4fe3bbfc80 [ 6277.683298] > > R13: ffff9a5221a54740 R14: ffffffffbe4a9200 R15: 0000000000000080 [ > > 6277.692426] kfree_call_rcu+0x1a/0x20 [ 6277.698025] > > tipc_publ_notify+0xae/0x190 [tipc] [ 6277.704432] > > tipc_node_write_unlock+0xf3/0x100 [tipc] [ 6277.711361] > > tipc_node_link_down+0x99/0x140 [tipc] [ 6277.718027] > > tipc_node_timeout+0x126/0x130 [tipc] [ 6277.724591] ? > > tipc_node_calculate_timer.isra.13+0x60/0x60 [tipc] [ 6277.732516] > > call_timer_fn+0x30/0x130 [ 6277.737940] run_timer_softirq+0x3fb/0x450 [ > > 6277.743701] ? __switch_to_asm+0x34/0x70 [ 6277.749216] ? > > __switch_to+0xad/0x500 [ 6277.754386] ? __switch_to_asm+0x40/0x70 [ > > 6277.759732] ? __switch_to_asm+0x34/0x70 [ 6277.764993] ? > > __switch_to_asm+0x40/0x70 [ 6277.770220] __do_softirq+0xe4/0x2bb [ > > 6277.775071] run_ksoftirqd+0x22/0x60 [ 6277.779899] > > smpboot_thread_fn+0xfc/0x170 [ 6277.785156] kthread+0x121/0x140 [ > > 6277.789619] ? sort_range+0x30/0x30 [ 6277.794333] ? > > kthread_create_worker_on_cpu+0x70/0x70 > > [ 6277.800617] ret_from_fork+0x35/0x40 > > [ 6277.805419] Code: 2d 61 60 01 73 17 48 8b 05 84 21 16 01 be fd 00 > > 00 00 48 8b 40 30 e8 36 b6 ba 00 5d c3 89 fe 48 c7 c7 60 fe 0b be e8 > > 06 48 03 00 <0f> 0b 5d c3 66 90 0f 1f 44 00 00 55 48 89 e5 53 48 83 ec > > 20 65 > > [ 6277.826959] ---[ end trace f418a9afa145bb5f ]--- ke 31. lokak. 2018 klo > > 14.39 > > [email protected] kirjoitti: > > > > > > Hello, > > > > > > Thanks Ying! While running a server with 250 LXCs and 2500 TIPC > > > sockets running with modified connection demo, I am seeing the problem > > > described below. The kernel is stock Ubuntu1804 4.15.0-38-generic with > > > 16 cores. It is fairly easy to reproduce, after starting up containers > > > it will happen within an hour or so. > > > > > > Please let me know if there is already a patch for it, I am happy to > > > try. I can also try fix candidates quite easily. > > > > > > BR, > > > -- > > > Juhamatti > > > > > > [ 3195.231191] BUG: unable to handle kernel paging request at > > > 0000000200001dc7 [ 3195.238220] IP: _raw_spin_lock_bh+0x1b/0x30 [ > > > 3195.242432] PGD 0 P4D 0 [ 3195.244988] Oops: 0002 [#1] SMP NOPTI [ > > > 3195.248668] Modules linked in: binfmt_misc tipc ip6_udp_tunnel > > > udp_tunnel xt_CHECKSUM iptable_mangle ipt_MASQUERADE > > > nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 > > > nf_nat_ipv4 nf_nat nf_conntrack xt_tcpudp bridge stp llc veth > > > iptable_filter ipmi_ssif nls_iso8859_1 intel_rapl pnd2_edac > > > x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm > > irqbypass > > > asix intel_cstate usbnet intel_rapl_perf joydev input_leds mii ipmi_si > > > ipmi_devintf ipmi_msghandler mac_hid shpchp sch_fq_codel ib_iser > > > rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi > > > scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress > > > raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor > > > async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear > > > hid_generic usbhid hid crct10dif_pclmul [ 3195.319457] crc32_pclmul > > > ghash_clmulni_intel pcbc ast aesni_intel ttm aes_x86_64 drm_kms_helper > > > crypto_simd glue_helper cryptd syscopyarea sysfillrect sysimgblt ixgbe > > > igb fb_sys_fops i40e drm ahci dca i2c_algo_bit libahci mdio i2c_ismt > > > ptp pps_core [ 3195.341632] CPU: 1 PID: 348 Comm: kworker/u32:7 Not > > > tainted 4.15.0-38-generic #41-Ubuntu [ 3195.349754] Hardware name: > > > Supermicro Super Server/A2SDV-16C-TLN5F, BIOS 1.0b 12/12/2017 [ > > > 3195.357891] Workqueue: tipc_send tipc_send_work [tipc] [ 3195.363061] > > > RIP: 0010:_raw_spin_lock_bh+0x1b/0x30 [ 3195.367788] RSP: > > > 0018:ffff90d3df243e90 EFLAGS: 00010246 [ 3195.373032] RAX: > > > 0000000000000000 RBX: ffff90d1317a6c18 RCX: ffff90d3df243f08 [ > > > 3195.380203] RDX: 0000000000000001 RSI: ffffffffc07d8290 RDI: > > > 0000000200001dc7 [ 3195.387393] RBP: ffff90d3df243e90 R08: > > > ffff90d3df243f10 R09: ffff90d3df25a780 [ 3195.394561] R10: > > > ffff90d3df243f10 R11: ffff90d3df25a770 R12: 0000000200001dc7 [ > > > 3195.401729] R13: ffff90d1317a6c00 R14: ffffffffc07d8290 R15: > > > ffff90d1317a6c18 [ 3195.408897] FS: 0000000000000000(0000) > > > GS:ffff90d3df240000(0000) > > > knlGS:0000000000000000 > > > [ 3195.417025] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ > > > 3195.422798] CR2: 0000000200001dc7 CR3: 00000007711f8000 CR4: > > > 00000000003406e0 [ 3195.429965] Call Trace: > > > [ 3195.432429] <IRQ> > > > [ 3195.434467] tipc_subscrp_timeout+0x25/0x90 [tipc] [ 3195.439311] > > > call_timer_fn+0x30/0x130 [ 3195.442994] run_timer_softirq+0x3fb/0x450 > > > [ 3195.447113] ? lapic_next_deadline+0x26/0x30 [ 3195.451404] ? > > > clockevents_program_event+0xd8/0x110 > > > [ 3195.456304] __do_softirq+0xe4/0x2bb [ 3195.459903] > > > do_softirq_own_stack+0x2a/0x40 [ 3195.464107] </IRQ> [ 3195.466219] > > > do_softirq.part.17+0x50/0x60 [ 3195.470250] > > > __local_bh_enable_ip+0x50/0x60 [ 3195.474456] > > > _raw_spin_unlock_bh+0x1e/0x20 [ 3195.478576] tipc_sk_rcv+0x15a/0x440 > > > [tipc] [ 3195.482782] tipc_node_xmit+0x147/0x190 [tipc] [ > > > 3195.487254] __tipc_sendstream+0x1a9/0x2d0 [tipc] [ 3195.491987] ? > > > aa_sk_perm+0x48/0x1a0 [ 3195.495587] tipc_sendstream+0x35/0x50 [tipc] > > > [ 3195.499976] tipc_send_packet+0x17/0x20 [tipc] [ 3195.504454] > > > sock_sendmsg+0x3e/0x50 [ 3195.507967] kernel_sendmsg+0x2b/0x30 [ > > > 3195.511663] tipc_send_to_sock+0x12f/0x290 [tipc] [ 3195.516395] ? > > > __switch_to_asm+0x40/0x70 [ 3195.520338] ? > > __switch_to_asm+0x34/0x70 > > > [ 3195.524280] ? __switch_to_asm+0x34/0x70 [ 3195.528222] ? > > > __switch_to_asm+0x40/0x70 [ 3195.532174] ? > > __switch_to_asm+0x34/0x70 > > > [ 3195.536133] tipc_send_work+0x23/0x40 [tipc] [ 3195.540433] > > > process_one_work+0x1de/0x410 [ 3195.544468] > > worker_thread+0x32/0x410 > > > [ 3195.548147] kthread+0x121/0x140 [ 3195.551403] ? > > > process_one_work+0x410/0x410 [ 3195.555611] ? > > > kthread_create_worker_on_cpu+0x70/0x70 > > > [ 3195.560686] ? do_syscall_64+0x73/0x130 [ 3195.564545] ? > > > SyS_exit_group+0x14/0x20 [ 3195.568404] ret_from_fork+0x35/0x40 [ > > > 3195.571998] Code: 00 5d c3 31 c0 5d c3 66 0f 1f 84 00 00 00 00 00 0f > > > 1f 44 00 00 55 65 81 05 8f 8a c7 73 00 02 00 00 48 89 e5 31 c0 ba > > > 01 00 00 00 <f0> 0f b1 17 85 c0 75 02 5d c3 89 c6 e8 24 12 74 ff 66 90 > > > 5d c3 [ 3195.590981] RIP: _raw_spin_lock_bh+0x1b/0x30 RSP: > > > ffff90d3df243e90 [ 3195.599011] CR2: 0000000200001dc7 [ 3195.604403] > > > ---[ end trace 4ce4054b7f724f81 ]--- [ 3195.619914] Kernel panic - not > > > syncing: Fatal exception in interrupt [ 3195.628264] Kernel Offset: > > > 0xaa00000 from 0xffffffff81000000 (relocation range: > > > 0xffffffff80000000-0xffffffffbfffffff) > > > [ 3195.649393] ---[ end Kernel panic - not syncing: Fatal exception in > > > interrupt [ 3195.658447] WARNING: CPU: 1 PID: 348 at > > > /build/linux-CuEUJv/linux-4.15.0/kernel/sched/core.c:1192 > > > set_task_cpu+0x158/0x160 > > > [ 3195.671428] Modules linked in: binfmt_misc tipc ip6_udp_tunnel > > > udp_tunnel xt_CHECKSUM iptable_mangle ipt_MASQUERADE > > > nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 > > > nf_nat_ipv4 nf_nat nf_conntrack xt_tcpudp bridge stp llc veth > > > iptable_filter ipmi_ssif nls_iso8859_1 intel_rapl pnd2_edac > > > x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm > > irqbypass > > > asix intel_cstate usbnet intel_rapl_perf joydev input_leds mii ipmi_si > > > ipmi_devintf ipmi_msghandler mac_hid shpchp sch_fq_codel ib_iser > > > rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi > > > scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress > > > raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor > > > async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear > > > hid_generic usbhid hid crct10dif_pclmul [ 3195.749731] crc32_pclmul > > > ghash_clmulni_intel pcbc ast aesni_intel ttm aes_x86_64 drm_kms_helper > > > crypto_simd glue_helper cryptd syscopyarea sysfillrect sysimgblt ixgbe > > > igb fb_sys_fops i40e drm ahci dca i2c_algo_bit libahci mdio i2c_ismt > > > ptp pps_core > > > [ 3195.775727] CPU: 1 PID: 348 Comm: kworker/u32:7 Tainted: G D > > > 4.15.0-38-generic #41-Ubuntu > > > [ 3195.787072] Hardware name: Supermicro Super Server/A2SDV-16C- > > TLN5F, > > > BIOS 1.0b 12/12/2017 [ 3195.797128] Workqueue: tipc_send > > > tipc_send_work [tipc] [ 3195.804212] RIP: > > > 0010:set_task_cpu+0x158/0x160 [ 3195.810603] RSP: > > > 0018:ffff90d3df243808 EFLAGS: 00010006 [ 3195.817773] RAX: > > > 0000000000000200 RBX: ffff90d2c5b11640 RCX: 0000000000000000 [ > > > 3195.826856] RDX: 0000000000000000 RSI: 0000000000000000 RDI: > > > ffff90d2c5b11640 [ 3195.835936] RBP: ffff90d3df243828 R08: > > > 0000000000000000 R09: 000000000000ffff [ 3195.845016] R10: > > > 0000000000000010 R11: 0000000000000000 R12: ffff90d2c5b1216c [ > > > 3195.854101] R13: 0000000000000000 R14: 0000000000000000 R15: > > > 0000000000022880 [ 3195.863174] FS: 0000000000000000(0000) > > > GS:ffff90d3df240000(0000) > > > knlGS:0000000000000000 > > > [ 3195.873202] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ > > > 3195.880878] CR2: 0000000200001dc7 CR3: 00000007711f8000 CR4: > > > 00000000003406e0 [ 3195.889943] Call Trace: > > > [ 3195.894313] <IRQ> > > > [ 3195.898250] try_to_wake_up+0x159/0x480 [ 3195.904009] > > > default_wake_function+0x12/0x20 [ 3195.910199] > > > autoremove_wake_function+0x12/0x40 > > > [ 3195.916660] __wake_up_common+0x73/0x130 [ 3195.922508] > > > __wake_up_common_lock+0x80/0xc0 [ 3195.928695] > > __wake_up+0x13/0x20 [ > > > 3195.933831] wake_up_klogd_work_func+0x40/0x60 [ 3195.940185] > > > irq_work_run_list+0x52/0x80 [ 3195.946026] ? > > > tick_sched_do_timer+0x50/0x50 [ 3195.952206] irq_work_tick+0x3b/0x50 > > > [ 3195.957696] update_process_times+0x42/0x60 [ 3195.963797] > > > tick_sched_handle+0x26/0x70 [ 3195.969635] ? > > > tick_sched_do_timer+0x50/0x50 [ 3195.975827] > > > tick_sched_timer+0x39/0x80 [ 3195.981576] > > > __hrtimer_run_queues+0xdf/0x230 [ 3195.987756] > > > hrtimer_interrupt+0xa0/0x1d0 [ 3195.993676] > > > smp_apic_timer_interrupt+0x6f/0x130 > > > [ 3196.000208] apic_timer_interrupt+0x84/0x90 [ 3196.006312] RIP: > > > 0010:panic+0x1fe/0x244 [ 3196.012062] RSP: 0018:ffff90d3df243bd8 > > > EFLAGS: 00000286 ORIG_RAX: > > > ffffffffffffff11 > > > [ 3196.021578] RAX: 0000000000000041 RBX: 0000000000000000 RCX: > > > 0000000000000006 [ 3196.030668] RDX: 0000000000000000 RSI: > > > 0000000000000092 RDI: ffff90d3df256490 [ 3196.039773] RBP: > > > ffff90d3df243c50 R08: 0000000000000000 R09: 000000000000113b [ > > > 3196.048892] R10: 0000000000000008 R11: 00000000ffffffff R12: > > > 0000000000000000 [ 3196.058026] R13: 0000000000000000 R14: > > > 0000000000000009 R15: 0000000000000002 [ 3196.067174] ? > > > panic+0x1f7/0x244 [ 3196.072419] oops_end+0xb6/0xd0 [ 3196.077569] > > > no_context+0x17c/0x440 [ 3196.083061] > > > __bad_area_nosemaphore+0xf3/0x1b0 [ 3196.089520] > > > bad_area_nosemaphore+0x14/0x20 [ 3196.095714] > > > __do_page_fault+0xd4/0x4d0 [ 3196.101560] ? > > > update_load_avg+0x57f/0x6e0 [ 3196.107657] do_page_fault+0x2e/0xe0 [ > > > 3196.113242] page_fault+0x25/0x50 [ 3196.118560] RIP: > > > 0010:_raw_spin_lock_bh+0x1b/0x30 [ 3196.125207] RSP: > > > 0018:ffff90d3df243e90 EFLAGS: 00010246 [ 3196.132299] RAX: > > > 0000000000000000 RBX: ffff90d1317a6c18 RCX: ffff90d3df243f08 [ > > > 3196.141292] RDX: 0000000000000001 RSI: ffffffffc07d8290 RDI: > > > 0000000200001dc7 [ 3196.150282] RBP: ffff90d3df243e90 R08: > > > ffff90d3df243f10 R09: ffff90d3df25a780 [ 3196.159277] R10: > > > ffff90d3df243f10 R11: ffff90d3df25a770 R12: 0000000200001dc7 [ > > > 3196.168262] R13: ffff90d1317a6c00 R14: ffffffffc07d8290 R15: > > > ffff90d1317a6c18 [ 3196.177238] ? tipc_subscrb_release_cb+0x30/0x30 > > > [tipc] [ 3196.184271] ? tipc_subscrb_release_cb+0x30/0x30 [tipc] [ > > > 3196.191262] tipc_subscrp_timeout+0x25/0x90 [tipc] [ 3196.197751] > > > call_timer_fn+0x30/0x130 [ 3196.203029] run_timer_softirq+0x3fb/0x450 > > > [ 3196.208673] ? lapic_next_deadline+0x26/0x30 [ 3196.214423] ? > > > clockevents_program_event+0xd8/0x110 > > > [ 3196.220707] __do_softirq+0xe4/0x2bb [ 3196.225616] > > > do_softirq_own_stack+0x2a/0x40 [ 3196.231114] </IRQ> [ 3196.234491] > > > do_softirq.part.17+0x50/0x60 [ 3196.239759] > > > __local_bh_enable_ip+0x50/0x60 [ 3196.245196] > > > _raw_spin_unlock_bh+0x1e/0x20 [ 3196.250555] tipc_sk_rcv+0x15a/0x440 > > > [tipc] [ 3196.255986] tipc_node_xmit+0x147/0x190 [tipc] [ > > > 3196.261677] __tipc_sendstream+0x1a9/0x2d0 [tipc] [ 3196.267615] ? > > > aa_sk_perm+0x48/0x1a0 [ 3196.272420] tipc_sendstream+0x35/0x50 [tipc] > > > [ 3196.278014] tipc_send_packet+0x17/0x20 [tipc] [ 3196.283673] > > > sock_sendmsg+0x3e/0x50 [ 3196.288359] kernel_sendmsg+0x2b/0x30 [ > > > 3196.293212] tipc_send_to_sock+0x12f/0x290 [tipc] [ 3196.299094] ? > > > __switch_to_asm+0x40/0x70 [ 3196.304205] ? > > __switch_to_asm+0x34/0x70 > > > [ 3196.309301] ? __switch_to_asm+0x34/0x70 [ 3196.314379] ? > > > __switch_to_asm+0x40/0x70 [ 3196.319446] ? > > __switch_to_asm+0x34/0x70 > > > [ 3196.324510] tipc_send_work+0x23/0x40 [tipc] [ 3196.329931] > > > process_one_work+0x1de/0x410 [ 3196.335103] > > worker_thread+0x32/0x410 > > > [ 3196.339919] kthread+0x121/0x140 [ 3196.344299] ? > > > process_one_work+0x410/0x410 [ 3196.349640] ? > > > kthread_create_worker_on_cpu+0x70/0x70 > > > [ 3196.355853] ? do_syscall_64+0x73/0x130 [ 3196.360851] ? > > > SyS_exit_group+0x14/0x20 [ 3196.365841] ret_from_fork+0x35/0x40 [ > > > 3196.370567] Code: 32 ff ff ff 80 8b 64 08 00 00 04 e9 4e ff ff ff 0f > > > 0b e9 f2 fe ff ff f7 43 5c fd ff ff ff 0f 84 fc fe ff ff 0f 0b e9 > > > f5 fe ff ff <0f> 0b e9 ff fe ff ff 90 0f 1f 44 00 00 55 48 89 e5 41 56 > > > 41 55 > > > [ 3196.391999] ---[ end trace 4ce4054b7f724f82 ]--- [ 3196.397922] > > > ------------[ cut here ]------------ [ 3196.403838] sched: Unexpected > > > reschedule of offline CPU#0! > > > [ 3196.410627] WARNING: CPU: 1 PID: 348 at > > > /build/linux-CuEUJv/linux-4.15.0/arch/x86/kernel/smp.c:128 > > > native_smp_send_reschedule+0x3a/0x40 > > > [ 3196.424161] Modules linked in: binfmt_misc tipc ip6_udp_tunnel > > > udp_tunnel xt_CHECKSUM iptable_mangle ipt_MASQUERADE > > > nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 > > > nf_nat_ipv4 nf_nat nf_conntrack xt_tcpudp bridge stp llc veth > > > iptable_filter ipmi_ssif nls_iso8859_1 intel_rapl pnd2_edac > > > x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm > > irqbypass > > > asix intel_cstate usbnet intel_rapl_perf joydev input_leds mii ipmi_si > > > ipmi_devintf ipmi_msghandler mac_hid shpchp sch_fq_codel ib_iser > > > rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi > > > scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress > > > raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor > > > async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear > > > hid_generic usbhid hid crct10dif_pclmul [ 3196.500686] crc32_pclmul > > > ghash_clmulni_intel pcbc ast aesni_intel ttm aes_x86_64 drm_kms_helper > > > crypto_simd glue_helper cryptd syscopyarea sysfillrect sysimgblt ixgbe > > > igb fb_sys_fops i40e drm ahci dca i2c_algo_bit libahci mdio i2c_ismt > > > ptp pps_core > > > [ 3196.526181] CPU: 1 PID: 348 Comm: kworker/u32:7 Tainted: G D W > > > 4.15.0-38-generic #41-Ubuntu > > > [ 3196.537321] Hardware name: Supermicro Super Server/A2SDV-16C- > > TLN5F, > > > BIOS 1.0b 12/12/2017 [ 3196.547206] Workqueue: tipc_send > > > tipc_send_work [tipc] [ 3196.554142] RIP: > > > 0010:native_smp_send_reschedule+0x3a/0x40 > > > [ 3196.561423] RSP: 0018:ffff90d3df243790 EFLAGS: 00010082 [ > > > 3196.568445] RAX: 0000000000000000 RBX: 0000000000000000 RCX: > > > 0000000000000006 [ 3196.577395] RDX: 0000000000000007 RSI: > > > 0000000000000086 RDI: ffff90d3df256490 [ 3196.586344] RBP: > > > ffff90d3df243790 R08: 0000000000000000 R09: 000000000000119e [ > > > 3196.595297] R10: 000000000000025f R11: 00000000ffffffff R12: > > > ffff90d3df222880 [ 3196.604262] R13: ffff90d2c5b11640 R14: > > > ffff90d3df243848 R15: ffff90d3df222880 [ 3196.613232] FS: > > > 0000000000000000(0000) GS:ffff90d3df240000(0000) > > > knlGS:0000000000000000 > > > [ 3196.623178] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ > > > 3196.630768] CR2: 0000000200001dc7 CR3: 00000007711f8000 CR4: > > > 00000000003406e0 [ 3196.639761] Call Trace: > > > [ 3196.644061] <IRQ> > > > [ 3196.647923] resched_curr+0x5d/0xc0 [ 3196.653276] > > > check_preempt_curr+0x7a/0x90 [ 3196.659130] > > ttwu_do_wakeup+0x1e/0x140 > > > [ 3196.664745] ttwu_do_activate+0x77/0x80 [ 3196.670433] > > > try_to_wake_up+0x1d6/0x480 [ 3196.676117] > > > default_wake_function+0x12/0x20 [ 3196.682237] > > > autoremove_wake_function+0x12/0x40 > > > [ 3196.688617] __wake_up_common+0x73/0x130 [ 3196.694389] > > > __wake_up_common_lock+0x80/0xc0 [ 3196.700494] > > __wake_up+0x13/0x20 [ > > > 3196.705549] wake_up_klogd_work_func+0x40/0x60 [ 3196.711831] > > > irq_work_run_list+0x52/0x80 [ 3196.717598] ? > > > tick_sched_do_timer+0x50/0x50 [ 3196.723707] irq_work_tick+0x3b/0x50 > > > [ 3196.729118] update_process_times+0x42/0x60 [ 3196.735141] > > > tick_sched_handle+0x26/0x70 [ 3196.740900] ? > > > tick_sched_do_timer+0x50/0x50 [ 3196.747014] > > > tick_sched_timer+0x39/0x80 [ 3196.752684] > > > __hrtimer_run_queues+0xdf/0x230 [ 3196.758789] > > > hrtimer_interrupt+0xa0/0x1d0 [ 3196.764640] > > > smp_apic_timer_interrupt+0x6f/0x130 > > > [ 3196.771101] apic_timer_interrupt+0x84/0x90 [ 3196.777130] RIP: > > > 0010:panic+0x1fe/0x244 [ 3196.782802] RSP: 0018:ffff90d3df243bd8 > > > EFLAGS: 00000286 ORIG_RAX: > > > ffffffffffffff11 > > > [ 3196.792240] RAX: 0000000000000041 RBX: 0000000000000000 RCX: > > > 0000000000000006 [ 3196.801259] RDX: 0000000000000000 RSI: > > > 0000000000000092 RDI: ffff90d3df256490 [ 3196.810293] RBP: > > > ffff90d3df243c50 R08: 0000000000000000 R09: 000000000000113b [ > > > 3196.819342] R10: 0000000000000008 R11: 00000000ffffffff R12: > > > 0000000000000000 [ 3196.828408] R13: 0000000000000000 R14: > > > 0000000000000009 R15: 0000000000000002 [ 3196.837484] ? > > > panic+0x1f7/0x244 [ 3196.842654] oops_end+0xb6/0xd0 [ 3196.847725] > > > no_context+0x17c/0x440 [ 3196.853142] > > > __bad_area_nosemaphore+0xf3/0x1b0 [ 3196.859519] > > > bad_area_nosemaphore+0x14/0x20 [ 3196.865635] > > > __do_page_fault+0xd4/0x4d0 [ 3196.871401] ? > > > update_load_avg+0x57f/0x6e0 [ 3196.877434] do_page_fault+0x2e/0xe0 [ > > > 3196.882942] page_fault+0x25/0x50 [ 3196.888121] RIP: > > > 0010:_raw_spin_lock_bh+0x1b/0x30 [ 3196.894624] RSP: > > > 0018:ffff90d3df243e90 EFLAGS: 00010246 [ 3196.901641] RAX: > > > 0000000000000000 RBX: ffff90d1317a6c18 RCX: ffff90d3df243f08 [ > > > 3196.910595] RDX: 0000000000000001 RSI: ffffffffc07d8290 RDI: > > > 0000000200001dc7 [ 3196.919541] RBP: ffff90d3df243e90 R08: > > > ffff90d3df243f10 R09: ffff90d3df25a780 [ 3196.928459] R10: > > > ffff90d3df243f10 R11: ffff90d3df25a770 R12: 0000000200001dc7 [ > > > 3196.937309] R13: ffff90d1317a6c00 R14: ffffffffc07d8290 R15: > > > ffff90d1317a6c18 [ 3196.946097] ? tipc_subscrb_release_cb+0x30/0x30 > > > [tipc] [ 3196.952912] ? tipc_subscrb_release_cb+0x30/0x30 [tipc] [ > > > 3196.959636] tipc_subscrp_timeout+0x25/0x90 [tipc] [ 3196.965850] > > > call_timer_fn+0x30/0x130 [ 3196.970861] run_timer_softirq+0x3fb/0x450 > > > [ 3196.976282] ? lapic_next_deadline+0x26/0x30 [ 3196.981856] ? > > > clockevents_program_event+0xd8/0x110 > > > [ 3196.988004] __do_softirq+0xe4/0x2bb [ 3196.992839] > > > do_softirq_own_stack+0x2a/0x40 [ 3196.998277] </IRQ> [ 3197.001619] > > > do_softirq.part.17+0x50/0x60 [ 3197.006875] > > > __local_bh_enable_ip+0x50/0x60 [ 3197.012300] > > > _raw_spin_unlock_bh+0x1e/0x20 [ 3197.017639] tipc_sk_rcv+0x15a/0x440 > > > [tipc] [ 3197.023057] tipc_node_xmit+0x147/0x190 [tipc] [ > > > 3197.028715] __tipc_sendstream+0x1a9/0x2d0 [tipc] [ 3197.034618] ? > > > aa_sk_perm+0x48/0x1a0 [ 3197.039384] tipc_sendstream+0x35/0x50 [tipc] > > > [ 3197.044919] tipc_send_packet+0x17/0x20 [tipc] [ 3197.050539] > > > sock_sendmsg+0x3e/0x50 [ 3197.055207] kernel_sendmsg+0x2b/0x30 [ > > > 3197.060044] tipc_send_to_sock+0x12f/0x290 [tipc] [ 3197.065927] ? > > > __switch_to_asm+0x40/0x70 [ 3197.071048] ? > > __switch_to_asm+0x34/0x70 > > > [ 3197.076162] ? __switch_to_asm+0x34/0x70 [ 3197.081257] ? > > > __switch_to_asm+0x40/0x70 [ 3197.086345] ? > > __switch_to_asm+0x34/0x70 > > > [ 3197.091425] tipc_send_work+0x23/0x40 [tipc] [ 3197.096854] > > > process_one_work+0x1de/0x410 [ 3197.102030] > > worker_thread+0x32/0x410 > > > [ 3197.106854] kthread+0x121/0x140 [ 3197.111239] ? > > > process_one_work+0x410/0x410 [ 3197.116576] ? > > > kthread_create_worker_on_cpu+0x70/0x70 > > > [ 3197.122788] ? do_syscall_64+0x73/0x130 [ 3197.127786] ? > > > SyS_exit_group+0x14/0x20 [ 3197.132784] ret_from_fork+0x35/0x40 [ > > > 3197.137508] Code: 2d 61 60 01 73 17 48 8b 05 84 21 16 01 be fd 00 > > > 00 00 48 8b 40 30 e8 36 b6 ba 00 5d c3 89 fe 48 c7 c7 60 fe ab 8c e8 > > > 06 48 03 00 <0f> 0b 5d c3 66 90 0f 1f 44 00 00 55 48 89 e5 53 48 83 ec > > > 20 65 > > > [ 3197.158946] ---[ end trace 4ce4054b7f724f83 ]--- [ 3197.164963] > > > unchecked MSR access error: WRMSR to 0x83f (tried to write > > > 0x00000000000000f6) at rIP: 0xffffffff8ba6c1a8 > > > (native_write_msr+0x8/0x30) > > > [ 3197.179360] Call Trace: > > > [ 3197.183165] <IRQ> > > > [ 3197.186532] native_apic_msr_write+0x2b/0x40 [ 3197.192143] > > > x2apic_send_IPI_self+0x20/0x30 [ 3197.197660] > > > arch_irq_work_raise+0x2a/0x40 [ 3197.203080] > > irq_work_queue+0x8d/0xa0 > > > [ 3197.208058] dbs_update_util_handler+0x80/0xa0 [ 3197.213822] > > > update_load_avg+0x5c5/0x6e0 [ 3197.219068] > > enqueue_entity+0x5e/0x6b0 > > > [ 3197.224124] enqueue_task_fair+0x6c/0x7f0 [ 3197.229449] ? > > > __update_load_avg_blocked_se.isra.37+0xd1/0x150 > > > [ 3197.236608] activate_task+0x54/0xc0 [ 3197.241510] ? > > > sched_clock+0x9/0x10 [ 3197.246338] ttwu_do_activate+0x49/0x80 [ > > > 3197.251514] try_to_wake_up+0x1d6/0x480 [ 3197.256684] > > > default_wake_function+0x12/0x20 [ 3197.262288] > > > autoremove_wake_function+0x12/0x40 > > > [ 3197.268162] __wake_up_common+0x73/0x130 [ 3197.273428] > > > __wake_up_common_lock+0x80/0xc0 [ 3197.279026] > > __wake_up+0x13/0x20 [ > > > 3197.283562] wake_up_klogd_work_func+0x40/0x60 [ 3197.289321] > > > irq_work_run_list+0x52/0x80 [ 3197.294545] ? > > > tick_sched_do_timer+0x50/0x50 [ 3197.300116] irq_work_tick+0x3b/0x50 > > > [ 3197.304987] update_process_times+0x42/0x60 [ 3197.310455] > > > tick_sched_handle+0x26/0x70 [ 3197.315672] ? > > > tick_sched_do_timer+0x50/0x50 [ 3197.321233] > > > tick_sched_timer+0x39/0x80 [ 3197.326351] > > > __hrtimer_run_queues+0xdf/0x230 [ 3197.331909] > > > hrtimer_interrupt+0xa0/0x1d0 [ 3197.337205] > > > smp_apic_timer_interrupt+0x6f/0x130 > > > [ 3197.343108] apic_timer_interrupt+0x84/0x90 [ 3197.348574] RIP: > > > 0010:panic+0x1fe/0x244 [ 3197.353674] RSP: 0018:ffff90d3df243bd8 > > > EFLAGS: 00000286 ORIG_RAX: > > > ffffffffffffff11 > > > [ 3197.362537] RAX: 0000000000000041 RBX: 0000000000000000 RCX: > > > 0000000000000006 [ 3197.370972] RDX: 0000000000000000 RSI: > > > 0000000000000092 RDI: ffff90d3df256490 [ 3197.379405] RBP: > > > ffff90d3df243c50 R08: 0000000000000000 R09: 000000000000113b [ > > > 3197.387843] R10: 0000000000000008 R11: 00000000ffffffff R12: > > > 0000000000000000 [ 3197.396291] R13: 0000000000000000 R14: > > > 0000000000000009 R15: 0000000000000002 [ 3197.404743] ? > > > panic+0x1f7/0x244 [ 3197.409283] oops_end+0xb6/0xd0 [ 3197.413730] > > > no_context+0x17c/0x440 [ 3197.418521] > > > __bad_area_nosemaphore+0xf3/0x1b0 [ 3197.424273] > > > bad_area_nosemaphore+0x14/0x20 [ 3197.429763] > > > __do_page_fault+0xd4/0x4d0 [ 3197.434912] ? > > > update_load_avg+0x57f/0x6e0 [ 3197.440318] do_page_fault+0x2e/0xe0 [ > > > 3197.445203] page_fault+0x25/0x50 [ 3197.449827] RIP: > > > 0010:_raw_spin_lock_bh+0x1b/0x30 [ 3197.455862] RSP: > > > 0018:ffff90d3df243e90 EFLAGS: 00010246 [ 3197.462432] RAX: > > > 0000000000000000 RBX: ffff90d1317a6c18 RCX: ffff90d3df243f08 [ > > > 3197.470940] RDX: 0000000000000001 RSI: ffffffffc07d8290 RDI: > > > 0000000200001dc7 [ 3197.479454] RBP: ffff90d3df243e90 R08: > > > ffff90d3df243f10 R09: ffff90d3df25a780 [ 3197.487973] R10: > > > ffff90d3df243f10 R11: ffff90d3df25a770 R12: 0000000200001dc7 [ > > > 3197.496495] R13: ffff90d1317a6c00 R14: ffffffffc07d8290 R15: > > > ffff90d1317a6c18 [ 3197.505043] ? tipc_subscrb_release_cb+0x30/0x30 > > > [tipc] [ 3197.511689] ? tipc_subscrb_release_cb+0x30/0x30 [tipc] [ > > > 3197.518319] tipc_subscrp_timeout+0x25/0x90 [tipc] [ 3197.524511] > > > call_timer_fn+0x30/0x130 [ 3197.529573] run_timer_softirq+0x3fb/0x450 > > > [ 3197.535073] ? lapic_next_deadline+0x26/0x30 [ 3197.540684] ? > > > clockevents_program_event+0xd8/0x110 > > > [ 3197.546832] __do_softirq+0xe4/0x2bb [ 3197.551680] > > > do_softirq_own_stack+0x2a/0x40 [ 3197.557086] </IRQ> [ 3197.560362] > > > do_softirq.part.17+0x50/0x60 [ 3197.565561] > > > __local_bh_enable_ip+0x50/0x60 [ 3197.570937] > > > _raw_spin_unlock_bh+0x1e/0x20 [ 3197.576228] tipc_sk_rcv+0x15a/0x440 > > > [tipc] [ 3197.581602] tipc_node_xmit+0x147/0x190 [tipc] [ > > > 3197.587245] __tipc_sendstream+0x1a9/0x2d0 [tipc] [ 3197.593144] ? > > > aa_sk_perm+0x48/0x1a0 [ 3197.597909] tipc_sendstream+0x35/0x50 [tipc] > > > [ 3197.603461] tipc_send_packet+0x17/0x20 [tipc] [ 3197.609101] > > > sock_sendmsg+0x3e/0x50 [ 3197.613767] kernel_sendmsg+0x2b/0x30 [ > > > 3197.618609] tipc_send_to_sock+0x12f/0x290 [tipc] [ 3197.624497] ? > > > __switch_to_asm+0x40/0x70 [ 3197.629601] ? > > __switch_to_asm+0x34/0x70 > > > [ 3197.634697] ? __switch_to_asm+0x34/0x70 [ 3197.639781] ? > > > __switch_to_asm+0x40/0x70 [ 3197.644846] ? > > __switch_to_asm+0x34/0x70 > > > [ 3197.649909] tipc_send_work+0x23/0x40 [tipc] [ 3197.655317] > > > process_one_work+0x1de/0x410 [ 3197.660470] > > worker_thread+0x32/0x410 > > > [ 3197.665274] kthread+0x121/0x140 [ 3197.669636] ? > > > process_one_work+0x410/0x410 [ 3197.674955] ? > > > kthread_create_worker_on_cpu+0x70/0x70 > > > [ 3197.681146] ? do_syscall_64+0x73/0x130 [ 3197.686118] ? > > > SyS_exit_group+0x14/0x20 [ 3197.691094] ret_from_fork+0x35/0x40 pe > > > 26. lokak. 2018 klo 6.38 Ying Xue ([email protected]) kirjoitti: > > > > > > > > Hi Juhamatti, > > > > > > > > This is a good test plan indeed. If you encounter any problem during > > > > the testing, please share it here. > > > > > > > > From my view, TIPC had fully supported networking nemespace at least > > > > three years ago and its sockets could be fully isolated between > > containers. > > > > > > > > Thanks, > > > > Ying > > > > > > > > On 10/26/2018 11:12 AM, [email protected] wrote: > > > > > Hello, > > > > > > > > > > I'm planning to test TIPC scalability in more detail, especially > > > > > regarding the binding table and topology (service tracking) > > > > > service with large number of nodes (~1k) and sockets (100-1k). Is > > > > > it enough to do it with LXC containers with isolated namespaces to > > > > > get realistic results? My concern here is that if TIPC > > > > > implementation would not provide full isolation to sockets e.g. > > > > > for performance reasons inside the same kernel, then this may not > > > > > provide results matching to real environments. VMs are the other > > > > > choice, but they tend to be heavy. Are the namespaces fully isolated > > inside the kernel? > > > > > > > > > > Thanks for the info, > > > > > -- > > > > > Juhamatti > > > > > > > > > > > > > > > _______________________________________________ > > > > > tipc-discussion mailing list > > > > > [email protected] > > > > > https://lists.sourceforge.net/lists/listinfo/tipc-discussion > > > > > > > > > > > _______________________________________________ > > tipc-discussion mailing list > > [email protected] > > https://lists.sourceforge.net/lists/listinfo/tipc-discussion _______________________________________________ tipc-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/tipc-discussion
