Re: 2.6.23-rc2-mm2
Dave Young wrote: On 8/10/07, Dave Young [EMAIL PROTECTED] wrote: On 8/10/07, Andrew Morton [EMAIL PROTECTED] wrote: On Fri, 10 Aug 2007 16:18:38 +0800 Dave Young [EMAIL PROTECTED] wrote: Hi, 2.6.23-rc2-mm2: config with CONFIG_NF_CONNTRACK_AMANDA=y boot oops: hand copyed : the EIP: helper_hash backtrace: do_basic_setup ... nf_conntrack_amanda_init error_code do_vivide_error ... cc netdev please send config presumably nf_ct_helper_hsize is zero I'd have thought that the same bug would be present in mainline - there's nothing in git-net.patch. (hunts around, discovers that there's a net-2.6.24 tree now. Bah) MY config file for test: -cut--- Hi, config the CONFIG_NF_CONNTRACK_AMANDA=m, reboot , still oops, but this time the point is nf_conntrack_ftp_init, (I doubt the nf_conntrack_core is not initialized.) boot with vga_ask and select 80x60, copyed more infomations: -pre oops:--- sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy ... nf_conntrack_14_proto_sctp4 protocol register failed ... -oops: divide error: [#1] SMP EIP: 0060: [c088477b] EFLAGS 00010246 Process swapper ... nf_conntrack_ftp_init ... Code: ... EIP ... helper_hash +0x1b/0x24 Does not Oops here but is broken. Here is what I get : modprobe nf_conntrack_amanda WARNING: Error inserting nf_conntrack (/lib/modules/2.6.23-rc2-mm2/kernel/net/netfilter/nf_conntrack.ko): Cannot allocate memory FATAL: Error inserting nf_conntrack_amanda (/lib/modules/2.6.23-rc2-mm2/kernel/net/netfilter/nf_conntrack_amanda.ko): Unknown symbol in module, or unknown parameter (see dmesg) ( from dmesg ) ... [ 1094.768867] nf_conntrack version 0.5.0 (10240 buckets, 40960 max) [ 1094.769078] sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy [ 1094.780956] nf_conntrack_amanda: Unknown symbol nf_ct_expect_alloc [ 1094.781055] nf_conntrack_amanda: Unknown symbol nf_ct_expect_put [ 1094.781242] nf_conntrack_amanda: Unknown symbol nf_ct_expect_init [ 1094.781371] nf_conntrack_amanda: Unknown symbol nf_conntrack_helper_unregister [ 1094.781468] nf_conntrack_amanda: Unknown symbol __nf_ct_refresh_acct [ 1094.781580] nf_conntrack_amanda: Unknown symbol nf_ct_expect_related [ 1094.781673] nf_conntrack_amanda: Unknown symbol nf_conntrack_helper_register ... Regards dave - Gabriel - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
Andrew Morton pisze: ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.23-rc2/2.6.23-rc2-mm2/ Yet another sysctl table check failed [ 88.949055] Netfilter messages via NETLINK v0.30. [ 89.485399] nf_conntrack version 0.5.0 (16384 buckets, 65536 max) [ 89.491892] sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy [ 89.558178] xt_state: Unknown symbol nf_conntrack_untracked [ 89.563942] xt_state: Unknown symbol nf_ct_l3proto_module_put [ 89.569870] xt_state: Unknown symbol nf_ct_l3proto_try_module_get [ 90.852319] NET: Registered protocol family 17 Regards, Michal -- LOG http://www.stardust.webpages.pl/log/ - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
Michal Piotrowski wrote: Andrew Morton pisze: ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.23-rc2/2.6.23-rc2-mm2/ Yet another sysctl table check failed [ 88.949055] Netfilter messages via NETLINK v0.30. [ 89.485399] nf_conntrack version 0.5.0 (16384 buckets, 65536 max) [ 89.491892] sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy [ 89.558178] xt_state: Unknown symbol nf_conntrack_untracked [ 89.563942] xt_state: Unknown symbol nf_ct_l3proto_module_put [ 89.569870] xt_state: Unknown symbol nf_ct_l3proto_try_module_get [ 90.852319] NET: Registered protocol family 17 And maybe this ? http://lkml.org/lkml/2007/8/10/112 Regards, Michal Gabriel - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
Gabriel C [EMAIL PROTECTED] writes: Michal Piotrowski wrote: Andrew Morton pisze: ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.23-rc2/2.6.23-rc2-mm2/ Yet another sysctl table check failed [ 88.949055] Netfilter messages via NETLINK v0.30. [ 89.485399] nf_conntrack version 0.5.0 (16384 buckets, 65536 max) [ 89.491892] sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy [ 89.558178] xt_state: Unknown symbol nf_conntrack_untracked [ 89.563942] xt_state: Unknown symbol nf_ct_l3proto_module_put [ 89.569870] xt_state: Unknown symbol nf_ct_l3proto_try_module_get [ 90.852319] NET: Registered protocol family 17 And maybe this ? Could be. sysctl table check failure looks legitimate. At a quick skim I can't tell for certain if failure to register the sysctl table will keep the module from loading but it might. It looks like another canidate on the pile of kill the broken binary sysctl. Eric - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
On 10/08/07, Gabriel C [EMAIL PROTECTED] wrote: Michal Piotrowski wrote: Andrew Morton pisze: ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.23-rc2/2.6.23-rc2-mm2/ Yet another sysctl table check failed [ 88.949055] Netfilter messages via NETLINK v0.30. [ 89.485399] nf_conntrack version 0.5.0 (16384 buckets, 65536 max) [ 89.491892] sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy [ 89.558178] xt_state: Unknown symbol nf_conntrack_untracked [ 89.563942] xt_state: Unknown symbol nf_ct_l3proto_module_put [ 89.569870] xt_state: Unknown symbol nf_ct_l3proto_try_module_get [ 90.852319] NET: Registered protocol family 17 And maybe this ? http://lkml.org/lkml/2007/8/10/112 Exactly the same issue here while modprobe nf_conntrack_amanda. Gabriel Regards, Michal -- LOG http://www.stardust.webpages.pl/log/ - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
On Fri, 10 Aug 2007 11:33:41 -0600 [EMAIL PROTECTED] (Eric W. Biederman) wrote: Gabriel C [EMAIL PROTECTED] writes: Michal Piotrowski wrote: Andrew Morton pisze: ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.23-rc2/2.6.23-rc2-mm2/ Yet another sysctl table check failed [ 88.949055] Netfilter messages via NETLINK v0.30. [ 89.485399] nf_conntrack version 0.5.0 (16384 buckets, 65536 max) [ 89.491892] sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy [ 89.558178] xt_state: Unknown symbol nf_conntrack_untracked [ 89.563942] xt_state: Unknown symbol nf_ct_l3proto_module_put [ 89.569870] xt_state: Unknown symbol nf_ct_l3proto_try_module_get [ 90.852319] NET: Registered protocol family 17 And maybe this ? Could be. sysctl table check failure looks legitimate. At a quick skim I can't tell for certain if failure to register the sysctl table will keep the module from loading but it might. It looks like another canidate on the pile of kill the broken binary sysctl. There seems to be rather a lot of damage here. I assume that the sysctl changes are what caused the netfilter oopses. - nf_conntrack_init() calls nf_conntrack_expect_init() which fails due to sysctl problems. - nf_conntrack_init() bales out without calling nf_conntrack_helper_init() So nf_ct_helper_hsize never gets initialised. - Later, netfilter client code calls helper_hash(), which gets a divide-by-zero due to nf_ct_helper_hsize==0. yeah, that's a netfilter bug, but we're trying to get kernels tested here. If I'm feeling energetic I'll drop the sysctl changes and do rc2-mm3. Probably I won't feel energetic, but we'll need a lot of fixes here before I can release the sysctl changes in another -mm, please. - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
On 8/10/07, Dave Young [EMAIL PROTECTED] wrote: On 8/10/07, Andrew Morton [EMAIL PROTECTED] wrote: On Fri, 10 Aug 2007 16:18:38 +0800 Dave Young [EMAIL PROTECTED] wrote: Hi, 2.6.23-rc2-mm2: config with CONFIG_NF_CONNTRACK_AMANDA=y boot oops: hand copyed : the EIP: helper_hash backtrace: do_basic_setup ... nf_conntrack_amanda_init error_code do_vivide_error ... cc netdev please send config presumably nf_ct_helper_hsize is zero I'd have thought that the same bug would be present in mainline - there's nothing in git-net.patch. (hunts around, discovers that there's a net-2.6.24 tree now. Bah) MY config file for test: -cut--- Hi, config the CONFIG_NF_CONNTRACK_AMANDA=m, reboot , still oops, but this time the point is nf_conntrack_ftp_init, (I doubt the nf_conntrack_core is not initialized.) boot with vga_ask and select 80x60, copyed more infomations: -pre oops:--- sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy ... nf_conntrack_14_proto_sctp4 protocol register failed ... -oops: divide error: [#1] SMP EIP: 0060: [c088477b] EFLAGS 00010246 Process swapper ... nf_conntrack_ftp_init ... Code: ... EIP ... helper_hash +0x1b/0x24 Regards dave - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2: possible irq lock inversion dependency detected
Hello, Have fun :) = [ INFO: possible irq lock inversion dependency detected ] 2.6.23-rc2-mm2 #1 - depscan.sh/5928 just changed the state of lock: (_xmit_ETHER){-+..}, at: [c03cb669] dev_watchdog+0x17/0xcc but this lock took another, soft-irq-unsafe lock in the past: (tp-lock){--..} and interrupts could create inverse lock ordering between them. other info that might help us debug this: no locks held by depscan.sh/5928. the first lock's dependencies: - (_xmit_ETHER){-+..} ops: 27 { initial-use at: [c0138ea9] __lock_acquire+0x217/0x11ac [c0139ed7] lock_acquire+0x99/0xb2 [c045282a] _spin_lock_bh+0x3a/0x47 [c03bc0a6] dev_set_rx_mode+0x14/0x3b [c03bc5af] dev_change_flags+0x68/0x190 [c03fcb5c] devinet_ioctl+0x4af/0x652 [c03fd442] inet_ioctl+0x56/0x71 [c03b152a] sock_ioctl+0xa5/0x1d4 [c0178a52] do_ioctl+0x22/0x71 [c0178af6] vfs_ioctl+0x55/0x29e [c0178d72] sys_ioctl+0x33/0x69 [c01041da] sysenter_past_esp+0x5f/0x99 [] 0x in-softirq-W at: [c0139384] __lock_acquire+0x6f2/0x11ac [c0139ed7] lock_acquire+0x99/0xb2 [c04527e3] _spin_lock+0x35/0x42 [c03cb669] dev_watchdog+0x17/0xcc [c01224b7] run_timer_softirq+0x14b/0x1a9 [c011ecc2] __do_softirq+0x5b/0xb2 [c011ed66] do_softirq+0x4d/0x4f [c011f04b] irq_exit+0x48/0x4a [c01058f8] do_IRQ+0x98/0xd0 [c010444e] common_interrupt+0x2e/0x34 [] 0x hardirq-on-W at: [c01393d0] __lock_acquire+0x73e/0x11ac [c0139ed7] lock_acquire+0x99/0xb2 [c045282a] _spin_lock_bh+0x3a/0x47 [c03bc0a6] dev_set_rx_mode+0x14/0x3b [c03bc5af] dev_change_flags+0x68/0x190 [c03fcb5c] devinet_ioctl+0x4af/0x652 [c03fd442] inet_ioctl+0x56/0x71 [c03b152a] sock_ioctl+0xa5/0x1d4 [c0178a52] do_ioctl+0x22/0x71 [c0178af6] vfs_ioctl+0x55/0x29e [c0178d72] sys_ioctl+0x33/0x69 [c01041da] sysenter_past_esp+0x5f/0x99 [] 0x } ... key at: [c087e9e8] netdev_xmit_lock_key+0x8/0x1c0 - (tp-lock){--..} ops: 44 { initial-use at: [c0138ea9] __lock_acquire+0x217/0x11ac [c0139ed7] lock_acquire+0x99/0xb2 [c04527e3] _spin_lock+0x35/0x42 [de84d6e2] rtl8139_interrupt+0x27/0x469 [8139too] [c0148b92] request_irq+0xba/0x108 [de84e52d] rtl8139_open+0x2f/0x1e2 [8139too] [c03be80d] dev_open+0x37/0x76 [c03bc5d5] dev_change_flags+0x8e/0x190 [c03fcb5c] devinet_ioctl+0x4af/0x652 [c03fd442] inet_ioctl+0x56/0x71 [c03b152a] sock_ioctl+0xa5/0x1d4 [c0178a52] do_ioctl+0x22/0x71 [c0178af6] vfs_ioctl+0x55/0x29e [c0178d72] sys_ioctl+0x33/0x69 [c01041da] sysenter_past_esp+0x5f/0x99 [] 0x softirq-on-W at: [c01393f9] __lock_acquire+0x767/0x11ac [c0139ed7] lock_acquire+0x99/0xb2 [c04527e3] _spin_lock+0x35/0x42 [de84d6e2] rtl8139_interrupt+0x27/0x469 [8139too] [c01487ed] free_irq+0x11b/0x146 [de84ec90] rtl8139_close+0x8a/0x14a [8139too] [c03bd5d3] dev_close+0x57/0x74 [c03bc5d5] dev_change_flags+0x8e/0x190 [c03fcb5c] devinet_ioctl+0x4af/0x652 [c03fd442] inet_ioctl+0x56/0x71 [c03b152a] sock_ioctl+0xa5/0x1d4 [c0178a52] do_ioctl+0x22/0x71 [c0178af6] vfs_ioctl+0x55/0x29e [c0178d72] sys_ioctl+0x33/0x69 [c01041da] sysenter_past_esp+0x5f/0x99 [] 0x hardirq-on-W at: [c01393d0] __lock_acquire+0x73e/0x11ac
Re: 2.6.23-rc2-mm2
On Fri, 10 Aug 2007 16:18:38 +0800 Dave Young [EMAIL PROTECTED] wrote: Hi, 2.6.23-rc2-mm2: config with CONFIG_NF_CONNTRACK_AMANDA=y boot oops: hand copyed : the EIP: helper_hash backtrace: do_basic_setup ... nf_conntrack_amanda_init error_code do_vivide_error ... cc netdev please send config presumably nf_ct_helper_hsize is zero I'd have thought that the same bug would be present in mainline - there's nothing in git-net.patch. (hunts around, discovers that there's a net-2.6.24 tree now. Bah) - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2 -- conntrack divide error
Gabriel C napsal(a): Dave Young wrote: On 8/10/07, Dave Young [EMAIL PROTECTED] wrote: On 8/10/07, Andrew Morton [EMAIL PROTECTED] wrote: On Fri, 10 Aug 2007 16:18:38 +0800 Dave Young [EMAIL PROTECTED] wrote: Hi, 2.6.23-rc2-mm2: config with CONFIG_NF_CONNTRACK_AMANDA=y boot oops: hand copyed : the EIP: helper_hash backtrace: do_basic_setup ... nf_conntrack_amanda_init error_code do_vivide_error ... cc netdev please send config presumably nf_ct_helper_hsize is zero I think so. 2.6.23-rc2-mm1 seemed be OK. I'd have thought that the same bug would be present in mainline - there's nothing in git-net.patch. (hunts around, discovers that there's a net-2.6.24 tree now. Bah) MY config file for test: -cut--- Hi, config the CONFIG_NF_CONNTRACK_AMANDA=m, reboot , still oops, but this time the point is nf_conntrack_ftp_init, (I doubt the nf_conntrack_core is not initialized.) boot with vga_ask and select 80x60, copyed more infomations: -pre oops:--- sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy ... nf_conntrack_14_proto_sctp4 protocol register failed ... -oops: divide error: [#1] SMP EIP: 0060: [c088477b] EFLAGS 00010246 Process swapper ... nf_conntrack_ftp_init ... Code: ... EIP ... helper_hash +0x1b/0x24 Does not Oops here but is broken. My $0.02: http://www.fi.muni.cz/~xslaby/sklad/panics/nf_oops.png http://www.fi.muni.cz/~xslaby/sklad/panics/nf_oops.config -- Jiri Slaby ([EMAIL PROTECTED]) Faculty of Informatics, Masaryk University - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
Andrew Morton [EMAIL PROTECTED] writes: There seems to be rather a lot of damage here. I assume that the sysctl changes are what caused the netfilter oopses. - nf_conntrack_init() calls nf_conntrack_expect_init() which fails due to sysctl problems. In particular sysctl_check_table finds issues with the sysctl table so register_sysctl_table refuses to register it. - nf_conntrack_init() bales out without calling nf_conntrack_helper_init() So nf_ct_helper_hsize never gets initialised. - Later, netfilter client code calls helper_hash(), which gets a divide-by-zero due to nf_ct_helper_hsize==0. yeah, that's a netfilter bug, but we're trying to get kernels tested here. If I'm feeling energetic I'll drop the sysctl changes and do rc2-mm3. Probably I won't feel energetic, but we'll need a lot of fixes here before I can release the sysctl changes in another -mm, please. As a cheap workaround it should be possible to disable SYSCTL support in 2.6.23-rc2-mm2 to get around these issues. Andrew for the moment I have just sent you fixes for all of the issues that I am aware of. Mostly they are cheap kill the sys_sysctl() support patches. Hopefully that is enough to bring the pain level down to manageable. I hadn't anticipated subsystems failing because they could not register their sysctl tables. I was simply expecting things not to show up in /proc/sys. And more of the pain of making working sysctl tables to be pushed back to developers. Eric - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html
Re: 2.6.23-rc2-mm2
Andrew Morton [EMAIL PROTECTED] writes: On Fri, 10 Aug 2007 11:33:41 -0600 [EMAIL PROTECTED] (Eric W. Biederman) wrote: Gabriel C [EMAIL PROTECTED] writes: Michal Piotrowski wrote: Andrew Morton pisze: ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.23-rc2/2.6.23-rc2-mm2/ Yet another sysctl table check failed [ 88.949055] Netfilter messages via NETLINK v0.30. [ 89.485399] nf_conntrack version 0.5.0 (16384 buckets, 65536 max) [ 89.491892] sysctl table check failed: /net/netfilter/nf_conntrack_generic_timeout .3.19.13 Missing strategy [ 89.558178] xt_state: Unknown symbol nf_conntrack_untracked [ 89.563942] xt_state: Unknown symbol nf_ct_l3proto_module_put [ 89.569870] xt_state: Unknown symbol nf_ct_l3proto_try_module_get [ 90.852319] NET: Registered protocol family 17 And maybe this ? Could be. sysctl table check failure looks legitimate. At a quick skim I can't tell for certain if failure to register the sysctl table will keep the module from loading but it might. It looks like another canidate on the pile of kill the broken binary sysctl. There seems to be rather a lot of damage here. I assume that the sysctl changes are what caused the netfilter oopses. - nf_conntrack_init() calls nf_conntrack_expect_init() which fails due to sysctl problems. - nf_conntrack_init() bales out without calling nf_conntrack_helper_init() So nf_ct_helper_hsize never gets initialised. - Later, netfilter client code calls helper_hash(), which gets a divide-by-zero due to nf_ct_helper_hsize==0. yeah, that's a netfilter bug, but we're trying to get kernels tested here. If I'm feeling energetic I'll drop the sysctl changes and do rc2-mm3. Probably I won't feel energetic, but we'll need a lot of fixes here before I can release the sysctl changes in another -mm, please. Under construction. Eric - To unsubscribe from this list: send the line unsubscribe netdev in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html