Re: ext3 SMP bug ? PANIC in __d_find_alias

2007-12-12 Thread Mitch

It is on:

$ uname -a
Linux home 2.6.23 #5 SMP PREEMPT Sun Oct 21 23:08:50 GST 2007 i686 
unknown unknown GNU/Linux


And yes it happened on previous kernels also at least since .21
I've had 6 panics so far randomly, but generally when doing a "updatedb" 
(from find(1)) which seems to trigger it ever so often if there is other 
activity also going on.


M

 Original Message ----
Subject: Re: ext3 SMP bug  ?  PANIC in __d_find_alias
Date: Wed, 12 Dec 2007 20:36:40 +0100
From: Rafael J. Wysocki <[EMAIL PROTECTED]>
To: Mitch <[EMAIL PROTECTED]>
CC: [EMAIL PROTECTED], linux-ext4@vger.kernel.org
References: <[EMAIL PROTECTED]>

[Added CC to [EMAIL PROTECTED]

On Wednesday, 12 of December 2007, Mitch wrote:
Can anyone help with this ? This seems to be a true SMP bug - the same 
kernel on another UP machine is working fine (although different h/w). 
Seems like stress (find for example) can easily trigger this. Does it 
look like i have a bad filesystem ? Can anyone help me figure out which 
one ? The fact that this is tainted (due to nvidia) is a red herring i 
think because both my machines (the SMP and UP one) are using the same 
nvidia module and the panic is in ext3 code.


Which kernel is this?

Did it happen with any previous kernel?


Dec 10 03:02:43 home kernel: BUG: unable to handle kernel NULL pointer 
dereference at virtual address 

Dec 10 03:02:43 home kernel:  printing eip:
Dec 10 03:02:43 home kernel: c01761fc
Dec 10 03:02:43 home kernel: *pdpt = 198a6001
Dec 10 03:02:43 home kernel: *pde = 
Dec 10 03:02:43 home kernel: Oops:  [#1]
Dec 10 03:02:43 home kernel: PREEMPT SMP
Dec 10 03:02:43 home kernel: Modules linked in: loop nls_iso8859_1 
nls_cp437 vfat fat tun iptable_nat nvidia(P) appletalk psnap llc nfsd expo
rtfs lockd sunrpc xt_limit xt_tcpudp iptable_mangle ipt_LOG 
ipt_MASQUERADE nf_nat ipt_TOS ipt_REJECT nf_conntrack_irc 
nf_conntrack_ftp nf_con
ntrack_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables 
ftdi_sio usbserial forcedeth snd_hda_intel snd_seq_oss snd_seq_midi_event
  snd_seq snd_seq_device snd_pcm_oss snd_pcm snd_timer snd_page_alloc 
snd_mixer_oss snd usb_storage ehci_hcd ohci_hcd it87 hwmon_vid i2c_dev i

2c_core
Dec 10 03:02:43 home kernel: CPU:1
Dec 10 03:02:43 home kernel: EIP:0060:[__d_find_alias+44/192] 
Tainted: PVLI

Dec 10 03:02:43 home kernel: EFLAGS: 00010282   (2.6.23 #5)
Dec 10 03:02:43 home kernel: EIP is at __d_find_alias+0x2c/0xc0
Dec 10 03:02:43 home kernel: eax:    ebx: c03579bc   ecx: 
   edx: 4000
Dec 10 03:02:44 home kernel: esi: f55d58bc   edi:    ebp: 
0001   esp: d479dda4
Dec 10 03:02:44 home kernel: ds: 007b   es: 007b   fs: 00d8  gs: 0033 
ss: 0068
Dec 10 03:02:44 home kernel: Process find (pid: 8233, ti=d479c000 
task=f6d35ab0 task.ti=d479c000)
Dec 10 03:02:44 home kernel: Stack: f55d58a4 ebf42f00 f6735800 ebf42f00 
c017832f f55d58a4 ebf42f00 f6735800
Dec 10 03:02:44 home kernel:c01ad386 c0177755 ebf42f60 d479de38 
ebf42f00 e85bf2fc c0357e80 ebf42f00
Dec 10 03:02:44 home kernel:d479df04 c016d242 d479de44 f7c04740 
f1352a98 f1352b0c d479de38 00034c98

Dec 10 03:02:44 home kernel: Call Trace:
Dec 10 03:02:44 home kernel:  [d_splice_alias+95/208] 
d_splice_alias+0x5f/0xd0

Dec 10 03:02:44 home kernel:  [ext3_lookup+230/288] ext3_lookup+0xe6/0x120
Dec 10 03:02:44 home kernel:  [d_alloc+309/416] d_alloc+0x135/0x1a0
Dec 10 03:02:44 home kernel:  [do_lookup+290/416] do_lookup+0x122/0x1a0
Dec 10 03:02:44 home kernel:  [__link_path_walk+1873/3408] 
__link_path_walk+0x751/0xd50
Dec 10 03:02:44 home kernel:  [link_path_walk+101/192] 
link_path_walk+0x65/0xc0
Dec 10 03:02:44 home kernel:  [link_path_walk+69/192] 
link_path_walk+0x45/0xc0
Dec 10 03:02:44 home kernel:  [nameidata_to_filp+53/64] 
nameidata_to_filp+0x35/0x40

Dec 10 03:02:44 home kernel:  [do_filp_open+75/96] do_filp_open+0x4b/0x60
Dec 10 03:02:44 home kernel:  [do_path_lookup+120/448] 
do_path_lookup+0x78/0x1c0

Dec 10 03:02:44 home kernel:  [getname+160/192] getname+0xa0/0xc0
Dec 10 03:02:44 home kernel:  [__user_walk_fd+59/96] 
__user_walk_fd+0x3b/0x60

Dec 10 03:02:44 home kernel:  [vfs_lstat_fd+31/80] vfs_lstat_fd+0x1f/0x50
Dec 10 03:02:44 home kernel:  [nameidata_to_filp+53/64] 
nameidata_to_filp+0x35/0x40

Dec 10 03:02:44 home kernel:  [do_filp_open+75/96] do_filp_open+0x4b/0x60
Dec 10 03:02:44 home kernel:  [sys_lstat64+15/48] sys_lstat64+0xf/0x30
Dec 10 03:02:44 home kernel:  [__fput+257/352] __fput+0x101/0x160
Dec 10 03:02:44 home kernel:  [mntput_no_expire+19/96] 
mntput_no_expire+0x13/0x60

Dec 10 03:02:44 home kernel:  [filp_close+71/128] filp_close+0x47/0x80
Dec 10 03:02:44 home kernel:  [sys_close+102/208] sys_close+0x66/0xd0
Dec 10 03:02:44 home kernel:  [sysenter_past_esp+95/133] 
sysenter_past_esp+0x5f/0x85

Dec 10 03:02:44 home kernel:  ===
Dec 10 03:02:44 home kernel: Code: 89 c1 89 d5 57 56 8d 70 18 53 8b 

Re: ext3 SMP bug ? PANIC in __d_find_alias

2007-12-12 Thread Rafael J. Wysocki
[Added CC to [EMAIL PROTECTED]

On Wednesday, 12 of December 2007, Mitch wrote:
> Can anyone help with this ? This seems to be a true SMP bug - the same 
> kernel on another UP machine is working fine (although different h/w). 
> Seems like stress (find for example) can easily trigger this. Does it 
> look like i have a bad filesystem ? Can anyone help me figure out which 
> one ? The fact that this is tainted (due to nvidia) is a red herring i 
> think because both my machines (the SMP and UP one) are using the same 
> nvidia module and the panic is in ext3 code.

Which kernel is this?

Did it happen with any previous kernel?


> Dec 10 03:02:43 home kernel: BUG: unable to handle kernel NULL pointer 
> dereference at virtual address 
> Dec 10 03:02:43 home kernel:  printing eip:
> Dec 10 03:02:43 home kernel: c01761fc
> Dec 10 03:02:43 home kernel: *pdpt = 198a6001
> Dec 10 03:02:43 home kernel: *pde = 
> Dec 10 03:02:43 home kernel: Oops:  [#1]
> Dec 10 03:02:43 home kernel: PREEMPT SMP
> Dec 10 03:02:43 home kernel: Modules linked in: loop nls_iso8859_1 
> nls_cp437 vfat fat tun iptable_nat nvidia(P) appletalk psnap llc nfsd expo
> rtfs lockd sunrpc xt_limit xt_tcpudp iptable_mangle ipt_LOG 
> ipt_MASQUERADE nf_nat ipt_TOS ipt_REJECT nf_conntrack_irc 
> nf_conntrack_ftp nf_con
> ntrack_ipv4 xt_state nf_conntrack iptable_filter ip_tables x_tables 
> ftdi_sio usbserial forcedeth snd_hda_intel snd_seq_oss snd_seq_midi_event
>   snd_seq snd_seq_device snd_pcm_oss snd_pcm snd_timer snd_page_alloc 
> snd_mixer_oss snd usb_storage ehci_hcd ohci_hcd it87 hwmon_vid i2c_dev i
> 2c_core
> Dec 10 03:02:43 home kernel: CPU:1
> Dec 10 03:02:43 home kernel: EIP:0060:[__d_find_alias+44/192] 
> Tainted: PVLI
> Dec 10 03:02:43 home kernel: EFLAGS: 00010282   (2.6.23 #5)
> Dec 10 03:02:43 home kernel: EIP is at __d_find_alias+0x2c/0xc0
> Dec 10 03:02:43 home kernel: eax:    ebx: c03579bc   ecx: 
>    edx: 4000
> Dec 10 03:02:44 home kernel: esi: f55d58bc   edi:    ebp: 
> 0001   esp: d479dda4
> Dec 10 03:02:44 home kernel: ds: 007b   es: 007b   fs: 00d8  gs: 0033 
> ss: 0068
> Dec 10 03:02:44 home kernel: Process find (pid: 8233, ti=d479c000 
> task=f6d35ab0 task.ti=d479c000)
> Dec 10 03:02:44 home kernel: Stack: f55d58a4 ebf42f00 f6735800 ebf42f00 
> c017832f f55d58a4 ebf42f00 f6735800
> Dec 10 03:02:44 home kernel:c01ad386 c0177755 ebf42f60 d479de38 
> ebf42f00 e85bf2fc c0357e80 ebf42f00
> Dec 10 03:02:44 home kernel:d479df04 c016d242 d479de44 f7c04740 
> f1352a98 f1352b0c d479de38 00034c98
> Dec 10 03:02:44 home kernel: Call Trace:
> Dec 10 03:02:44 home kernel:  [d_splice_alias+95/208] 
> d_splice_alias+0x5f/0xd0
> Dec 10 03:02:44 home kernel:  [ext3_lookup+230/288] ext3_lookup+0xe6/0x120
> Dec 10 03:02:44 home kernel:  [d_alloc+309/416] d_alloc+0x135/0x1a0
> Dec 10 03:02:44 home kernel:  [do_lookup+290/416] do_lookup+0x122/0x1a0
> Dec 10 03:02:44 home kernel:  [__link_path_walk+1873/3408] 
> __link_path_walk+0x751/0xd50
> Dec 10 03:02:44 home kernel:  [link_path_walk+101/192] 
> link_path_walk+0x65/0xc0
> Dec 10 03:02:44 home kernel:  [link_path_walk+69/192] 
> link_path_walk+0x45/0xc0
> Dec 10 03:02:44 home kernel:  [nameidata_to_filp+53/64] 
> nameidata_to_filp+0x35/0x40
> Dec 10 03:02:44 home kernel:  [do_filp_open+75/96] do_filp_open+0x4b/0x60
> Dec 10 03:02:44 home kernel:  [do_path_lookup+120/448] 
> do_path_lookup+0x78/0x1c0
> Dec 10 03:02:44 home kernel:  [getname+160/192] getname+0xa0/0xc0
> Dec 10 03:02:44 home kernel:  [__user_walk_fd+59/96] 
> __user_walk_fd+0x3b/0x60
> Dec 10 03:02:44 home kernel:  [vfs_lstat_fd+31/80] vfs_lstat_fd+0x1f/0x50
> Dec 10 03:02:44 home kernel:  [nameidata_to_filp+53/64] 
> nameidata_to_filp+0x35/0x40
> Dec 10 03:02:44 home kernel:  [do_filp_open+75/96] do_filp_open+0x4b/0x60
> Dec 10 03:02:44 home kernel:  [sys_lstat64+15/48] sys_lstat64+0xf/0x30
> Dec 10 03:02:44 home kernel:  [__fput+257/352] __fput+0x101/0x160
> Dec 10 03:02:44 home kernel:  [mntput_no_expire+19/96] 
> mntput_no_expire+0x13/0x60
> Dec 10 03:02:44 home kernel:  [filp_close+71/128] filp_close+0x47/0x80
> Dec 10 03:02:44 home kernel:  [sys_close+102/208] sys_close+0x66/0xd0
> Dec 10 03:02:44 home kernel:  [sysenter_past_esp+95/133] 
> sysenter_past_esp+0x5f/0x85
> Dec 10 03:02:44 home kernel:  ===
> Dec 10 03:02:44 home kernel: Code: 89 c1 89 d5 57 56 8d 70 18 53 8b 40 
> 18 31 db 39 c6 74 6c 0f b7 51 6a 31 ff 81 e2 00 f0 00 00 eb 0a 85 ed 7
> 4 6a 39 ce 74 2e 89 c8 <8b> 08 0f 18 01 90 81 fa 00 40 00 00 8d 58 bc 74 
> 06 f6 43 04 10
> Dec 10 03:02:44 home kernel: EIP: [__d_find_alias+44/192] 
> __d_find_alias+0x2c/0xc0 SS:ESP 0068:d479dda4
> Dec 10 03:02:44 home kernel: note: find[8233] exited with preempt_count 1
> Dec 10 03:02:44 home kernel: BUG: scheduling while atomic: 
> find/0x1002/8233
> Dec 10 03:02:44 home kernel:  [schedule+1474/1728] 
> __sched_text_start+0x5c2/0x6c0
> Dec 10