Here,
i suspect memory corruption as the identical kernel runs on
identical hardware with identical installation on 60+ Machines.

Before the crash i see this in the syslog ..

Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000900)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000800)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000a00)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000800)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000800)
Sep 30 14:41:36 mgr2 kernel: swap_free: swap-space map bad (entry 00000100)
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000900
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000500
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000c00
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000900
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000300
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000a00
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000800
Sep 30 14:41:36 mgr2 kernel: free_one_pmd: bad directory entry 00000100

(flo@ping)/tmp# ksymoops -k /dev/null -l /dev/null -m /tmp/System.map-2.2.17 -o /tmp 
</tmp/oops 
ksymoops 2.3.4 on i686 2.2.16.  Options used
     -V (default)
     -k /dev/null (specified)
     -l /dev/null (specified)
     -o /tmp (specified)
     -m /tmp/System.map-2.2.17 (specified)

Error (regular_file): read_ksyms /dev/null is not a regular file, ignored
No modules in ksyms, skipping objects
No ksyms, skipping lsmod
Sep 30 14:41:36 mgr2 kernel: Unable to handle kernel NULL pointer dereference at 
virtual address 000005ed
Sep 30 14:41:36 mgr2 kernel: current->tss.cr3 = 07aea000, %cr3 = 07aea000
Sep 30 14:41:36 mgr2 kernel: *pde = 00000000
Sep 30 14:41:36 mgr2 kernel: Oops: 0000
Sep 30 14:41:36 mgr2 kernel: CPU:    0
Sep 30 14:41:36 mgr2 kernel: EIP:    0010:[free_wait+49/116]
Sep 30 14:41:36 mgr2 kernel: EFLAGS: 00010087
Sep 30 14:41:36 mgr2 kernel: eax: 00000006   ebx: c44bf160   ecx: 000005e9   edx: 
000005e9
Sep 30 14:41:36 mgr2 kernel: esi: c44bf15c   edi: c44bf000   ebp: 00000287   esp: 
c7a43eec
Sep 30 14:41:36 mgr2 kernel: ds: 0018   es: 0018   ss: 0018
Sep 30 14:41:36 mgr2 kernel: Process named (pid: 141, process nr: 9, 
stackpage=c7a43000)
Sep 30 14:41:36 mgr2 kernel: Stack: 00800000 00000018 00000004 00000000 00000000 
c012e0ab c44bf000 00000000 
Sep 30 14:41:36 mgr2 kernel:        00000018 c012e07c 00000001 00000004 c08f5438 
00000000 00000001 c08f5438 
Sep 30 14:41:36 mgr2 kernel:        de826e24 c7a42000 c7a42000 00000000 00000000 
c44bf000 00000009 c2ad7000 
Sep 30 14:41:36 mgr2 kernel: Call Trace: [do_select+487/512] [do_select+440/512] 
[sys_select+881/1176] [sys_gettimeofday+32/148] [system_call+52/56] 
Sep 30 14:41:36 mgr2 kernel: Code: 8b 41 04 39 d8 74 09 89 c2 8b 42 04 39 d8 75 f7 89 
4a 04 55 
Using defaults from ksymoops -t elf32-i386 -a i386

Code;  00000000 Before first symbol
00000000 <_EIP>:
Code;  00000000 Before first symbol
   0:   8b 41 04                  mov    0x4(%ecx),%eax
Code;  00000003 Before first symbol
   3:   39 d8                     cmp    %ebx,%eax
Code;  00000005 Before first symbol
   5:   74 09                     je     10 <_EIP+0x10> 00000010 Before first symbol
Code;  00000007 Before first symbol
   7:   89 c2                     mov    %eax,%edx
Code;  00000009 Before first symbol
   9:   8b 42 04                  mov    0x4(%edx),%eax
Code;  0000000c Before first symbol
   c:   39 d8                     cmp    %ebx,%eax
Code;  0000000e Before first symbol
   e:   75 f7                     jne    7 <_EIP+0x7> 00000007 Before first symbol
Code;  00000010 Before first symbol
  10:   89 4a 04                  mov    %ecx,0x4(%edx)
Code;  00000013 Before first symbol
  13:   55                        push   %ebp

1 error issued.  Results may not be reliable.
You have new mail in /var/spool/mail/flo

And directly afterwards:

(flo@ping)/usr/src/linux# ksymoops -k /dev/null -l /dev/null -m /tmp/System.map-2.2.17 
-o /tmp </tmp/oops2
ksymoops 2.3.4 on i686 2.2.16.  Options used
     -V (default)
     -k /dev/null (specified)
     -l /dev/null (specified)
     -o /tmp (specified)
     -m /tmp/System.map-2.2.17 (specified)

Error (regular_file): read_ksyms /dev/null is not a regular file, ignored
No modules in ksyms, skipping objects
No ksyms, skipping lsmod
Sep 30 14:41:36 mgr2 kernel: Unable to handle kernel NULL pointer dereference at 
virtual address 00000804
Sep 30 14:41:36 mgr2 kernel: current->tss.cr3 = 06ea4000, %cr3 = 06ea4000
Sep 30 14:41:36 mgr2 kernel: *pde = 00000000
Sep 30 14:41:36 mgr2 kernel: Oops: 0000
Sep 30 14:41:36 mgr2 kernel: CPU:    0
Sep 30 14:41:36 mgr2 kernel: EIP:    0010:[free_wait+16/116]
Sep 30 14:41:36 mgr2 kernel: EFLAGS: 00010206
Sep 30 14:41:36 mgr2 kernel: eax: c0296608   ebx: 00000000   ecx: c01cf940   edx: 
c0262ec0
Sep 30 14:41:36 mgr2 kernel: esi: c468d10c   edi: 00000800   ebp: 00000000   esp: 
c5d7deec
Sep 30 14:41:36 mgr2 kernel: ds: 0018   es: 0018   ss: 0018
Sep 30 14:41:36 mgr2 kernel: Process usrd (pid: 30075, process nr: 63, 
stackpage=c5d7d000)
Sep 30 14:41:36 mgr2 kernel: Stack: c60edbc0 00000000 00000000 00000000 00000000 
c012e0ab c468d000 c0110528 
Sep 30 14:41:36 mgr2 kernel:        00000000 c012e07c 00000000 00000000 c60edbc0 
00000000 c2070900 00000001 
Sep 30 14:41:36 mgr2 kernel:        08112be0 00000282 c5d7c000 00000000 00000000 
c468d000 0000030a c011acf3 
Sep 30 14:41:36 mgr2 kernel: Call Trace: [do_select+487/512] [process_timeout+0/20] 
[do_select+440/512] [handle_mm_fault+279/316] [sys_select+881/1176] 
[do_page_fault+263/992] [error_code+45/52] 
Sep 30 14:41:36 mgr2 kernel: Code: 8b 47 04 89 c6 c1 e6 04 03 77 08 85 c0 74 3b 90 ff 
4f 04 83 
Using defaults from ksymoops -t elf32-i386 -a i386

Code;  00000000 Before first symbol
00000000 <_EIP>:
Code;  00000000 Before first symbol
   0:   8b 47 04                  mov    0x4(%edi),%eax
Code;  00000003 Before first symbol
   3:   89 c6                     mov    %eax,%esi
Code;  00000005 Before first symbol
   5:   c1 e6 04                  shl    $0x4,%esi
Code;  00000008 Before first symbol
   8:   03 77 08                  add    0x8(%edi),%esi
Code;  0000000b Before first symbol
   b:   85 c0                     test   %eax,%eax
Code;  0000000d Before first symbol
   d:   74 3b                     je     4a <_EIP+0x4a> 0000004a Before first symbol
Code;  0000000f Before first symbol
   f:   90                        nop    
Code;  00000010 Before first symbol
  10:   ff 4f 04                  decl   0x4(%edi)
Code;  00000013 Before first symbol
  13:   83 00 00                  addl   $0x0,(%eax)


1 error issued.  Results may not be reliable.


Other suspicions ?

Flo
-- 
Florian Lohoff          [EMAIL PROTECTED]                  +49-5201-669912
      "Write only memory - Oops. Time for my medication again ..."

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
Please read the FAQ at http://www.tux.org/lkml/

Reply via email to