Re: Memory management issues with 2.4.4

2001-05-04 Thread Jorge Nerin

Marcelo Tosatti wrote:

 >
 > On Wed, 2 May 2001, Jorge Nerin wrote:
 >
 >> Short version:
 >> Under very heavy thrashing (about four hours) the system either lockups
 >> or OOM handler kills a task even when there is swap space left.
 >
 >
 > First of all, please try to reproduce the problem with 2.4.5-pre1.
 >
 > If it still happens with pre1, please show us the output of "cat
 > /proc/slabinfo" when the kernel is about to trigger the OOM killer.
 >
 > Thanks.
 >


Well, as I had said this morning I have feed the Oops to ksymoops, note 
that I may have mirtyped something, but anyway here is the output of 
ksymoops:


ksymoops 2.3.4 on i586 2.4.5-pre1.  Options used
  -V (default)
  -k /proc/ksyms (default)
  -l /proc/modules (default)
  -o /lib/modules/2.4.5-pre1/ (default)
  -m /usr/src/linux/System.map (default)

Warning: You did not tell me where to find symbol information.  I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc.  ksymoops -h explains the options.

Warning (compare_maps): ksyms_base symbol 
__VERSIONED_SYMBOL(shmem_file_setup) not found in System.map.  Ignoring 
ksyms_base entry
invalid operand: 
CPU: 1
EIP: 0010:[]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010096
eax: 001b ebx:  ecx: 0001 edx: 0001
esi: c021b960 edi: c5fe2000 ebp: c5fe3ce0 esp: c5fe3c88
Stack: c01e89a5 c01c8af6 02c5  c021b960 c021b954  
0001
0020 00cc c02961c0 0120 c012ca08 c5fe2000  
c216960
c21b954  c5fe2000 0080 0001 c5fe2000 0001 0001 
c12da64
Call Trace:   [] [] [] [] 
[] [] []
  [] [] [] [] [] 
[] [] []
  [] [] [] [] [] 
[] [] []
  [] [] [] [] [] 
[] [] []
Code: 0f 0b 8d 65 b4 5b 5f 89 ec 5d c3 55 89 e5 83 ec 18 57 56

 >>EIP; 0c0123c4 Before first symbol   <=
Trace; c012ca08 
Trace; c012da64 <__alloc_pages+16c/2c0>
Trace; c012dbcc <__get_free_pages+14/20>
Trace; c0113bbb 
Trace; c01058c9 
Trace; c0106d07 
Trace; c010e568 
Trace; c01054ea 
Trace; c010e591 
Trace; c010e568 
Trace; c01746de 
Trace; c0172dfd 
Trace; c0173fcc 
Trace; c017405c 
Trace; c0108516 
Trace; c0108709 
Trace; c0106de0 
Trace; c010fc60 
Trace; c0129e78 
Trace; c0129c78 
Trace; c0129e52 
Trace; c0129e78 
Trace; c0129f56 
Trace; c0129e78 
Trace; c0129f83 <__kmem_cache_shrink+b/6c>
Trace; c0129a29 
Trace; c01471a4 
Trace; c012cc9b 
Trace; c012cd2b 
Trace; c0105000 
Trace; c01054f3 
Code;  0c0123c4 Before first symbol
 <_EIP>:
Code;  0c0123c4 Before first symbol   <=
0:   0f 0b ud2a  <=
Code;  0c0123c6 Before first symbol
2:   8d 65 b4  lea0xffb4(%ebp),%esp
Code;  0c0123c9 Before first symbol
5:   5bpop%ebx
Code;  0c0123ca Before first symbol
6:   5fpop%edi
Code;  0c0123cb Before first symbol
7:   89 ec mov%ebp,%esp
Code;  0c0123cd Before first symbol
9:   5dpop%ebp
Code;  0c0123ce Before first symbol
a:   c3ret
Code;  0c0123cf Before first symbol
b:   55push   %ebp
Code;  0c0123d0 Before first symbol
c:   89 e5 mov%esp,%ebp
Code;  0c0123d2 Before first symbol
e:   83 ec 18  sub$0x18,%esp
Code;  0c0123d5 Before first symbol
   11:   57push   %edi
Code;  0c0123d6 Before first symbol
   12:   56push   %esi

Kernel panic Aiee killing interrupt handler

2 warnings issued.  Results may not be reliable.

As I said I have tried to no make errors, but I copied it at 7 in the 
morning, so who knows ... ;-)

-- 
Jorge Nerin
<[EMAIL PROTECTED]>



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Memory management issues with 2.4.4

2001-05-04 Thread Jorge Nerín

Marcelo Tosatti wrote:

> 
> On Wed, 2 May 2001, Jorge Nerin wrote:
> 
>> Short version:
>> Under very heavy thrashing (about four hours) the system either lockups 
>> or OOM handler kills a task even when there is swap space left.
> 
> 
> First of all, please try to reproduce the problem with 2.4.5-pre1. 
> 
> If it still happens with pre1, please show us the output of "cat
> /proc/slabinfo" when the kernel is about to trigger the OOM killer.
> 
> Thanks.
> 
Well, I have tried with 2.4.5-pre1 compiled form SMP, and the result has been that 
this morning when I wake up the system has the console black (is there any way to 
prevent cons.saver from blanking the screen) and the disks where quiet, so I 
SysRQ-Sync, Umount, powerOff and then, at the last command the console wake up and I 
have been saluted with:

Kernel BUG in sched.c:709!
Invalid operand: 
Dump copied by hand, but not yet filtered by ksymoops (I'm at work now).
kernel panic Aiee, killing interrupt handler!
In interrupt not syncing.

This afternoon when I return home I will feed the stack dump to ksymoops and post the 
results, I mail this now just to see if someone sees the ligth.

-- 
Jorge Nerin
<[EMAIL PROTECTED]>

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Memory management issues with 2.4.4

2001-05-04 Thread Jorge Nerín

Marcelo Tosatti wrote:

 
 On Wed, 2 May 2001, Jorge Nerin wrote:
 
 Short version:
 Under very heavy thrashing (about four hours) the system either lockups 
 or OOM handler kills a task even when there is swap space left.
 
 
 First of all, please try to reproduce the problem with 2.4.5-pre1. 
 
 If it still happens with pre1, please show us the output of cat
 /proc/slabinfo when the kernel is about to trigger the OOM killer.
 
 Thanks.
 
Well, I have tried with 2.4.5-pre1 compiled form SMP, and the result has been that 
this morning when I wake up the system has the console black (is there any way to 
prevent cons.saver from blanking the screen) and the disks where quiet, so I 
SysRQ-Sync, Umount, powerOff and then, at the last command the console wake up and I 
have been saluted with:

Kernel BUG in sched.c:709!
Invalid operand: 
Dump copied by hand, but not yet filtered by ksymoops (I'm at work now).
kernel panic Aiee, killing interrupt handler!
In interrupt not syncing.

This afternoon when I return home I will feed the stack dump to ksymoops and post the 
results, I mail this now just to see if someone sees the ligth.

-- 
Jorge Nerin
[EMAIL PROTECTED]

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Memory management issues with 2.4.4

2001-05-04 Thread Jorge Nerin

Marcelo Tosatti wrote:

 
  On Wed, 2 May 2001, Jorge Nerin wrote:
 
  Short version:
  Under very heavy thrashing (about four hours) the system either lockups
  or OOM handler kills a task even when there is swap space left.
 
 
  First of all, please try to reproduce the problem with 2.4.5-pre1.
 
  If it still happens with pre1, please show us the output of cat
  /proc/slabinfo when the kernel is about to trigger the OOM killer.
 
  Thanks.
 


Well, as I had said this morning I have feed the Oops to ksymoops, note 
that I may have mirtyped something, but anyway here is the output of 
ksymoops:


ksymoops 2.3.4 on i586 2.4.5-pre1.  Options used
  -V (default)
  -k /proc/ksyms (default)
  -l /proc/modules (default)
  -o /lib/modules/2.4.5-pre1/ (default)
  -m /usr/src/linux/System.map (default)

Warning: You did not tell me where to find symbol information.  I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc.  ksymoops -h explains the options.

Warning (compare_maps): ksyms_base symbol 
__VERSIONED_SYMBOL(shmem_file_setup) not found in System.map.  Ignoring 
ksyms_base entry
invalid operand: 
CPU: 1
EIP: 0010:[c0123c4]
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010096
eax: 001b ebx:  ecx: 0001 edx: 0001
esi: c021b960 edi: c5fe2000 ebp: c5fe3ce0 esp: c5fe3c88
Stack: c01e89a5 c01c8af6 02c5  c021b960 c021b954  
0001
0020 00cc c02961c0 0120 c012ca08 c5fe2000  
c216960
c21b954  c5fe2000 0080 0001 c5fe2000 0001 0001 
c12da64
Call Trace:   [c012ca08] [c012da64] [c012dbcc] [c0113bbb] 
[c01058c9] [c0106d07] [c010e568]
  [c01054ea] [c010e591] [c010e568] [c01746de] [c0172dfd] 
[c0173fcc] [c017405c] [c0108516]
  [c0108709] [c0106de0] [c010fc60] [c0129e78] [c0129c78] 
[c0129e52] [c0129e78] [c0129f56]
  [c0129e78] [c0129f83] [c0129a29] [c01471a4] [c012cc9b] 
[c012cd2b] [c0105000] [c01054f3]
Code: 0f 0b 8d 65 b4 5b 5f 89 ec 5d c3 55 89 e5 83 ec 18 57 56

 EIP; 0c0123c4 Before first symbol   =
Trace; c012ca08 free_shortage+1c/100
Trace; c012da64 __alloc_pages+16c/2c0
Trace; c012dbcc __get_free_pages+14/20
Trace; c0113bbb do_fork+93/770
Trace; c01058c9 sys_clone+1d/24
Trace; c0106d07 system_call+37/40
Trace; c010e568 apm_magic+0/8
Trace; c01054ea kernel_thread+1a/30
Trace; c010e591 apm_power_off+21/3c
Trace; c010e568 apm_magic+0/8
Trace; c01746de handle_sysrq+de/230
Trace; c0172dfd handle_scancode+1bd/318
Trace; c0173fcc handle_kbd_event+130/1a4
Trace; c017405c keyboard_interrupt+1c/28
Trace; c0108516 handle_IRQ_event+52/7c
Trace; c0108709 do_IRQ+99/ec
Trace; c0106de0 ret_from_intr+0/20
Trace; c010fc60 smp_call_function+8c/c0
Trace; c0129e78 do_ccupdate_local+0/40
Trace; c0129c78 kmem_cache_create+220/3e0
Trace; c0129e52 smp_call_function_all_cpus+1a/40
Trace; c0129e78 do_ccupdate_local+0/40
Trace; c0129f56 drain_cpu_caches+9e/c0
Trace; c0129e78 do_ccupdate_local+0/40
Trace; c0129f83 __kmem_cache_shrink+b/6c
Trace; c0129a29 kmem_slab_destroy+79/a8
Trace; c01471a4 shrink_dcache_memory+2c/30
Trace; c012cc9b do_try_to_free_pages+5f/7c
Trace; c012cd2b kswapd+73/110
Trace; c0105000 init+0/1b0
Trace; c01054f3 kernel_thread+23/30
Code;  0c0123c4 Before first symbol
 _EIP:
Code;  0c0123c4 Before first symbol   =
0:   0f 0b ud2a  =
Code;  0c0123c6 Before first symbol
2:   8d 65 b4  lea0xffb4(%ebp),%esp
Code;  0c0123c9 Before first symbol
5:   5bpop%ebx
Code;  0c0123ca Before first symbol
6:   5fpop%edi
Code;  0c0123cb Before first symbol
7:   89 ec mov%ebp,%esp
Code;  0c0123cd Before first symbol
9:   5dpop%ebp
Code;  0c0123ce Before first symbol
a:   c3ret
Code;  0c0123cf Before first symbol
b:   55push   %ebp
Code;  0c0123d0 Before first symbol
c:   89 e5 mov%esp,%ebp
Code;  0c0123d2 Before first symbol
e:   83 ec 18  sub$0x18,%esp
Code;  0c0123d5 Before first symbol
   11:   57push   %edi
Code;  0c0123d6 Before first symbol
   12:   56push   %esi

Kernel panic Aiee killing interrupt handler

2 warnings issued.  Results may not be reliable.

As I said I have tried to no make errors, but I copied it at 7 in the 
morning, so who knows ... ;-)

-- 
Jorge Nerin
[EMAIL PROTECTED]



-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  

Re: Memory management issues with 2.4.4

2001-05-02 Thread Marcelo Tosatti



On Wed, 2 May 2001, Jorge Nerin wrote:

> Short version:
> Under very heavy thrashing (about four hours) the system either lockups 
> or OOM handler kills a task even when there is swap space left.

First of all, please try to reproduce the problem with 2.4.5-pre1. 

If it still happens with pre1, please show us the output of "cat
/proc/slabinfo" when the kernel is about to trigger the OOM killer.

Thanks.



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Memory management issues with 2.4.4

2001-05-02 Thread Jorge Nerin

Short version:
Under very heavy thrashing (about four hours) the system either lockups 
or OOM handler kills a task even when there is swap space left.

Long version:
My system is a dual 2x200MMX in a Gigabyte 586DX with 96Mb and 226Mb of 
swap this way:

[root@quartz ~]# swapon -s
FilenameTypeSizeUsedPriority
/dev/hda7   partition   96348   64561
/dev/hdc6   partition   130276  64601

Well, I have tried to compile Mozilla 0.8.1 since the day it came out, 
but I always lockup in the same place, wich it's begining to be a bit 
frustrating ;-)

The problem is that compiling the file  
content/base/src/nsStyleContext.o makes cc1plus grow up to a size of 
141M, at this point the system is in heavy thrashing, kswapd is using 
about 7-12% of CPU time and cc1plus is using 8-14%.

If I use a SMP kernel the system always ends up frozen, some times after 
about almost one day of uptime and compiling since booting, and another 
times  it gets frozen in much less time, three or four hours.

If I use a non SMP kernel the system doesn't lokup but after some hours, 
it varies between 6-8h, the cc1plus procces get a kill signal by OOM 
killer, althought there is plenty of swap space left ((96Mb RAM + 226Mb 
swap) - (140Mb - tiny amount used by the system) = plenty ;-).

For this tries I have left the system in single user, so no cron jobs, 
no network trafic, no etc...

I suspect there is a race in the swap handling in SMP, and that the OOM 
doesn't take into account the swap space left sometimes.

I don't know what to try next, suggestions?

-- 
Jorge Nerin
<[EMAIL PROTECTED]>

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Memory management issues with 2.4.4

2001-05-02 Thread Jorge Nerin

Short version:
Under very heavy thrashing (about four hours) the system either lockups 
or OOM handler kills a task even when there is swap space left.

Long version:
My system is a dual 2x200MMX in a Gigabyte 586DX with 96Mb and 226Mb of 
swap this way:

[root@quartz ~]# swapon -s
FilenameTypeSizeUsedPriority
/dev/hda7   partition   96348   64561
/dev/hdc6   partition   130276  64601

Well, I have tried to compile Mozilla 0.8.1 since the day it came out, 
but I always lockup in the same place, wich it's begining to be a bit 
frustrating ;-)

The problem is that compiling the file  
content/base/src/nsStyleContext.o makes cc1plus grow up to a size of 
141M, at this point the system is in heavy thrashing, kswapd is using 
about 7-12% of CPU time and cc1plus is using 8-14%.

If I use a SMP kernel the system always ends up frozen, some times after 
about almost one day of uptime and compiling since booting, and another 
times  it gets frozen in much less time, three or four hours.

If I use a non SMP kernel the system doesn't lokup but after some hours, 
it varies between 6-8h, the cc1plus procces get a kill signal by OOM 
killer, althought there is plenty of swap space left ((96Mb RAM + 226Mb 
swap) - (140Mb - tiny amount used by the system) = plenty ;-).

For this tries I have left the system in single user, so no cron jobs, 
no network trafic, no etc...

I suspect there is a race in the swap handling in SMP, and that the OOM 
doesn't take into account the swap space left sometimes.

I don't know what to try next, suggestions?

-- 
Jorge Nerin
[EMAIL PROTECTED]

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



Re: Memory management issues with 2.4.4

2001-05-02 Thread Marcelo Tosatti



On Wed, 2 May 2001, Jorge Nerin wrote:

 Short version:
 Under very heavy thrashing (about four hours) the system either lockups 
 or OOM handler kills a task even when there is swap space left.

First of all, please try to reproduce the problem with 2.4.5-pre1. 

If it still happens with pre1, please show us the output of cat
/proc/slabinfo when the kernel is about to trigger the OOM killer.

Thanks.



-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/