LTP Results for 2.6.x and 2.4.x

2005-01-18 Thread Bryce Harrington
Here's an updated summary of our LTP regression test runs against the
2.6.x and 2.4.x kernels on RedHat 9.0:

http://developer.osdl.org/bryce/ltp/


Briefly, here are the numbers for the most recent kernels:

Patch Name   TestReq#   LTP Ver   CPUPASS  FAIL  WARN  BROK
---
linux-2.6.10   299759   20041105  2-way  2196 6 2 6
patch-2.6.10-rc3   299166   20041007  2-way  2199 6 2 6
patch-2.6.10-rc2   298746   20041007  2-way  2198 8 2 6
patch-2.6.10-rc1   298400   20041007  2-way  2198 6 2 6


Patch Name   TestReq#   LTP   CPUPASS  FAIL  WARN  BROK
---
patch-2.4.29-rc3   300054   20041105  2-way  2210 3 2 3
patch-2.4.29-rc1   299873   20041105  2-way  2210 3 2 3
patch-2.4.29-pre2  299601   20041105  2-way  2210 3 2 3
patch-2.4.29-pre1  298976   20041007  2-way  2210 3 2 3
linux-2.4.28   298851   20041007  2-way  2210 3 2 3


A summary and a detailed report of the current failures on 2.6.10 is
available at:

   http://khack.osdl.org/299759/results/FAIL_summary.txt
   http://developer.osdl.org/bryce/ltp/failrpt_299759_2.6.10.txt

I've run into some issues with patch-2.6.11-rc1 and the latest LTP, but
will post numbers when I've sorted those out.

Bryce

On Fri, 1 Oct 2004, Linus Torvalds wrote:
> On Fri, 1 Oct 2004, Bryce Harrington wrote:
> >
> >  madvise02 7   FAIL : madvise failed with wrong errno, expected
> >   errno = 22, got errno = 12 : Cannot allocate
> >   memory
> >See: ltp/testcases/kernel/syscalls/madvise/madvise02.c
>
> Are you running this test on a 64-bit kernel with a 32-bit test
> environment? This failure _looks_ that way, apparently because the
> compatibility layer doesn't sign-extend "len". And quite frankly,
> sign-extending it would be silly, although I think it would make the test
> happy.
>
>   Linus
>


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Kernel Panic with LTP on 2.6.11-rc1 (was Re: LTP Results for 2.6.x and 2.4.x)

2005-01-21 Thread Bryce Harrington
On Tue, 18 Jan 2005, Bryce Harrington wrote:
> Here's an updated summary of our LTP regression test runs against the
> 2.6.x and 2.4.x kernels on RedHat 9.0:
>
> http://developer.osdl.org/bryce/ltp/
>
> I've run into some issues with patch-2.6.11-rc1 and the latest LTP, but
> will post numbers when I've sorted those out.

Okay, it looks like there is a regression of Linux 2.6.11-rc1 on LTP.

I've run a bunch of tests to narrow down and rule things out:

   * Other tests (e.g. open_posix) run ok on this kernel
   * This version of LTP is working fine on other kernels
   * Not a hardware problem; same issue occurs on all of our 2-ways
   * Not distro-dependent; problem occurs for me on RH 9.0, and SuSE 9.0
 and 9.2

Here is the output from the test causing the panic:

<<>>
tag=gf13 stime=110659
cmdline="mkfifo gffifo18; growfiles -b -W gf13 -e 1 -u -i 0 -L 30 -I r
-r 1-4096 gffifo18"
contacts=""
analysis=exit
initiation_status="ok"
<<>>
growfiles(gf13): 17094 DEBUG1 Using random seed of 1106350453
Kernel panic - not syncing: Out of memory and no killable processes...


The full output are available at these links:

FAIL   LTP  2.6.11-rc1  SuSE 9.0  2-way  http://khack.osdl.org/stp/300213/
FAIL   LTP  2.6.11-rc1  SuSE 9.2  2-way  http://khack.osdl.org/stp/300219/
FAIL   LTP  2.6.11-rc1  SuSE 9.2  1-way  http://khack.osdl.org/stp/300209/

OK LTP  2.6.10  SuSE 9.2  2-way  http://khack.osdl.org/stp/300230
OK LTP  2.6.10  SuSE 9.0  2-way  http://khack.osdl.org/stp/300229
OK LTP  2.6.10  RH 9.02-way  http://khack.osdl.org/stp/300228
OK OPTS 2.6.11-rc1  RH 9.22-way  http://khack.osdl.org/stp/300227


These are all run on the December version of LTP (20041203).
growfiles.c is in ltp-full-20041203/testcases/kernel/fs/doio.

Bryce


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [Dev] Re: Kernel Panic with LTP on 2.6.11-rc1 (was Re: LTP Results for 2.6.x and 2.4.x)

2005-01-21 Thread Bryce Harrington
On Fri, 21 Jan 2005, Chris Wright wrote:
> * Andrew Morton ([EMAIL PROTECTED]) wrote:
> > Bryce Harrington <[EMAIL PROTECTED]> wrote:
> > I am unable to find the oops trace amongst all that stuff.  Help?
> >
> > (It would have been handy to include it in the bug report, actually)
>
> Yes, it would.  Or at least some better granularity leading up to the
> problem.  I ran growfiles locally on 2.6.11-rc-bk and didn't have any
> problem.  Could you strace growfiles and see what it was doing when it
> killed the machine?

Okay, I'll set up another run and try collecting that info.  Is there
any other data that would be useful to collect while I'm at it?

Bryce

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [LTP] Re: [Dev] Re: Kernel Panic with LTP on 2.6.11-rc1 (was Re: LTP Results for 2.6.x and 2.4.x)

2005-01-21 Thread Bryce Harrington
On Fri, 21 Jan 2005, Bryce Harrington wrote:
> On Fri, 21 Jan 2005, Chris Wright wrote:
> > * Andrew Morton ([EMAIL PROTECTED]) wrote:
> > > Bryce Harrington <[EMAIL PROTECTED]> wrote:
> > > I am unable to find the oops trace amongst all that stuff.  Help?
> > >
> > > (It would have been handy to include it in the bug report, actually)
> >
> > Yes, it would.  Or at least some better granularity leading up to the
> > problem.  I ran growfiles locally on 2.6.11-rc-bk and didn't have any
> > problem.  Could you strace growfiles and see what it was doing when it
> > killed the machine?
>
> Okay, I'll set up another run and try collecting that info.  Is there
> any other data that would be useful to collect while I'm at it?

Well, I'm not having much luck.  strace isn't installed on the system
(and is giving errors when trying to compile it).  Also, the ssh session
(and sshd) quits whenever I try running the following growfiles command
manually, so I'm having trouble replicating the kernel panic manually.

# growfiles -W gf14 -b -e 1 -u -i 0 -L 20 -w -l -C 1 -T 10 glseek19 glseek19.2

Anyway, if anyone wants to investigate this further, I can provide
access to the machine (email me).  Otherwise, I'm probably just going to
wait for -rc2 and see if the problem's still there.

Thanks,
Bryce



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Kernel 2.6.11-rc1/2 goes Postal on LTP

2005-01-22 Thread Bryce Harrington
On Fri, 21 Jan 2005, Chris Wright wrote:
> * Bryce Harrington ([EMAIL PROTECTED]) wrote:
> > Well, I'm not having much luck.  strace isn't installed on the system
> > (and is giving errors when trying to compile it).  Also, the ssh session
> > (and sshd) quits whenever I try running the following growfiles command
> > manually, so I'm having trouble replicating the kernel panic manually.
>
> Sounds very much like oom killer gone nuts.
>
> > # growfiles -W gf14 -b -e 1 -u -i 0 -L 20 -w -l -C 1 -T 10 glseek19 
> > glseek19.2
> >
> > Anyway, if anyone wants to investigate this further, I can provide
> > access to the machine (email me).  Otherwise, I'm probably just going to
> > wait for -rc2 and see if the problem's still there.
>
> Wait no longer, it's here ;-)

Hmm, still the kernel is going nutso.  LTP and everything else on the
system is getting killed, including the test manager process.

Below's a bit more info scraped from the console.  This is from a run on
RH 9.0.  It looks like LTP got through 722 of its 2270 tests before the
kernel goes postal on it.  This time it got through all the growfile
commands before it died (see bottom of this post), however it looks like
the same growfile cmd I reported earlier is the one causing the problem.

When I run:

mkfifo gffifo18;
strace growfiles -b -W gf13 -e 1 -u -i 0 -L 30 -I r -r 1-4096 gffifo18 &> 
/tmp/growfiles_strace_log.txt

The following happens:

* I get this strace log:  
http://developer.osdl.org/bryce/growfiles_strace_log.txt
* The ssh session dies and returns to login prompt
* A bunch of stuff similar to below is spewed to console

Bryce


Console
===
...snip...
Memory: 905212k/917504k available (2211k kernel code, 11840k reserved,
871k data, 192k init, 0k highmem)
...snip...

HighMem per-cpu: empty

Free pages:3828kB (0kB HighMem)
Active:341 inactive:297 dirty:0 writeback:1 unstable:0 free:957
slab:1591 mapped:614 pagetables:172
DMA free:68kB min:68kB low:84kB high:100kB active:0kB inactive:0kB
present:16384kB pages_scanned:0 all_unreclaimable? yes
protections[]: 0 0 0
Normal free:3760kB min:3756kB low:4692kB high:5632kB active:1364kB
inactive:1188kB present:901120kB pages_scanned:1571 all_unreclaimable?
no
protections[]: 0 0 0
HighMem free:0kB min:128kB low:160kB high:192kB active:0kB inactive:0kB
present:0kB pages_scanned:0 all_unreclaimable? no
protections[]: 0 0 0
DMA: 1*4kB 0*8kB 0*16kB 0*32kB 1*64kB 0*128kB 0*256kB 0*512kB 0*1024kB
0*2048kB 0*4096kB = 68kB
Normal: 6*4kB 5*8kB 1*16kB 1*32kB 1*64kB 0*128kB 0*256kB 1*512kB
1*1024kB 1*2048kB 0*4096kB = 3760kB
HighMem: empty
Swap cache: add 4541, delete 4457, find 33/60, race 0+0
Out of Memory: Killed process 2103 (jserver).
oom-killer: gfp_mask=0x1d2
DMA per-cpu:
cpu 0 hot: low 2, high 6, batch 1
cpu 0 cold: low 0, high 2, batch 1
cpu 1 hot: low 2, high 6, batch 1
cpu 1 cold: low 0, high 2, batch 1
Normal per-cpu:
cpu 0 hot: low 32, high 96, batch 16
cpu 0 cold: low 0, high 32, batch 16
cpu 1 hot: low 32, high 96, batch 16
cpu 1 cold: low 0, high 32, batch 16
HighMem per-cpu: empty

Free pages:3764kB (0kB HighMem)
Active:360 inactive:267 dirty:0 writeback:0 unstable:0 free:941
slab:1520 mapped:614 pagetables:167
DMA free:68kB min:68kB low:84kB high:100kB active:0kB inactive:0kB
present:16384kB pages_scanned:0 all_unreclaimable? yes
protections[]: 0 0 0
Normal free:3696kB min:3756kB low:4692kB high:5632kB active:1440kB
inactive:1068kB present:901120kB pages_scanned:3461 all_unreclaimable?
yes
protections[]: 0 0 0
HighMem free:0kB min:128kB low:160kB high:192kB active:0kB inactive:0kB
present:0kB pages_scanned:0 all_unreclaimable? no
protections[]: 0 0 0
DMA: 1*4kB 0*8kB 0*16kB 0*32kB 1*64kB 0*128kB 0*256kB 0*512kB 0*1024kB
0*2048kB 0*4096kB = 68kB
Normal: 0*4kB 0*8kB 1*16kB 1*32kB 1*64kB 0*128kB 0*256kB 1*512kB
1*1024kB 1*2048kB 0*4096kB = 3696kB
HighMem: empty
Swap cache: add 4994, delete 4918, find 106/200, race 0+0
Out of Memory: Killed process 2052 (ntpd).
oom-killer: gfp_mask=0xd2
DMA per-cpu:
cpu 0 hot: low 2, high 6, batch 1
cpu 0 cold: low 0, high 2, batch 1
cpu 1 hot: low 2, high 6, batch 1
cpu 1 cold: low 0, high 2, batch 1
Normal per-cpu:
cpu 0 hot: low 32, high 96, batch 16
cpu 0 cold: low 0, high 32, batch 16
cpu 1 hot: low 32, high 96, batch 16
cpu 1 cold: low 0, high 32, batch 16
HighMem per-cpu: empty

Free pages:3764kB (0kB HighMem)
Active:0 inactive:13 dirty:0 writeback:0 unstable:0 free:941 slab:1482
mapped:0 pagetables:162
DMA free:68kB min:68kB low:84kB high:100kB active:0kB inactive:0kB
present:16384kB pages_scanned:0 all_unreclaimable? yes
protections[]: 0 0 0
Normal free:3696kB min:3756kB low:4692kB high:5632kB active:0kB
inactive:52kB present:901120kB pages_scanned:134 all_unreclaimable? yes
protections[]: 0 0 0
HighMem free:0kB min:128kB low:160kB high:192kB active:0kB inactive:0kB
present:0kB pages_scanned:0 all_unrecla

Re: [torvalds@osdl.org: Re: Memory leak in 2.6.11-rc1?]

2005-01-24 Thread Bryce Harrington
Hi Chris,

I applied the patch and reran the test on a RH 9.0 system.  LTP is
continuing past where it failed before, and processes are not getting
killed, so I assume the OOM killer is no longer getting activated.

There is a new behavior, though.  Now the test is hanging indefinitely
on the nptl01 test.  I am assuming that since it passed the spot that it
had failed before, that this is an unrelated issue.

Bryce


On Mon, 24 Jan 2005, Chris Wright wrote:
> Any chance you could re-try with this patch applied?
>
> - Forwarded message from Linus Torvalds <[EMAIL PROTECTED]> -
>
> Date: Mon, 24 Jan 2005 14:35:47 -0800 (PST)
> From: Linus Torvalds <[EMAIL PROTECTED]>
> To: Andrew Morton <[EMAIL PROTECTED]>
> cc: Jens Axboe <[EMAIL PROTECTED]>, [EMAIL PROTECTED], [EMAIL PROTECTED],
>linux-kernel@vger.kernel.org, [EMAIL PROTECTED]
> Subject: Re: Memory leak in 2.6.11-rc1?
>
>
>
> On Mon, 24 Jan 2005, Andrew Morton wrote:
> >
> > Would indicate that the new pipe code is leaking.
>
> Duh. It's the pipe merging.
>
>   Linus
>
> 
> --- 1.40/fs/pipe.c2005-01-15 12:01:16 -08:00
> +++ edited/fs/pipe.c  2005-01-24 14:35:09 -08:00
> @@ -630,13 +630,13 @@
>   struct pipe_inode_info *info = inode->i_pipe;
>
>   inode->i_pipe = NULL;
> - if (info->tmp_page)
> - __free_page(info->tmp_page);
>   for (i = 0; i < PIPE_BUFFERS; i++) {
>   struct pipe_buffer *buf = info->bufs + i;
>   if (buf->ops)
>   buf->ops->release(info, buf);
>   }
> + if (info->tmp_page)
> + __free_page(info->tmp_page);
>   kfree(info);
>  }
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to [EMAIL PROTECTED]
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/
>
> - End forwarded message -
>
>



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [torvalds@osdl.org: Re: Memory leak in 2.6.11-rc1?]

2005-01-25 Thread Bryce Harrington
On Mon, 24 Jan 2005, Andrew Morton wrote:
> Bryce Harrington <[EMAIL PROTECTED]> wrote:
> >
> > There is a new behavior, though.  Now the test is hanging indefinitely
> >  on the nptl01 test.  I am assuming that since it passed the spot that it
> >  had failed before, that this is an unrelated issue.
>
> I'm finding that nptl01 somtimes gets stuck and sometimes works OK.  Try
> running it by hand a few times.
>
> It could be an application bug or a kernel bug.

Okay, so it doesn't sound like anything to be worried about.

Thanks,
Bryce

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/