Re: TCP connection hang problem with 2.6.16.16, e1000.

2006-05-31 Thread Herbert Xu
On Wed, May 31, 2006 at 04:45:36PM +, Ben Greear wrote: > > May 31 08:07:00 xeon-dt kernel: irqbalance: page allocation failure. order:1, > mode:0x20 > May 31 08:07:00 xeon-dt kernel: [<78147393>] __alloc_pages+0x1d8/0x2b8 > May 31 08:07:00 xeon-dt kernel: [<7815c525>] kmem_getpages+0x2f/0x8

Re: TCP connection hang problem with 2.6.16.16, e1000 (found work-around)

2006-05-31 Thread Ben Greear
Stephen Hemminger wrote: On Wed, 31 May 2006 14:45:13 -0700 Ben Greear <[EMAIL PROTECTED]> wrote: Ok, I think I found the problem. It appears to be at least partially self-inflicted. I was modifying the kernel buffer settings via /proc/* to improve performance. I noticed that when I used ke

Re: TCP connection hang problem with 2.6.16.16, e1000 (found work-around)

2006-05-31 Thread Stephen Hemminger
On Wed, 31 May 2006 14:45:13 -0700 Ben Greear <[EMAIL PROTECTED]> wrote: > Ok, I think I found the problem. It appears to be at least partially > self-inflicted. I was modifying the kernel buffer settings via /proc/* > to improve performance. I noticed that when I used kernel-defaults, > the TC

Re: TCP connection hang problem with 2.6.16.16, e1000 (found work-around)

2006-05-31 Thread Ben Greear
Ok, I think I found the problem. It appears to be at least partially self-inflicted. I was modifying the kernel buffer settings via /proc/* to improve performance. I noticed that when I used kernel-defaults, the TCP connection hangs stopped (and ran ~20% slower). My personal opinion is that th

Re: TCP connection hang problem with 2.6.16.16, e1000.

2006-05-31 Thread Ben Greear
Ben Greear wrote: I haven't seen this problem on 2.6.13, so I'm now starting a manual bisect to see if I can narrow down where the problem appeared. Turns out, I can reproduce it in 2.6.13, and 2.6.9. I haven't tried anything older. I also tried to reproduce it using a simpler traffic genera

Re: TCP connection hang problem with 2.6.16.16, e1000.

2006-05-31 Thread Ben Greear
Auke Kok wrote: Ben Greear wrote: Kernel is 2.6.16.16 with my patches, including a patch to the e1000. what patch? I don't see it anywhere. Well, it's reproducible with only the send-to-self patch, so I figured it wouldn't help to show the e1000 patches. Also, it is reproducible on tg3,

Re: TCP connection hang problem with 2.6.16.16, e1000.

2006-05-31 Thread Auke Kok
Ben Greear wrote: Kernel is 2.6.16.16 with my patches, including a patch to the e1000. what patch? I don't see it anywhere. Cheers, Auke - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kerne

Re: TCP connection hang problem with 2.6.16.16, e1000.

2006-05-31 Thread Ben Greear
Ben Greear wrote: Kernel is 2.6.16.16 with my patches, including a patch to the e1000. I also rebuilt a fresh kernel with only the attached send-to-self patch. I see the hang, but there was no OOM messages, probably because the machine was freshly rebooted and had plenty of buffers available.