Firewire lockup on 2.6.22.3 in SMP but not UP, old drivers

2007-08-23 Thread Andrew Burgess
I can pull video via firewire from a set top box using test_mpeg2 for hours (97GB is my current record) if using a UP kernel. The machine locks hard after about 500MB using an SMP kernel. No messages, no altsysreq keys. It seems to lock faster if I make the machine busy (disk and cpu). One time

Firewire lockup on 2.6.22.3 in SMP but not UP, old drivers

2007-08-23 Thread Andrew Burgess
I can pull video via firewire from a set top box using test_mpeg2 for hours (97GB is my current record) if using a UP kernel. The machine locks hard after about 500MB using an SMP kernel. No messages, no altsysreq keys. It seems to lock faster if I make the machine busy (disk and cpu). One time

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-06 Thread Andrew Burgess
James Bottomley wrote: > On Fri, 2007-04-06 at 12:32 -0400, Douglas Gilbert wrote: > > That last line should be: > > request_buffer[7] = 10; /* minimum size per SPC: 18 bytes */ > > Er, yes, I'm afraid when I see n-7 I always think n is the lenght, not n > is the last byte ... OK.

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-06 Thread Andrew Burgess
James Bottomley wrote: >It's actually a long standing bug in the 3w- driver. Apparently it >assumes request sense is always the use_sg == 0 case. This is what it >does on a request sense: >static int tw_scsiop_request_sense(TW_Device_Extension *tw_dev, int request_id) >{ >

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-06 Thread Andrew Burgess
James Bottomley said: > On Thu, 2007-04-05 at 19:21 -0700, Andrew Burgess wrote: > > 2.6.20.4 with your patch dies in the memcpy (as does 21-gitN) > > > > 2.6.20.4 without your patch dies in the subsequent __free_page > > with a null pointer ref at 000...008 >

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-06 Thread Andrew Burgess
James Bottomley said: On Thu, 2007-04-05 at 19:21 -0700, Andrew Burgess wrote: 2.6.20.4 with your patch dies in the memcpy (as does 21-gitN) 2.6.20.4 without your patch dies in the subsequent __free_page with a null pointer ref at 000...008 James should I try your posted patch

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-06 Thread Andrew Burgess
James Bottomley wrote: It's actually a long standing bug in the 3w- driver. Apparently it assumes request sense is always the use_sg == 0 case. This is what it does on a request sense: static int tw_scsiop_request_sense(TW_Device_Extension *tw_dev, int request_id) {

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-06 Thread Andrew Burgess
James Bottomley wrote: On Fri, 2007-04-06 at 12:32 -0400, Douglas Gilbert wrote: That last line should be: request_buffer[7] = 10; /* minimum size per SPC: 18 bytes */ Er, yes, I'm afraid when I see n-7 I always think n is the lenght, not n is the last byte ... OK. That

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-05 Thread Andrew Burgess
On Thu, 2007-04-05 at 15:36 -0700, David Miller wrote: > From: Andrew Burgess <[EMAIL PROTECTED]> > Date: Thu, 5 Apr 2007 15:13:27 -0700 > > > David, do you see any other problems with scsi_send_eh_cmnd? > > > > I've switched back to 2.6.18 which seems to

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-05 Thread Andrew Burgess
Chuck Ebbert wrote: >Andrew Burgess wrote: > >> Apr 5 03:45:16 cichlid kernel: 3w-: scsi2: Command failed: status = >> 0xc7, flags = 0x7f, unit #4. >> Apr 5 03:45:20 cichlid kernel: 3w-: scsi2: Command failed: status = >> 0xc7, flags = 0x80, unit #4. >

Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10

2007-04-05 Thread Andrew Burgess
The machine is x86_64 SMP. I also got the oops in the Fedora kernels: 2.6.20-1.2933.fc6 and 2.6.20-1.3017.fc7. The system isn't locked solid but it seems anything touching the scsi disks hangs. I also twice got this early in the boot and it stopped booting. Anything I can do to help just ask.

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-05 Thread Andrew Burgess
Chuck Ebbert wrote: Andrew Burgess wrote: Apr 5 03:45:16 cichlid kernel: 3w-: scsi2: Command failed: status = 0xc7, flags = 0x7f, unit #4. Apr 5 03:45:20 cichlid kernel: 3w-: scsi2: Command failed: status = 0xc7, flags = 0x80, unit #4. Apr 5 03:47:20 cichlid kernel: 3w-

Re: Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10,13

2007-04-05 Thread Andrew Burgess
On Thu, 2007-04-05 at 15:36 -0700, David Miller wrote: From: Andrew Burgess [EMAIL PROTECTED] Date: Thu, 5 Apr 2007 15:13:27 -0700 David, do you see any other problems with scsi_send_eh_cmnd? I've switched back to 2.6.18 which seems to not oops and am happy to try patches. Does

Oops in scsi_send_eh_cmnd 2.6.21-rc5-git6,7,10

2007-04-05 Thread Andrew Burgess
The machine is x86_64 SMP. I also got the oops in the Fedora kernels: 2.6.20-1.2933.fc6 and 2.6.20-1.3017.fc7. The system isn't locked solid but it seems anything touching the scsi disks hangs. I also twice got this early in the boot and it stopped booting. Anything I can do to help just ask.

Re: CPU load

2007-02-12 Thread Andrew Burgess
On 12/02/07, Vassili Karpov <[EMAIL PROTECTED]> wrote: > > How does the kernel calculates the value it places in `/proc/stat' at > 4th position (i.e. "idle: twiddling thumbs")? > .. > > Later small kernel module was developed that tried to time how much > time is spent in the idle handler inside

Re: md: md6_raid5 crash 2.6.20

2007-02-12 Thread Andrew Burgess
>However >this "kernel BUG" is something newly introduced in 2.6.20 which should >be fixed in 2.6.20.1. Patch is below. I am using raid6. Am I at risk after applying this patch? Thanks for your time! - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a

Re: CPU load

2007-02-12 Thread Andrew Burgess
On 12/02/07, Vassili Karpov [EMAIL PROTECTED] wrote: How does the kernel calculates the value it places in `/proc/stat' at 4th position (i.e. idle: twiddling thumbs)? .. Later small kernel module was developed that tried to time how much time is spent in the idle handler inside the kernel

Re: md: md6_raid5 crash 2.6.20

2007-02-12 Thread Andrew Burgess
However this kernel BUG is something newly introduced in 2.6.20 which should be fixed in 2.6.20.1. Patch is below. I am using raid6. Am I at risk after applying this patch? Thanks for your time! - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to

Re: 2.6.20-rt5 Oops on boot

2007-02-10 Thread Andrew Burgess
>I have terrible news: 2.6.20-rt5 does not boot at all on a couple >machines I was brave enough to try -- a [EMAIL PROTECTED] SMP/HT desktop, and a >Core2 Duo [EMAIL PROTECTED] laptop. Ditto for me on an ASUS AMD64 x2, just hangs, I have no serial console. 2.6.20-rc5-rt7 booted ok (the last one I

Re: 2.6.20-rt5 Oops on boot

2007-02-10 Thread Andrew Burgess
I have terrible news: 2.6.20-rt5 does not boot at all on a couple machines I was brave enough to try -- a [EMAIL PROTECTED] SMP/HT desktop, and a Core2 Duo [EMAIL PROTECTED] laptop. Ditto for me on an ASUS AMD64 x2, just hangs, I have no serial console. 2.6.20-rc5-rt7 booted ok (the last one I

Re: Swap prefetch merge plans

2007-02-09 Thread Andrew Burgess
>I'm stuck developing code I'm having trouble proving it helps. Normal users >find it helps and an artificial testcase shows it helps, but that is not >enough, since the normal users will be tainted in their opinion, and the >artificial testcase will always be artificial. My mistake of

Re: Swap prefetch merge plans

2007-02-09 Thread Andrew Burgess
I'm stuck developing code I'm having trouble proving it helps. Normal users find it helps and an artificial testcase shows it helps, but that is not enough, since the normal users will be tainted in their opinion, and the artificial testcase will always be artificial. My mistake of developing

Re: Kernel 2.6.12 and 2.6.13 hangs for a while on boot

2005-09-02 Thread Andrew Burgess
>I'm having little hangs while booting with kernels 2.6.12 and 2.6.13-rc1, rc2 >and rc3. >It is strange that 2.6.12-rc1 booted ok without hangs. >Hangs appears just before mounting filesystems message and before configuring >system to use udev. I had a similar problem (intermittant) and

Re: Kernel 2.6.12 and 2.6.13 hangs for a while on boot

2005-09-02 Thread Andrew Burgess
I'm having little hangs while booting with kernels 2.6.12 and 2.6.13-rc1, rc2 and rc3. It is strange that 2.6.12-rc1 booted ok without hangs. Hangs appears just before mounting filesystems message and before configuring system to use udev. I had a similar problem (intermittant) and narrowed it

Re: 2.6.13-rc6-rt3

2005-08-18 Thread Andrew Burgess
>I haven't used any of the RT patches since V0.7.51-xx, but I upgraded to -rt8 >yesterday and had a couple of problems. I've just noticed you released -rt9, >but I don't think my problem is listed as fixed.. I'll upgrade anyway, in a >minute. >The problem I'm having is that when the kernel

Re: 2.6.13-rc6-rt3

2005-08-18 Thread Andrew Burgess
I haven't used any of the RT patches since V0.7.51-xx, but I upgraded to -rt8 yesterday and had a couple of problems. I've just noticed you released -rt9, but I don't think my problem is listed as fixed.. I'll upgrade anyway, in a minute. The problem I'm having is that when the kernel probes

Re: rc6 keeps hanging and blanking displays where rc4-mm1 works

2005-08-12 Thread Andrew Burgess
>Unless you are using data=journal and have turned write cache off on >your IDE drives that is expected. I seem to recall reading that after a power failure a modern IDE drive gets enough power from the motor spinning down to always be able to write the cache to the media. Am I wrong about that

Re: BUGS: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16 with CONFIG_DEBUG_SLAB

2005-08-12 Thread Andrew Burgess
>> Here is 2.6.13-rc5-RT-V0.7.52-16 SMP with CONFIG_DEBUG_SLAB set. >> How may I help? >ok, could you try the -53-05 (or later) kernel, does it work any better? Hi Ingo I will try it. But I think I wrongly pointed the finger to RT. I can't boot 13rc6 vanilla in smp and I seem to be having bttv

Re: bttv oopses, is this a valid v4l test?

2005-08-12 Thread Andrew Burgess
>> I find that if I just cat /dev/video to a file it grows about >> 5MB/sec. Is this 'good enough' to test the bttv/bt878 pci/dma >> oopses? I want to test in single user mode to minimize damage >> to my filesystem. > bttv, on overlay mode does something that is not very often used by

Re: BUGS: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16 with CONFIG_DEBUG_SLAB

2005-08-12 Thread Andrew Burgess
Here is 2.6.13-rc5-RT-V0.7.52-16 SMP with CONFIG_DEBUG_SLAB set. How may I help? ok, could you try the -53-05 (or later) kernel, does it work any better? Hi Ingo I will try it. But I think I wrongly pointed the finger to RT. I can't boot 13rc6 vanilla in smp and I seem to be having bttv

Re: bttv oopses, is this a valid v4l test?

2005-08-12 Thread Andrew Burgess
I find that if I just cat /dev/video to a file it grows about 5MB/sec. Is this 'good enough' to test the bttv/bt878 pci/dma oopses? I want to test in single user mode to minimize damage to my filesystem. bttv, on overlay mode does something that is not very often used by other types of

Re: rc6 keeps hanging and blanking displays where rc4-mm1 works

2005-08-12 Thread Andrew Burgess
Unless you are using data=journal and have turned write cache off on your IDE drives that is expected. I seem to recall reading that after a power failure a modern IDE drive gets enough power from the motor spinning down to always be able to write the cache to the media. Am I wrong about that

BUGS: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16 with CONFIG_DEBUG_SLAB

2005-08-10 Thread Andrew Burgess
Here is 2.6.13-rc5-RT-V0.7.52-16 SMP with CONFIG_DEBUG_SLAB set. How may I help? messages: Aug 10 11:31:56 cichlid kernel: Linux version 2.6.13-rc5-RT-V0.7.52-16-smp-5-slabdebug ([EMAIL PROTECTED]) (gcc version 3.4.4 20050721 (Red Hat 3.4.4-2)) #5 SMP Wed Aug 10 10:36:09 PDT 2005 ... Aug 10

Intermittant hang/reboot at startup: 2.6.13-rc6 SMP (also no mouse and usb error)

2005-08-10 Thread Andrew Burgess
2.6.13rc6 SMP (single P4 HT) messages: ... Aug 10 11:32:02 cichlid kernel: 3ware Storage Controller device driver for Linux v1.26.02.001. Aug 10 11:32:02 cichlid kernel: PCI: Found IRQ 10 for device :02:06.0 >>> Hangs here one time, rebooted itself here one time <<< The third time it came

Oops: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16 kmem_cache_alloc?

2005-08-10 Thread Andrew Burgess
I'm making another kernel with SLAB_DEBUG set, sound correct? messages: Aug 9 17:22:26 cichlid kernel: Linux version 2.6.13-rc5-RT-V0.7.52-16-up-1 ([EMAIL PROTECTED]) (gcc version 3.4.4 20050721 (Red Hat 3.4.4-2)) #1 Mon Aug 8 16:02:26 PDT 2005 ... Aug 9 17:22:27 cichlid kernel: Kernel

Oops: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16 kmem_cache_alloc?

2005-08-10 Thread Andrew Burgess
I'm making another kernel with SLAB_DEBUG set, sound correct? messages: Aug 9 17:22:26 cichlid kernel: Linux version 2.6.13-rc5-RT-V0.7.52-16-up-1 ([EMAIL PROTECTED]) (gcc version 3.4.4 20050721 (Red Hat 3.4.4-2)) #1 Mon Aug 8 16:02:26 PDT 2005 ... Aug 9 17:22:27 cichlid kernel: Kernel

Intermittant hang/reboot at startup: 2.6.13-rc6 SMP (also no mouse and usb error)

2005-08-10 Thread Andrew Burgess
2.6.13rc6 SMP (single P4 HT) messages: ... Aug 10 11:32:02 cichlid kernel: 3ware Storage Controller device driver for Linux v1.26.02.001. Aug 10 11:32:02 cichlid kernel: PCI: Found IRQ 10 for device :02:06.0 Hangs here one time, rebooted itself here one time The third time it came up

BUGS: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16 with CONFIG_DEBUG_SLAB

2005-08-10 Thread Andrew Burgess
Here is 2.6.13-rc5-RT-V0.7.52-16 SMP with CONFIG_DEBUG_SLAB set. How may I help? messages: Aug 10 11:31:56 cichlid kernel: Linux version 2.6.13-rc5-RT-V0.7.52-16-smp-5-slabdebug ([EMAIL PROTECTED]) (gcc version 3.4.4 20050721 (Red Hat 3.4.4-2)) #5 SMP Wed Aug 10 10:36:09 PDT 2005 ... Aug 10

BUG: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16

2005-08-09 Thread Andrew Burgess
This particular module (uhci-hcd) caused hangs with many recent up rt kernels when loaded by rc.sysinit so I put it in the hotplug blacklist and loaded it manually later. This time I got a BUG. On a probably seperate issue: I've tried this smp kernel a few times and half the time I have no

BUG: Real-Time Preemption 2.6.13-rc5-RT-V0.7.52-16

2005-08-09 Thread Andrew Burgess
This particular module (uhci-hcd) caused hangs with many recent up rt kernels when loaded by rc.sysinit so I put it in the hotplug blacklist and loaded it manually later. This time I got a BUG. On a probably seperate issue: I've tried this smp kernel a few times and half the time I have no

Re: 2.6.12.3 Oops

2005-08-02 Thread Andrew Burgess
I said: >Daniel Walker said: >>You might want to enable slab debugging. Here's how, >OK thanks. I think I'll also switch to 2.6.13rc4 just in case it's something >that's >already been fixed. Am compiling a slab debug 13rc4 now... Which dies at boot before writing /var/log/messages. So I tried my

Re: 2.6.12.3 Oops

2005-08-02 Thread Andrew Burgess
I said: Daniel Walker said: You might want to enable slab debugging. Here's how, OK thanks. I think I'll also switch to 2.6.13rc4 just in case it's something that's already been fixed. Am compiling a slab debug 13rc4 now... Which dies at boot before writing /var/log/messages. So I tried my slab

Re: 2.6.12.3 Oops

2005-08-01 Thread Andrew Burgess
Daniel Walker said: >> Seems to be triggered by mplayer but not right away (30 minutes sometimes), >> sometimes >> no mplayer is necessary. >> >> This is a busy machine. There is continuous usb soundcard (3 soundcards) and >> usb ethernet activity (news server and alot of downloading) and video

2.6.12.3 Oops

2005-08-01 Thread Andrew Burgess
Stock 2.6.12.3 #2 Sun Jul 31 16:55:16 PDT 2005 i686 i686 i386 GNU/Linux Seems to be triggered by mplayer but not right away (30 minutes sometimes), sometimes no mplayer is necessary. This is a busy machine. There is continuous usb soundcard (3 soundcards) and usb ethernet activity (news server

2.6.12.3 Oops

2005-08-01 Thread Andrew Burgess
Stock 2.6.12.3 #2 Sun Jul 31 16:55:16 PDT 2005 i686 i686 i386 GNU/Linux Seems to be triggered by mplayer but not right away (30 minutes sometimes), sometimes no mplayer is necessary. This is a busy machine. There is continuous usb soundcard (3 soundcards) and usb ethernet activity (news server

Re: 2.6.12.3 Oops

2005-08-01 Thread Andrew Burgess
Daniel Walker said: Seems to be triggered by mplayer but not right away (30 minutes sometimes), sometimes no mplayer is necessary. This is a busy machine. There is continuous usb soundcard (3 soundcards) and usb ethernet activity (news server and alot of downloading) and video is being

Re: a 15 GB file on tmpfs

2005-07-21 Thread Andrew Burgess
On Wed, Jul 20, 2005 at 02:16:36PM +0200, Bastiaan Naber wrote: > I have a 15 GB file which I want to place in memory via tmpfs. I want to do > this because I need to have this data accessible with a very low seek time. You don't want tmpfs. You want either (1) ramfs and copy the data once at

Re: a 15 GB file on tmpfs

2005-07-21 Thread Andrew Burgess
On Wed, Jul 20, 2005 at 02:16:36PM +0200, Bastiaan Naber wrote: I have a 15 GB file which I want to place in memory via tmpfs. I want to do this because I need to have this data accessible with a very low seek time. You don't want tmpfs. You want either (1) ramfs and copy the data once at boot

Re: Oops on boot with both 2.2.17 and 2.4.0t8p6

2000-09-08 Thread Andrew Burgess
> >Oops from 2.2.17 (some more before this, but it went offscreen): ... > You need to capture and decode the first oops. Compile a kernel with a > serial console and capture the oops log on a second machine. Or set your console for more than 80x25 using SVGATextMode. I use

Re: Problems with bluesmoke.c in 2.2.17

2000-09-08 Thread Andrew Burgess
And here's another sample output for you: CPU 1: Machine Check Exception: 0004<0>Bank 0: f20001000800general protection fault: CPU:1 EIP:0010:[mcheck_fault+263/368] EFLAGS: 00010246 ... I seldom get a log entry, most of the time I get the first line on all my

Re: Problems with bluesmoke.c in 2.2.17

2000-09-08 Thread Andrew Burgess
> Based on what bluesmoke.c said about my 2nd PII-333 CPU I just got Intel to give me an RMA number for its replacement. Thank you Alan Cox ;-) : ~v - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] Please read the FAQ

Re: Problems with bluesmoke.c in 2.2.17

2000-09-08 Thread Andrew Burgess
Based on what bluesmoke.c said about my 2nd PII-333 CPU I just got Intel to give me an RMA number for its replacement. Thank you Alan Cox ;-) : ~v - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] Please read the FAQ at

Re: Problems with bluesmoke.c in 2.2.17

2000-09-08 Thread Andrew Burgess
sorry about that truncated email, this is the rest of what I ment to send And here's another sample output for you: CPU 1: Machine Check Exception: 00040Bank 0: f20001000800general protection fault: CPU:1 EIP:0010:[mcheck_fault+263/368] EFLAGS: 00010246 ... I

Re: Oops on boot with both 2.2.17 and 2.4.0t8p6

2000-09-08 Thread Andrew Burgess
Oops from 2.2.17 (some more before this, but it went offscreen): ... You need to capture and decode the first oops. Compile a kernel with a serial console and capture the oops log on a second machine. Or set your console for more than 80x25 using SVGATextMode. I use /usr/sbin/SVGATextMode