Bug#437344: linux-image-2.6.21-2-686: seemingly random SATA disk lockup with ICH5 intel bridge

2007-08-11 Thread Mathieu Roy
Package: linux-image-2.6.21-2-686
Version: 2.6.21-6
Severity: important


Hello,

I'm trying to get to run a Seagate 250GB hard disk. It works with Microsoft 
Windows XP so it is not a hardware issue.

Sometimes the system boots perfectly, sometimes an error show up soon in the 
boot process an then the hard disk, while recognized, does not allow access to 
anything on it, including its partition table, making the system temporarily 
locked each time it tries to access it.

It is not a partition issue or whatever, the partition model is simple and was 
erased and cleaned several times.

The report is made against the latest kernel I have installed but it fails also 
with kernel 2.6.15.


Here comes a boot process, the interesting part is the following. If it does 
not show up, then the disk works perfectly. Considering the problem is random 
to me (maybe there a logic, but I do not get it)

Aug 10 12:15:29 eos kernel: SCSI subsystem initialized
Aug 10 12:15:29 eos kernel: [] __report_bad_irq+0x36/0x75
Aug 10 12:15:29 eos kernel: [] note_interrupt+0x1a8/0x1e3
Aug 10 12:15:29 eos kernel: [] usb_hcd_irq+0x21/0x4e [usbcore]
Aug 10 12:15:29 eos kernel: [] handle_IRQ_event+0x23/0x51
Aug 10 12:15:29 eos kernel: [] handle_fasteoi_irq+0x86/0xa6
Aug 10 12:15:29 eos kernel: [] do_IRQ+0x55/0x6f
Aug 10 12:15:29 eos kernel: [] irq_exit+0x53/0x6b
Aug 10 12:15:29 eos kernel: [] smp_apic_timer_interrupt+0x71/0x7d
Aug 10 12:15:29 eos kernel: [] common_interrupt+0x23/0x28
Aug 10 12:15:29 eos kernel: ===
Aug 10 12:15:29 eos kernel: hda: ST380021A, ATA DISK drive

later comes up things like:

Aug 10 12:15:29 eos kernel: ATA: abnormal status 0x7F on port 0x0001c007
Aug 10 12:15:29 eos kernel: scsi1 : ata_piix
Aug 10 12:15:29 eos kernel: ata2.00: ATA-7: ST3250620AS, 3.AAK, max UDMA/133
Aug 10 12:15:29 eos kernel: ata2.00: 488397168 sectors, multi 16: LBA48 NCQ 
(depth 0/32)
Aug 10 12:15:29 eos kernel: ata2.00: qc timeout (cmd 0xef)
Aug 10 12:15:29 eos kernel: ata2: failed to recover some devices, retrying in 5 
secs
Aug 10 12:15:29 eos kernel: ata2.00: qc timeout (cmd 0xef)
Aug 10 12:15:29 eos kernel: ata2.00: limiting speed to UDMA/133:PIO3
Aug 10 12:15:29 eos kernel: ata2: failed to recover some devices, retrying in 5 
secs
Aug 10 12:15:29 eos kernel: ata2.00: qc timeout (cmd 0xef)
Aug 10 12:15:29 eos kernel: ata2.00: disabled


It comes from this boot messages (ending here by a CTRL-ALT-DEL)

#
Aug 10 12:15:29 eos syslog-ng[2986]: syslog-ng starting up; version='2.0.0' 
Aug 10 12:15:29 eos kernel: Linux version 2.6.21-2-686 (Debian 2.6.21-6) 
([EMAIL PROTECTED]) (gcc version 4.1.3 20070629 (prerelease) (Debian 4.1.2-13)) 
#1 SMP Wed Jul 11 03:53:02 UTC 2007
Aug 10 12:15:29 eos kernel: BIOS-provided physical RAM map:
Aug 10 12:15:29 eos kernel: sanitize start
Aug 10 12:15:29 eos kernel: sanitize end
Aug 10 12:15:29 eos kernel: copy_e820_map() start:  size: 
0009f800 end: 0009f800 type: 1
Aug 10 12:15:29 eos kernel: copy_e820_map() type is E820_RAM
Aug 10 12:15:29 eos kernel: copy_e820_map() start: 0009f800 size: 
0800 end: 000a type: 2
Aug 10 12:15:29 eos kernel: copy_e820_map() start: 000f size: 
0001 end: 0010 type: 2
Aug 10 12:15:29 eos kernel: copy_e820_map() start: 0010 size: 
3fef end: 3fff type: 1
Aug 10 12:15:29 eos kernel: copy_e820_map() type is E820_RAM
Aug 10 12:15:29 eos kernel: copy_e820_map() start: 3fff size: 
3000 end: 3fff3000 type: 4
Aug 10 12:15:29 eos kernel: copy_e820_map() start: 3fff3000 size: 
d000 end: 4000 type: 3
Aug 10 12:15:29 eos kernel: copy_e820_map() start: fec0 size: 
0140 end: 0001 type: 2
Aug 10 12:15:29 eos kernel: BIOS-e820:  - 0009f800 
(usable)
Aug 10 12:15:29 eos kernel: BIOS-e820: 0009f800 - 000a 
(reserved)
Aug 10 12:15:29 eos kernel: BIOS-e820: 000f - 0010 
(reserved)
Aug 10 12:15:29 eos kernel: BIOS-e820: 0010 - 3fff 
(usable)
Aug 10 12:15:29 eos kernel: BIOS-e820: 3fff - 3fff3000 
(ACPI NVS)
Aug 10 12:15:29 eos kernel: BIOS-e820: 3fff3000 - 4000 
(ACPI data)
Aug 10 12:15:29 eos kernel: BIOS-e820: fec0 - 0001 
(reserved)
Aug 10 12:15:29 eos kernel: 127MB HIGHMEM available.
Aug 10 12:15:29 eos kernel: 896MB LOWMEM available.
Aug 10 12:15:29 eos kernel: found SMP MP-table at 000f5350
Aug 10 12:15:29 eos kernel: Zone PFN ranges:
Aug 10 12:15:29 eos kernel: DMA 0 -> 4096
Aug 10 12:15:29 eos kernel: Normal   4096 ->   229376
Aug 10 12:15:29 eos kernel: HighMem229376 ->   262128
Aug 10 12:15:29 eos kernel: early_node_map[1] active PFN ranges
Aug 10 12:15:29 eos kernel: 0:0 ->   262128
Aug 

Bug#437344: linux-image-2.6.21-2-686: seemingly random SATA disk lockup with ICH5 intel bridge

2007-08-12 Thread maximilian attems
tags 437344 moreinfo
stop


On Sun, 12 Aug 2007, Mathieu Roy wrote:

> Package: linux-image-2.6.21-2-686
> Version: 2.6.21-6
> Severity: important

stopped reading at that point, there is newer linux images
in the archive install them directly from unstable.

if you can still reproduce trouble checkout 2.6.23-rc2,
see trunk apt lines
http://wiki.debian.org/DebianKernel

 
thanks

-- 
maks
 


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]