Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-21 Thread Tejun Heo

Christian wrote:

I've tested multiple kernels (including -mm series) in the range of 2.6.19.7
(before sata_nv adma support went in) up to 2.6.20-rc4.
Every NCQ enabled kernel I've tested showed ata errors in dmesg. So I came to 
the conclusion that my system was faulty. I ran memtest86+ for a long time, 
but no errors were found. After some fiddling with my HW I discovered that 
the nforce chipset fan induced some kind of electro magnetic interference to 
the southbridge, which could clearly be heard as a low frequency noise if I 
plugged in my speakers to the onboard sound. After replacing the fan, my 
system is stable again. Now running 2.6.21-rc3-mm2+rsdlv31 without errors. 
Really strange problem he ;-)


Man, that's the strangest way to solve ATA command failures I've ever 
heard of.  Kudos to you for finding it out.  :-)


--
tejun
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-21 Thread Christian
On Monday 19 March 2007 08:39:15 Tejun Heo wrote:
> Christian wrote:
> > Yes, for me the problem was introduced recently. I have moved around
> > terabytes (sic!) on my discs with older kernels and I never got errors.
>
> There is always the possibility of disk going bad, so it would be great
> if you can boot an older kernel and verify that the problem doesn't
> occur on it.
>
> Thanks.

I've tested multiple kernels (including -mm series) in the range of 2.6.19.7
(before sata_nv adma support went in) up to 2.6.20-rc4.
Every NCQ enabled kernel I've tested showed ata errors in dmesg. So I came to 
the conclusion that my system was faulty. I ran memtest86+ for a long time, 
but no errors were found. After some fiddling with my HW I discovered that 
the nforce chipset fan induced some kind of electro magnetic interference to 
the southbridge, which could clearly be heard as a low frequency noise if I 
plugged in my speakers to the onboard sound. After replacing the fan, my 
system is stable again. Now running 2.6.21-rc3-mm2+rsdlv31 without errors. 
Really strange problem he ;-)

-Christian
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-21 Thread Christian
On Monday 19 March 2007 08:39:15 Tejun Heo wrote:
 Christian wrote:
  Yes, for me the problem was introduced recently. I have moved around
  terabytes (sic!) on my discs with older kernels and I never got errors.

 There is always the possibility of disk going bad, so it would be great
 if you can boot an older kernel and verify that the problem doesn't
 occur on it.

 Thanks.

I've tested multiple kernels (including -mm series) in the range of 2.6.19.7
(before sata_nv adma support went in) up to 2.6.20-rc4.
Every NCQ enabled kernel I've tested showed ata errors in dmesg. So I came to 
the conclusion that my system was faulty. I ran memtest86+ for a long time, 
but no errors were found. After some fiddling with my HW I discovered that 
the nforce chipset fan induced some kind of electro magnetic interference to 
the southbridge, which could clearly be heard as a low frequency noise if I 
plugged in my speakers to the onboard sound. After replacing the fan, my 
system is stable again. Now running 2.6.21-rc3-mm2+rsdlv31 without errors. 
Really strange problem he ;-)

-Christian
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-21 Thread Tejun Heo

Christian wrote:

I've tested multiple kernels (including -mm series) in the range of 2.6.19.7
(before sata_nv adma support went in) up to 2.6.20-rc4.
Every NCQ enabled kernel I've tested showed ata errors in dmesg. So I came to 
the conclusion that my system was faulty. I ran memtest86+ for a long time, 
but no errors were found. After some fiddling with my HW I discovered that 
the nforce chipset fan induced some kind of electro magnetic interference to 
the southbridge, which could clearly be heard as a low frequency noise if I 
plugged in my speakers to the onboard sound. After replacing the fan, my 
system is stable again. Now running 2.6.21-rc3-mm2+rsdlv31 without errors. 
Really strange problem he ;-)


Man, that's the strangest way to solve ATA command failures I've ever 
heard of.  Kudos to you for finding it out.  :-)


--
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-20 Thread Max Kellermann
On 2007/03/19 13:09, Jeff Garzik <[EMAIL PROTECTED]> wrote:
> I may have missed the answer to this before, but: does the problem
> go away if you disable preempt?

On my system (same problem, original bug report), preemption is
disabled.

Max

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-20 Thread Max Kellermann
On 2007/03/19 13:09, Jeff Garzik [EMAIL PROTECTED] wrote:
 I may have missed the answer to this before, but: does the problem
 go away if you disable preempt?

On my system (same problem, original bug report), preemption is
disabled.

Max

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Pablo Sebastian Greco

Christian wrote:

On Sunday 18 March 2007 06:43:09 you wrote:
  

Christian wrote:


This does indeed look like a drive side issue to me (the controller is
reporting CPBs with response flags 2 which as far as I can tell
indicates it's still waiting for the drive to complete the request).


I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
support made it in the -mm kernel (maybe around 2.6.19-mm? or even
earlyer). I'm seeing this problem excessively since I upgraded to
2.6.21-rc3-mm1. I think something got broken recently...
  

Can you post the result of "hdparm -I /dev/sdX"?



Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT

[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda

/dev/sda:

ATA device, with non-removable media
Model Number:   SAMSUNG HD401LJ
Serial Number:  S0HVJ1FL900207
Firmware Revision:  ZZ100-15
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  781422768
device size with M = 1024*1024:  381554 MBytes
device size with M = 1000*1000:  400088 MBytes (400 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 0
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7

 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
 Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
Enabled Supported:
   *SMART feature set
Security Mode feature set
   *Power Management feature set
   *Write cache
   *Look-ahead
   *Host Protected Area feature set
   *WRITE_BUFFER command
   *READ_BUFFER command
   *NOP cmd
   *DOWNLOAD_MICROCODE
SET_MAX security extension
Automatic Acoustic Management feature set
   *48-bit Address feature set
   *Device Configuration Overlay feature set
   *Mandatory FLUSH_CACHE
   *FLUSH_CACHE_EXT
   *SMART error logging
   *SMART self-test
   *General Purpose Logging feature set
   *SATA-I signaling speed (1.5Gb/s)
   *SATA-II signaling speed (3.0Gb/s)
   *Native Command Queueing (NCQ)
   *Host-initiated interface power management
   *Phy event counters
DMA Setup Auto-Activate optimization
Device-initiated interface power management
   *Software settings preservation
   *SMART Command Transport (SCT) feature set
   *SCT Long Sector Access (AC1)
   *SCT LBA Segment Access (AC2)
   *SCT Error Recovery Control (AC3)
   *SCT Features Control (AC4)
   *SCT Data Tables (AC5)
Security:
Master password revision code = 65534
supported
not enabled
not locked
frozen
not expired: security count
supported: enhanced erase
228min for SECURITY ERASE UNIT. 228min for ENHANCED SECURITY ERASE 
UNIT.

Checksum: correct


[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sdb

/dev/sdb:

ATA device, with non-removable media
Model Number:   SAMSUNG SP2504C
Serial Number:  S09QJ1LYC06381
Firmware Revision:  VT100-33
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  488397168
device size with M = 1024*1024:  238475 MBytes
device size with M = 1000*1000:  250059 MBytes (250 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 254
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7

 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
 

Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Jeff Garzik

Christian wrote:

On Monday 19 March 2007 03:48:14 you wrote:

Christian wrote:

On Sunday 18 March 2007 06:43:09 you wrote:

Christian wrote:

This does indeed look like a drive side issue to me (the controller is
reporting CPBs with response flags 2 which as far as I can tell
indicates it's still waiting for the drive to complete the request).

I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
support made it in the -mm kernel (maybe around 2.6.19-mm? or even
earlyer). I'm seeing this problem excessively since I upgraded to
2.6.21-rc3-mm1. I think something got broken recently...

Can you post the result of "hdparm -I /dev/sdX"?

Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT

[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda

/dev/sda:

ATA device, with non-removable media
Model Number:   SAMSUNG HD401LJ
Serial Number:  S0HVJ1FL900207
Firmware Revision:  ZZ100-15
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  781422768
device size with M = 1024*1024:  381554 MBytes
device size with M = 1000*1000:  400088 MBytes (400 GB)

That's a fairly recent drive.  Does the problem go away if you downgrade
the kernel?


Yes, for me the problem was introduced recently. I have moved around terabytes 
(sic!) on my discs with older kernels and I never got errors.


I may have missed the answer to this before, but:  does the problem go 
away if you disable preempt?


Jeff



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Tejun Heo
Christian wrote:
> Yes, for me the problem was introduced recently. I have moved around 
> terabytes 
> (sic!) on my discs with older kernels and I never got errors.

There is always the possibility of disk going bad, so it would be great
if you can boot an older kernel and verify that the problem doesn't
occur on it.

Thanks.

-- 
tejun
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Christian
On Monday 19 March 2007 03:48:14 you wrote:
> Christian wrote:
> > On Sunday 18 March 2007 06:43:09 you wrote:
> >> Christian wrote:
>  This does indeed look like a drive side issue to me (the controller is
>  reporting CPBs with response flags 2 which as far as I can tell
>  indicates it's still waiting for the drive to complete the request).
> >>>
> >>> I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
> >>> support made it in the -mm kernel (maybe around 2.6.19-mm? or even
> >>> earlyer). I'm seeing this problem excessively since I upgraded to
> >>> 2.6.21-rc3-mm1. I think something got broken recently...
> >>
> >> Can you post the result of "hdparm -I /dev/sdX"?
> >
> > Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT
> >
> > [EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda
> >
> > /dev/sda:
> >
> > ATA device, with non-removable media
> > Model Number:   SAMSUNG HD401LJ
> > Serial Number:  S0HVJ1FL900207
> > Firmware Revision:  ZZ100-15
> > Standards:
> > Used: ATA/ATAPI-7 T13 1532D revision 4a
> > Supported: 7 6 5 4
> > Configuration:
> > Logical max current
> > cylinders   16383   16383
> > heads   16  16
> > sectors/track   63  63
> > --
> > CHS current addressable sectors:   16514064
> > LBAuser addressable sectors:  268435455
> > LBA48  user addressable sectors:  781422768
> > device size with M = 1024*1024:  381554 MBytes
> > device size with M = 1000*1000:  400088 MBytes (400 GB)
>
> That's a fairly recent drive.  Does the problem go away if you downgrade
> the kernel?

Yes, for me the problem was introduced recently. I have moved around terabytes 
(sic!) on my discs with older kernels and I never got errors.

-Christian
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Christian
On Monday 19 March 2007 03:48:14 you wrote:
 Christian wrote:
  On Sunday 18 March 2007 06:43:09 you wrote:
  Christian wrote:
  This does indeed look like a drive side issue to me (the controller is
  reporting CPBs with response flags 2 which as far as I can tell
  indicates it's still waiting for the drive to complete the request).
 
  I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
  support made it in the -mm kernel (maybe around 2.6.19-mm? or even
  earlyer). I'm seeing this problem excessively since I upgraded to
  2.6.21-rc3-mm1. I think something got broken recently...
 
  Can you post the result of hdparm -I /dev/sdX?
 
  Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT
 
  [EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda
 
  /dev/sda:
 
  ATA device, with non-removable media
  Model Number:   SAMSUNG HD401LJ
  Serial Number:  S0HVJ1FL900207
  Firmware Revision:  ZZ100-15
  Standards:
  Used: ATA/ATAPI-7 T13 1532D revision 4a
  Supported: 7 6 5 4
  Configuration:
  Logical max current
  cylinders   16383   16383
  heads   16  16
  sectors/track   63  63
  --
  CHS current addressable sectors:   16514064
  LBAuser addressable sectors:  268435455
  LBA48  user addressable sectors:  781422768
  device size with M = 1024*1024:  381554 MBytes
  device size with M = 1000*1000:  400088 MBytes (400 GB)

 That's a fairly recent drive.  Does the problem go away if you downgrade
 the kernel?

Yes, for me the problem was introduced recently. I have moved around terabytes 
(sic!) on my discs with older kernels and I never got errors.

-Christian
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Tejun Heo
Christian wrote:
 Yes, for me the problem was introduced recently. I have moved around 
 terabytes 
 (sic!) on my discs with older kernels and I never got errors.

There is always the possibility of disk going bad, so it would be great
if you can boot an older kernel and verify that the problem doesn't
occur on it.

Thanks.

-- 
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Jeff Garzik

Christian wrote:

On Monday 19 March 2007 03:48:14 you wrote:

Christian wrote:

On Sunday 18 March 2007 06:43:09 you wrote:

Christian wrote:

This does indeed look like a drive side issue to me (the controller is
reporting CPBs with response flags 2 which as far as I can tell
indicates it's still waiting for the drive to complete the request).

I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
support made it in the -mm kernel (maybe around 2.6.19-mm? or even
earlyer). I'm seeing this problem excessively since I upgraded to
2.6.21-rc3-mm1. I think something got broken recently...

Can you post the result of hdparm -I /dev/sdX?

Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT

[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda

/dev/sda:

ATA device, with non-removable media
Model Number:   SAMSUNG HD401LJ
Serial Number:  S0HVJ1FL900207
Firmware Revision:  ZZ100-15
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  781422768
device size with M = 1024*1024:  381554 MBytes
device size with M = 1000*1000:  400088 MBytes (400 GB)

That's a fairly recent drive.  Does the problem go away if you downgrade
the kernel?


Yes, for me the problem was introduced recently. I have moved around terabytes 
(sic!) on my discs with older kernels and I never got errors.


I may have missed the answer to this before, but:  does the problem go 
away if you disable preempt?


Jeff



-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-19 Thread Pablo Sebastian Greco

Christian wrote:

On Sunday 18 March 2007 06:43:09 you wrote:
  

Christian wrote:


This does indeed look like a drive side issue to me (the controller is
reporting CPBs with response flags 2 which as far as I can tell
indicates it's still waiting for the drive to complete the request).


I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
support made it in the -mm kernel (maybe around 2.6.19-mm? or even
earlyer). I'm seeing this problem excessively since I upgraded to
2.6.21-rc3-mm1. I think something got broken recently...
  

Can you post the result of hdparm -I /dev/sdX?



Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT

[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda

/dev/sda:

ATA device, with non-removable media
Model Number:   SAMSUNG HD401LJ
Serial Number:  S0HVJ1FL900207
Firmware Revision:  ZZ100-15
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  781422768
device size with M = 1024*1024:  381554 MBytes
device size with M = 1000*1000:  400088 MBytes (400 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 0
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7

 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
 Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
Enabled Supported:
   *SMART feature set
Security Mode feature set
   *Power Management feature set
   *Write cache
   *Look-ahead
   *Host Protected Area feature set
   *WRITE_BUFFER command
   *READ_BUFFER command
   *NOP cmd
   *DOWNLOAD_MICROCODE
SET_MAX security extension
Automatic Acoustic Management feature set
   *48-bit Address feature set
   *Device Configuration Overlay feature set
   *Mandatory FLUSH_CACHE
   *FLUSH_CACHE_EXT
   *SMART error logging
   *SMART self-test
   *General Purpose Logging feature set
   *SATA-I signaling speed (1.5Gb/s)
   *SATA-II signaling speed (3.0Gb/s)
   *Native Command Queueing (NCQ)
   *Host-initiated interface power management
   *Phy event counters
DMA Setup Auto-Activate optimization
Device-initiated interface power management
   *Software settings preservation
   *SMART Command Transport (SCT) feature set
   *SCT Long Sector Access (AC1)
   *SCT LBA Segment Access (AC2)
   *SCT Error Recovery Control (AC3)
   *SCT Features Control (AC4)
   *SCT Data Tables (AC5)
Security:
Master password revision code = 65534
supported
not enabled
not locked
frozen
not expired: security count
supported: enhanced erase
228min for SECURITY ERASE UNIT. 228min for ENHANCED SECURITY ERASE 
UNIT.

Checksum: correct


[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sdb

/dev/sdb:

ATA device, with non-removable media
Model Number:   SAMSUNG SP2504C
Serial Number:  S09QJ1LYC06381
Firmware Revision:  VT100-33
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  488397168
device size with M = 1024*1024:  238475 MBytes
device size with M = 1000*1000:  250059 MBytes (250 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 254
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7

 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
   

Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-18 Thread Tejun Heo
Christian wrote:
> On Sunday 18 March 2007 06:43:09 you wrote:
>> Christian wrote:
 This does indeed look like a drive side issue to me (the controller is
 reporting CPBs with response flags 2 which as far as I can tell
 indicates it's still waiting for the drive to complete the request).
>>> I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
>>> support made it in the -mm kernel (maybe around 2.6.19-mm? or even
>>> earlyer). I'm seeing this problem excessively since I upgraded to
>>> 2.6.21-rc3-mm1. I think something got broken recently...
>> Can you post the result of "hdparm -I /dev/sdX"?
> 
> Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT
> 
> [EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda
> 
> /dev/sda:
> 
> ATA device, with non-removable media
> Model Number:   SAMSUNG HD401LJ
> Serial Number:  S0HVJ1FL900207
> Firmware Revision:  ZZ100-15
> Standards:
> Used: ATA/ATAPI-7 T13 1532D revision 4a
> Supported: 7 6 5 4
> Configuration:
> Logical max current
> cylinders   16383   16383
> heads   16  16
> sectors/track   63  63
> --
> CHS current addressable sectors:   16514064
> LBAuser addressable sectors:  268435455
> LBA48  user addressable sectors:  781422768
> device size with M = 1024*1024:  381554 MBytes
> device size with M = 1000*1000:  400088 MBytes (400 GB)

That's a fairly recent drive.  Does the problem go away if you downgrade
the kernel?

-- 
tejun
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-18 Thread Christian
On Sunday 18 March 2007 06:43:09 you wrote:
> Christian wrote:
> >> This does indeed look like a drive side issue to me (the controller is
> >> reporting CPBs with response flags 2 which as far as I can tell
> >> indicates it's still waiting for the drive to complete the request).
> >
> > I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
> > support made it in the -mm kernel (maybe around 2.6.19-mm? or even
> > earlyer). I'm seeing this problem excessively since I upgraded to
> > 2.6.21-rc3-mm1. I think something got broken recently...
>
> Can you post the result of "hdparm -I /dev/sdX"?

Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT

[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda

/dev/sda:

ATA device, with non-removable media
Model Number:   SAMSUNG HD401LJ
Serial Number:  S0HVJ1FL900207
Firmware Revision:  ZZ100-15
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  781422768
device size with M = 1024*1024:  381554 MBytes
device size with M = 1000*1000:  400088 MBytes (400 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 0
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7
 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
 Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
Enabled Supported:
   *SMART feature set
Security Mode feature set
   *Power Management feature set
   *Write cache
   *Look-ahead
   *Host Protected Area feature set
   *WRITE_BUFFER command
   *READ_BUFFER command
   *NOP cmd
   *DOWNLOAD_MICROCODE
SET_MAX security extension
Automatic Acoustic Management feature set
   *48-bit Address feature set
   *Device Configuration Overlay feature set
   *Mandatory FLUSH_CACHE
   *FLUSH_CACHE_EXT
   *SMART error logging
   *SMART self-test
   *General Purpose Logging feature set
   *SATA-I signaling speed (1.5Gb/s)
   *SATA-II signaling speed (3.0Gb/s)
   *Native Command Queueing (NCQ)
   *Host-initiated interface power management
   *Phy event counters
DMA Setup Auto-Activate optimization
Device-initiated interface power management
   *Software settings preservation
   *SMART Command Transport (SCT) feature set
   *SCT Long Sector Access (AC1)
   *SCT LBA Segment Access (AC2)
   *SCT Error Recovery Control (AC3)
   *SCT Features Control (AC4)
   *SCT Data Tables (AC5)
Security:
Master password revision code = 65534
supported
not enabled
not locked
frozen
not expired: security count
supported: enhanced erase
228min for SECURITY ERASE UNIT. 228min for ENHANCED SECURITY ERASE 
UNIT.
Checksum: correct


[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sdb

/dev/sdb:

ATA device, with non-removable media
Model Number:   SAMSUNG SP2504C
Serial Number:  S09QJ1LYC06381
Firmware Revision:  VT100-33
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  488397168
device size with M = 1024*1024:  238475 MBytes
device size with M = 1000*1000:  250059 MBytes (250 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 254
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7
 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
 Cycle 

Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-18 Thread Christian
On Sunday 18 March 2007 06:43:09 you wrote:
 Christian wrote:
  This does indeed look like a drive side issue to me (the controller is
  reporting CPBs with response flags 2 which as far as I can tell
  indicates it's still waiting for the drive to complete the request).
 
  I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
  support made it in the -mm kernel (maybe around 2.6.19-mm? or even
  earlyer). I'm seeing this problem excessively since I upgraded to
  2.6.21-rc3-mm1. I think something got broken recently...

 Can you post the result of hdparm -I /dev/sdX?

Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT

[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda

/dev/sda:

ATA device, with non-removable media
Model Number:   SAMSUNG HD401LJ
Serial Number:  S0HVJ1FL900207
Firmware Revision:  ZZ100-15
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  781422768
device size with M = 1024*1024:  381554 MBytes
device size with M = 1000*1000:  400088 MBytes (400 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 0
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7
 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
 Cycle time: no flow control=120ns  IORDY flow control=120ns
Commands/features:
Enabled Supported:
   *SMART feature set
Security Mode feature set
   *Power Management feature set
   *Write cache
   *Look-ahead
   *Host Protected Area feature set
   *WRITE_BUFFER command
   *READ_BUFFER command
   *NOP cmd
   *DOWNLOAD_MICROCODE
SET_MAX security extension
Automatic Acoustic Management feature set
   *48-bit Address feature set
   *Device Configuration Overlay feature set
   *Mandatory FLUSH_CACHE
   *FLUSH_CACHE_EXT
   *SMART error logging
   *SMART self-test
   *General Purpose Logging feature set
   *SATA-I signaling speed (1.5Gb/s)
   *SATA-II signaling speed (3.0Gb/s)
   *Native Command Queueing (NCQ)
   *Host-initiated interface power management
   *Phy event counters
DMA Setup Auto-Activate optimization
Device-initiated interface power management
   *Software settings preservation
   *SMART Command Transport (SCT) feature set
   *SCT Long Sector Access (AC1)
   *SCT LBA Segment Access (AC2)
   *SCT Error Recovery Control (AC3)
   *SCT Features Control (AC4)
   *SCT Data Tables (AC5)
Security:
Master password revision code = 65534
supported
not enabled
not locked
frozen
not expired: security count
supported: enhanced erase
228min for SECURITY ERASE UNIT. 228min for ENHANCED SECURITY ERASE 
UNIT.
Checksum: correct


[EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sdb

/dev/sdb:

ATA device, with non-removable media
Model Number:   SAMSUNG SP2504C
Serial Number:  S09QJ1LYC06381
Firmware Revision:  VT100-33
Standards:
Used: ATA/ATAPI-7 T13 1532D revision 4a
Supported: 7 6 5 4
Configuration:
Logical max current
cylinders   16383   16383
heads   16  16
sectors/track   63  63
--
CHS current addressable sectors:   16514064
LBAuser addressable sectors:  268435455
LBA48  user addressable sectors:  488397168
device size with M = 1024*1024:  238475 MBytes
device size with M = 1000*1000:  250059 MBytes (250 GB)
Capabilities:
LBA, IORDY(can be disabled)
Queue depth: 32
Standby timer values: spec'd by Standard, no device specific minimum
R/W multiple sector transfer: Max = 16  Current = 16
Recommended acoustic management value: 254, current value: 254
DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 udma5 *udma6 
udma7
 Cycle time: min=120ns recommended=120ns
PIO: pio0 pio1 pio2 pio3 pio4
 Cycle time: no flow control=120ns  

Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-18 Thread Tejun Heo
Christian wrote:
 On Sunday 18 March 2007 06:43:09 you wrote:
 Christian wrote:
 This does indeed look like a drive side issue to me (the controller is
 reporting CPBs with response flags 2 which as far as I can tell
 indicates it's still waiting for the drive to complete the request).
 I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA
 support made it in the -mm kernel (maybe around 2.6.19-mm? or even
 earlyer). I'm seeing this problem excessively since I upgraded to
 2.6.21-rc3-mm1. I think something got broken recently...
 Can you post the result of hdparm -I /dev/sdX?
 
 Output generated on 2.6.21-rc3-mm1 #3 SMP PREEMPT
 
 [EMAIL PROTECTED]:~$ sudo hdparm -I /dev/sda
 
 /dev/sda:
 
 ATA device, with non-removable media
 Model Number:   SAMSUNG HD401LJ
 Serial Number:  S0HVJ1FL900207
 Firmware Revision:  ZZ100-15
 Standards:
 Used: ATA/ATAPI-7 T13 1532D revision 4a
 Supported: 7 6 5 4
 Configuration:
 Logical max current
 cylinders   16383   16383
 heads   16  16
 sectors/track   63  63
 --
 CHS current addressable sectors:   16514064
 LBAuser addressable sectors:  268435455
 LBA48  user addressable sectors:  781422768
 device size with M = 1024*1024:  381554 MBytes
 device size with M = 1000*1000:  400088 MBytes (400 GB)

That's a fairly recent drive.  Does the problem go away if you downgrade
the kernel?

-- 
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-17 Thread Tejun Heo
Christian wrote:
>> This does indeed look like a drive side issue to me (the controller is
>> reporting CPBs with response flags 2 which as far as I can tell
>> indicates it's still waiting for the drive to complete the request).
> 
> I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA support 
> made it in the -mm kernel (maybe around 2.6.19-mm? or even earlyer). I'm 
> seeing this problem excessively since I upgraded to 2.6.21-rc3-mm1. I think 
> something got broken recently...

Can you post the result of "hdparm -I /dev/sdX"?

-- 
tejun
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-17 Thread Tejun Heo
Christian wrote:
 This does indeed look like a drive side issue to me (the controller is
 reporting CPBs with response flags 2 which as far as I can tell
 indicates it's still waiting for the drive to complete the request).
 
 I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA support 
 made it in the -mm kernel (maybe around 2.6.19-mm? or even earlyer). I'm 
 seeing this problem excessively since I upgraded to 2.6.21-rc3-mm1. I think 
 something got broken recently...

Can you post the result of hdparm -I /dev/sdX?

-- 
tejun
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-16 Thread Christian
> This does indeed look like a drive side issue to me (the controller is
> reporting CPBs with response flags 2 which as far as I can tell
> indicates it's still waiting for the drive to complete the request).

I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA support 
made it in the -mm kernel (maybe around 2.6.19-mm? or even earlyer). I'm 
seeing this problem excessively since I upgraded to 2.6.21-rc3-mm1. I think 
something got broken recently...

-Christian
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-16 Thread Christian
On Friday 16 March 2007 12:20:02 Max Kellermann wrote:
> Hi,
>
> I have been trying to upgrade my 2.6.19 kernel for a while now,
> without success.  All kernel versions since 2.6.20 failed with a
> timeout while trying to write to my SATA hard disk.  After I wait for
> a while (during which all I/O to the hard disk blocks), the kernel
> message "NCQ disabled due to excessive errors" appears, and the system
> seems to work perfectly.
>
> My system is an Athlon64 X2 on an Asus A8N-Premium.  I'm using the
> sata_nv driver for the on-board SATA controller.  My only hard disk is
> a Samsum HD401LJ (400 GB).
>
> The kernel is build statically, and the only module is nvidia.ko.  If
> someone believes it may make a difference, I can try the kernel
> without the proprietary module.
>
> Please see the kernel config and dmesg attached to this email.
>
> I don't know the libata subsystem well, but I suspect my hard drive
> has to be added to the NCQ blacklist.
>
> Max

Hi,

I have the same HW ;-) And the same problem on 2.6.21-rc3-mm1.


[ 8212.682404] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400 next cpb count 0x0 next cpb idx 0x0
[ 8212.682409] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682411] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682413] ata1: CPB 3: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682416] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400
[ 8212.682419] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682420] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682422] ata1: CPB 3: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682427] ata1: timeout waiting for ADMA IDLE, stat=0x400
[ 8212.682432] ata1: timeout waiting for ADMA LEGACY, stat=0x400
[ 8212.682439] ata1.00: exception Emask 0x0 SAct 0xe SErr 0x20 action 0x2 
frozen
[ 8212.682445] ata1.00: cmd 61/00:08:df:90:f7/02:00:02:00:00/40 tag 1 cdb 0x0 
data 262144 out
[ 8212.682446]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8212.682451] ata1.00: cmd 61/00:10:df:92:f7/02:00:02:00:00/40 tag 2 cdb 0x0 
data 262144 out
[ 8212.682453]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8212.682457] ata1.00: cmd 61/00:18:df:94:f7/02:00:02:00:00/40 tag 3 cdb 0x0 
data 262144 out
[ 8212.682459]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8212.833749] ata1: soft resetting port
[ 8212.941187] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 8212.992763] ata1.00: configured for UDMA/133
[ 8212.992837] ata1: EH complete
[ 8212.992906] SCSI device sda: 781422768 512-byte hdwr sectors (400088 MB)
[ 8212.992936] sda: Write Protect is off
[ 8212.992938] sda: Mode Sense: 00 3a 00 00
[ 8212.993048] SCSI device sda: write cache: enabled, read cache: enabled, 
doesn't support DPO or FUA
[ 8227.981905] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400 next cpb count 0x2 next cpb idx 0x0
[ 8227.981910] ata1: CPB 0: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981912] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981914] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981917] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400
[ 8227.981919] ata1: CPB 0: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981921] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981923] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981928] ata1: timeout waiting for ADMA IDLE, stat=0x400
[ 8227.981933] ata1: timeout waiting for ADMA LEGACY, stat=0x400
[ 8227.981940] ata1.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x2 
frozen
[ 8227.981945] ata1.00: cmd 61/00:00:df:94:f7/02:00:02:00:00/40 tag 0 cdb 0x0 
data 262144 out
[ 8227.981947]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8227.981952] ata1.00: cmd 61/00:08:df:92:f7/02:00:02:00:00/40 tag 1 cdb 0x0 
data 262144 out
[ 8227.981953]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8227.981958] ata1.00: cmd 61/00:10:df:90:f7/02:00:02:00:00/40 tag 2 cdb 0x0 
data 262144 out
[ 8227.981959]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8228.133306] ata1: soft resetting port
[ 8228.208763] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 8228.258729] ata1.00: configured for UDMA/133
[ 8228.258824] ata1: EH complete
[ 8228.258926] SCSI device sda: 781422768 512-byte hdwr sectors (400088 MB)
[ 8228.259021] sda: Write Protect is off
[ 8228.259023] sda: Mode Sense: 00 3a 00 00
[ 8228.259132] SCSI device sda: write cache: enabled, read cache: enabled, 
doesn't support DPO or FUA
[ 8243.247980] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400 next cpb count 0x2 next cpb idx 0x0
[ 8243.247985] ata1: CPB 0: ctl_flags 0x1f, resp_flags 0x0
[ 8243.247987] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x0
[ 8243.247989] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x0
[ 8243.247992] ata1: EH in ADMA mode, 

Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-16 Thread Robert Hancock

(linux-ide cc'ed)

Max Kellermann wrote:

Hi,

I have been trying to upgrade my 2.6.19 kernel for a while now,
without success.  All kernel versions since 2.6.20 failed with a
timeout while trying to write to my SATA hard disk.  After I wait for
a while (during which all I/O to the hard disk blocks), the kernel
message "NCQ disabled due to excessive errors" appears, and the system
seems to work perfectly.

My system is an Athlon64 X2 on an Asus A8N-Premium.  I'm using the
sata_nv driver for the on-board SATA controller.  My only hard disk is
a Samsum HD401LJ (400 GB).

The kernel is build statically, and the only module is nvidia.ko.  If
someone believes it may make a difference, I can try the kernel
without the proprietary module.

Please see the kernel config and dmesg attached to this email.

I don't know the libata subsystem well, but I suspect my hard drive
has to be added to the NCQ blacklist.

Max


This does indeed look like a drive side issue to me (the controller is 
reporting CPBs with response flags 2 which as far as I can tell 
indicates it's still waiting for the drive to complete the request).


--
Robert Hancock  Saskatoon, SK, Canada
To email, remove "nospam" from [EMAIL PROTECTED]
Home Page: http://www.roberthancock.com/

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-16 Thread Christian
On Friday 16 March 2007 12:20:02 Max Kellermann wrote:
 Hi,

 I have been trying to upgrade my 2.6.19 kernel for a while now,
 without success.  All kernel versions since 2.6.20 failed with a
 timeout while trying to write to my SATA hard disk.  After I wait for
 a while (during which all I/O to the hard disk blocks), the kernel
 message NCQ disabled due to excessive errors appears, and the system
 seems to work perfectly.

 My system is an Athlon64 X2 on an Asus A8N-Premium.  I'm using the
 sata_nv driver for the on-board SATA controller.  My only hard disk is
 a Samsum HD401LJ (400 GB).

 The kernel is build statically, and the only module is nvidia.ko.  If
 someone believes it may make a difference, I can try the kernel
 without the proprietary module.

 Please see the kernel config and dmesg attached to this email.

 I don't know the libata subsystem well, but I suspect my hard drive
 has to be added to the NCQ blacklist.

 Max

Hi,

I have the same HW ;-) And the same problem on 2.6.21-rc3-mm1.


[ 8212.682404] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400 next cpb count 0x0 next cpb idx 0x0
[ 8212.682409] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682411] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682413] ata1: CPB 3: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682416] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400
[ 8212.682419] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682420] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682422] ata1: CPB 3: ctl_flags 0x1f, resp_flags 0x2
[ 8212.682427] ata1: timeout waiting for ADMA IDLE, stat=0x400
[ 8212.682432] ata1: timeout waiting for ADMA LEGACY, stat=0x400
[ 8212.682439] ata1.00: exception Emask 0x0 SAct 0xe SErr 0x20 action 0x2 
frozen
[ 8212.682445] ata1.00: cmd 61/00:08:df:90:f7/02:00:02:00:00/40 tag 1 cdb 0x0 
data 262144 out
[ 8212.682446]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8212.682451] ata1.00: cmd 61/00:10:df:92:f7/02:00:02:00:00/40 tag 2 cdb 0x0 
data 262144 out
[ 8212.682453]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8212.682457] ata1.00: cmd 61/00:18:df:94:f7/02:00:02:00:00/40 tag 3 cdb 0x0 
data 262144 out
[ 8212.682459]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8212.833749] ata1: soft resetting port
[ 8212.941187] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 8212.992763] ata1.00: configured for UDMA/133
[ 8212.992837] ata1: EH complete
[ 8212.992906] SCSI device sda: 781422768 512-byte hdwr sectors (400088 MB)
[ 8212.992936] sda: Write Protect is off
[ 8212.992938] sda: Mode Sense: 00 3a 00 00
[ 8212.993048] SCSI device sda: write cache: enabled, read cache: enabled, 
doesn't support DPO or FUA
[ 8227.981905] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400 next cpb count 0x2 next cpb idx 0x0
[ 8227.981910] ata1: CPB 0: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981912] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981914] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981917] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400
[ 8227.981919] ata1: CPB 0: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981921] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981923] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x0
[ 8227.981928] ata1: timeout waiting for ADMA IDLE, stat=0x400
[ 8227.981933] ata1: timeout waiting for ADMA LEGACY, stat=0x400
[ 8227.981940] ata1.00: exception Emask 0x0 SAct 0x7 SErr 0x0 action 0x2 
frozen
[ 8227.981945] ata1.00: cmd 61/00:00:df:94:f7/02:00:02:00:00/40 tag 0 cdb 0x0 
data 262144 out
[ 8227.981947]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8227.981952] ata1.00: cmd 61/00:08:df:92:f7/02:00:02:00:00/40 tag 1 cdb 0x0 
data 262144 out
[ 8227.981953]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8227.981958] ata1.00: cmd 61/00:10:df:90:f7/02:00:02:00:00/40 tag 2 cdb 0x0 
data 262144 out
[ 8227.981959]  res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 
(timeout)
[ 8228.133306] ata1: soft resetting port
[ 8228.208763] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 8228.258729] ata1.00: configured for UDMA/133
[ 8228.258824] ata1: EH complete
[ 8228.258926] SCSI device sda: 781422768 512-byte hdwr sectors (400088 MB)
[ 8228.259021] sda: Write Protect is off
[ 8228.259023] sda: Mode Sense: 00 3a 00 00
[ 8228.259132] SCSI device sda: write cache: enabled, read cache: enabled, 
doesn't support DPO or FUA
[ 8243.247980] ata1: EH in ADMA mode, notifier 0x0 notifier_error 0x0 gen_ctl 
0x1501000 status 0x400 next cpb count 0x2 next cpb idx 0x0
[ 8243.247985] ata1: CPB 0: ctl_flags 0x1f, resp_flags 0x0
[ 8243.247987] ata1: CPB 1: ctl_flags 0x1f, resp_flags 0x0
[ 8243.247989] ata1: CPB 2: ctl_flags 0x1f, resp_flags 0x0
[ 8243.247992] ata1: EH in ADMA mode, notifier 0x0 

Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-16 Thread Christian
 This does indeed look like a drive side issue to me (the controller is
 reporting CPBs with response flags 2 which as far as I can tell
 indicates it's still waiting for the drive to complete the request).

I have been using this hw-config (SATA II, NCQ) since the nvidia ADMA support 
made it in the -mm kernel (maybe around 2.6.19-mm? or even earlyer). I'm 
seeing this problem excessively since I upgraded to 2.6.21-rc3-mm1. I think 
something got broken recently...

-Christian
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [BUG 2.6.21-rc3-git9] SATA NCQ failure with Samsum HD401LJ

2007-03-16 Thread Robert Hancock

(linux-ide cc'ed)

Max Kellermann wrote:

Hi,

I have been trying to upgrade my 2.6.19 kernel for a while now,
without success.  All kernel versions since 2.6.20 failed with a
timeout while trying to write to my SATA hard disk.  After I wait for
a while (during which all I/O to the hard disk blocks), the kernel
message NCQ disabled due to excessive errors appears, and the system
seems to work perfectly.

My system is an Athlon64 X2 on an Asus A8N-Premium.  I'm using the
sata_nv driver for the on-board SATA controller.  My only hard disk is
a Samsum HD401LJ (400 GB).

The kernel is build statically, and the only module is nvidia.ko.  If
someone believes it may make a difference, I can try the kernel
without the proprietary module.

Please see the kernel config and dmesg attached to this email.

I don't know the libata subsystem well, but I suspect my hard drive
has to be added to the NCQ blacklist.

Max


This does indeed look like a drive side issue to me (the controller is 
reporting CPBs with response flags 2 which as far as I can tell 
indicates it's still waiting for the drive to complete the request).


--
Robert Hancock  Saskatoon, SK, Canada
To email, remove nospam from [EMAIL PROTECTED]
Home Page: http://www.roberthancock.com/

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/