On Mon, Feb 27, 2006 at 11:33:19PM +0100, Juan Piñeros wrote:
>   1 Raw_Read_Error_Rate     0x000d   100   100   050    Pre-fail  Offline      -       51
> 195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       2
> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       9

This is where your issue seems to live. I have never seen the read
error count and the ECC-recovered count fail to match. A mismatch would
mean that errors occurred with no way to correct them, so I would expect
those reads to return garbage... Did you see any corruption in your
files? I mean corrupted data, as opposed to corrupted metadata?
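
To make the comparison concrete, here is a minimal sketch in Python that
pulls the raw values out of the three attribute rows quoted above and
computes the gap between read errors and ECC recoveries. The function
names are hypothetical, the column layout is assumed to be the usual
`smartctl -A` one (raw value in the tenth column), and raw values are
vendor specific, so treat the result as a hint rather than a diagnosis.

```python
# Attribute rows as quoted above, one attribute per line.
SAMPLE = """\
  1 Raw_Read_Error_Rate     0x000d   100   100   050    Pre-fail  Offline      -       51
195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       2
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       9
"""


def parse_raw_values(text):
    """Return {attribute name: raw value} for every attribute row in text."""
    raw = {}
    for line in text.splitlines():
        fields = line.split()
        # Assumption: an attribute row starts with a numeric ID and has
        # at least ten columns, the tenth being the raw value.
        if len(fields) >= 10 and fields[0].isdigit():
            raw[fields[1]] = int(fields[9])
    return raw


def unrecovered_reads(raw):
    """Read errors that were not matched by an ECC recovery."""
    return raw["Raw_Read_Error_Rate"] - raw["Hardware_ECC_Recovered"]


if __name__ == "__main__":
    values = parse_raw_values(SAMPLE)
    print("read errors without ECC recovery:", unrecovered_reads(values))
```

With the numbers above, 51 read errors against 2 ECC recoveries leaves a
gap of 49, which is the mismatch I am worried about.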

Also, you say that SATA does not support SMART. That is not true: with
one of the very recent kernels (2.6.15.4), you can read the SMART data.
I do not have much experience with the kernels shipped with Debian; I
always compile my own. But some problems I had with an NFS server (in an
HPC system) vanished when I upgraded from 2.6.12 to 2.6.14. There was a
futex bug, and I think that was the source of my problems (race
conditions are always nasty).
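
For what it is worth, a rough sketch of how one might query a SATA disk
from Python, assuming smartmontools is installed and the disk shows up
as /dev/sda behind libata (both are assumptions about your setup, and
the helper names are made up for illustration). The `-d ata` option
tells smartctl to treat the device as ATA even though it appears under a
SCSI-style name.

```python
import subprocess


def smart_command(device):
    """Build the smartctl invocation that prints the attribute table."""
    # -A prints vendor-specific attributes; -d ata forces the ATA device type.
    return ["smartctl", "-A", "-d", "ata", device]


def read_smart_attributes(device):
    """Run smartctl (usually needs root) and return its text output."""
    result = subprocess.run(
        smart_command(device),
        capture_output=True,
        text=True,
        check=False,  # smartctl uses non-zero exit codes as status bits
    )
    return result.stdout


if __name__ == "__main__":
    # /dev/sda is only an example device name.
    print(read_smart_attributes("/dev/sda"))
```

If the kernel is too old to pass the commands through, smartctl will
report an error instead of the attribute table.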

As for the UDMA CRC errors: that usually means that your controller or
cable is going bad. Each time I have seen that, the whole system
crashed, corrupting files everywhere... It is pretty odd that you see
the same thing on two different systems, though.

jacques

PS: With development kernels, always try to use the latest, especially
when you see a problem. (And I still consider 2.6 to be a development
version.)
