On Mon, Feb 27, 2006 at 11:33:19PM +0100, Juan Piñeros wrote: > 1 Raw_Read_Error_Rate 0x000d 100 100 050 > Pre-fail Offline > - 51 > 195 Hardware_ECC_Recovered 0x001a 100 100 000 > Old_age Always > - 2 > 199 UDMA_CRC_Error_Count 0x003e 200 200 000 > Old_age Always > - 9
This is where your issue seems to live. I have never seen the read error and ecc corrected number not matching. It would mean that an error occurs but there has been no way to make it right so I would expect the read to be garbage... Did you see any corruption in your files? I mean data corrupted instead of metadata? Also, you say that sata does not support smart. That is not true, with one of the very recent kernels (2.6.15.4), you can get them. I have not much experience with the kernels shipped with debian. I always recompile my own. But some problems I had with an nfs server (in an HPC system) vanished when I upgraded from 2.6.12 to 2.6.14. There was a bug with the futex, and I think that was the source of my problems (race conditions are always nasty). As for the udma crc? That usually means that your controller/cable is going bad. Each time I have seen that, the whole system crashed corrupting files everywhere... That is pretty odd that you see the thig on two different system though. jacques PS: With development kernels, always try to use the latest. Especially when you see a problem. (And I still consider the 2.6 as being a development version)
signature.asc
Description: Digital signature