possible hardware failure ?

2004-08-22 Thread Claudiu
Hello,
I get this in logs, recently:
ad4: TIMEOUT - WRITE_DMA retrying (2 retries left) LBA=208234968
There are more lines like this, with different LBA values of course.
Is this a time to backup the data and replace the disk ?
Thank you in advance.
Best regards,
--
Claudiu Dragalina-Paraipan
e-mail: [EMAIL PROTECTED]
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: possible hardware failure ?

2004-08-22 Thread Claudiu
I have also checked the SMART state of the driver with smartctl, it 
looks like this:

ID# ATTRIBUTE_NAME  FLAG VALUE WORST THRESH TYPE 
UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate 0x000f   062   056   006Pre-fail 
Always   -   101410838
  3 Spin_Up_Time0x0003   096   096   000Pre-fail 
Always   -   0
  4 Start_Stop_Count0x0032   100   100   020Old_age 
Always   -   0
  5 Reallocated_Sector_Ct   0x0033   100   100   036Pre-fail 
Always   -   0
  7 Seek_Error_Rate 0x000f   081   060   030Pre-fail 
Always   -   146351689
  9 Power_On_Hours  0x0032   094   094   000Old_age 
Always   -   5292
 10 Spin_Retry_Count0x0013   100   100   097Pre-fail 
Always   -   0
 12 Power_Cycle_Count   0x0032   100   100   020Old_age 
Always   -   67
194 Temperature_Celsius 0x0022   034   047   000Old_age   Always 
  -   34
195 Hardware_ECC_Recovered  0x001a   062   056   000Old_age   Always 
  -   101410838
197 Current_Pending_Sector  0x0012   100   100   000Old_age   Always 
  -   0
198 Offline_Uncorrectable   0x0010   100   100   000Old_age 
Offline  -   0
199 UDMA_CRC_Error_Count0x003e   200   200   000Old_age   Always 
  -   0
200 Multi_Zone_Error_Rate   0x   100   253   000Old_age 
Offline  -   0
202 TA_Increase_Count   0x0032   100   253   000Old_age   Always 
  -   0

This does not look very encouraging, but seems to be in reasonable limits.
I have 2 drivers, and I am using vinum for mirroring. Both are in 
similar situation.
Should I trust both of them until the SMART attributes are closer to to 
the WORST case ?

Best regards,
--
Claudiu Dragalina-Paraipan
e-mail: [EMAIL PROTECTED]
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: possible hardware failure ?

2004-08-22 Thread Anton Alin-Adrian
Claudiu wrote:
 I have also checked the SMART state of the driver with smartctl, it 
looks like this: [...]

Same happens with my atapi cdrom from time to time. Could be messed-up IDE 
channel.

But anyway, good time for a backup. Did you try a fsck -y..
Yours,
--
Alin-Adrian Anton
Spintech Systems
GPG keyID 0x1E2FFF2E (2963 0C11 1AF1 96F6 0030 6EE9 D323 639D 1E2F FF2E)
gpg --keyserver pgp.mit.edu --recv-keys 1E2FFF2E
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to [EMAIL PROTECTED]