On 2/11/07, Matty <[EMAIL PROTECTED]> wrote:
Howdy,
On one of my Solaris 10 11/06 servers, I am getting numerous errors
similar to the following:
Feb 11 09:30:23 rx scsi: WARNING: /[EMAIL PROTECTED],2000/[EMAIL
PROTECTED],1/[EMAIL PROTECTED],0 (sd1):
Feb 11 09:30:23 rx Error for Command: write(10)
Error Level: Retryable
Feb 11 09:30:23 rx scsi: Requested Block: 58458343
Error Block: 58458343
Feb 11 09:30:23 rx scsi: Vendor: SEAGATE
Serial Number: 0404A72YCG
Feb 11 09:30:23 rx scsi: Sense Key: Hardware Error
Feb 11 09:30:23 rx scsi: ASC: 0x19 (defect list error), ASCQ:
0x0, FRU: 0x2
Feb 11 09:32:18 rx scsi: WARNING: /[EMAIL PROTECTED],2000/[EMAIL
PROTECTED],1/[EMAIL PROTECTED],0 (sd1):
Feb 11 09:32:18 rx Error for Command: write(10)
Error Level: Retryable
Feb 11 09:32:18 rx scsi: Requested Block: 58696759
Error Block: 58696501
Feb 11 09:32:18 rx scsi: Vendor: SEAGATE
Serial Number: 0404A72YCG
Feb 11 09:32:18 rx scsi: Sense Key: Media Error
Feb 11 09:32:18 rx scsi: ASC: 0xc (write error - auto
reallocation failed), ASCQ: 0x2, FRU: 0x1
Assuming I am reading the error message correctly, it looks like the
disk drive (c2t2d0) has used up all of all of the spare sectors used
to reallocate bad sectors. If this is the case, is there a reason
Solaris doesn't offline the drive? This would allow ZFS to evict the
faulty disk from my pool, and kick in the spare disk drive I have
configured:
$ zpool status -v
pool: rz2pool
state: ONLINE
scrub: scrub completed with 0 errors on Sat Feb 10 18:46:54 2007
config:
NAME STATE READ WRITE CKSUM
rz2pool ONLINE 0 0 0
raidz2 ONLINE 0 0 0
c1t9d0 ONLINE 0 0 0
c1t10d0 ONLINE 0 0 0
c1t12d0 ONLINE 0 0 0
c2t1d0 ONLINE 0 0 0
c2t2d0 ONLINE 0 0 0
spares
c2t3d0 AVAIL
errors: No known data errors
Thanks for any insight,
- Ryan
--
UNIX Administrator
http://prefetch.net
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
We've seen the same thing with sil3124 or marvel chipsets and SATA
drives, but when the errors come up the pool wedges and the hot spare
is never automatically utilized. I'm not sure if FMA and friends
actually use the spare and start a re-silver automatically. It appears
to still be a manual effort.
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss