> No, you still have the hardware problem.

What hardware problem?

There seems to be an unspoken assumption that any checksum error detected by 
ZFS is caused by a relatively high error rate in the underlying hardware.

There are at least two classes of hardware-related errors. One class are those 
which are genuinely being introduced at a high rate, as exemplified by the post 
earlier in this list about the bad FibreChannel port on a SAN. The other are 
those which are very rare events, for instance a radiation-induced bit-flip in 
SRAM. In this case, there is no “problem” as such to be repaired (well, perhaps 
if you live in Denver you could buy radiation shielding for your computer room 
;-).

(There are also software errors. Errors in ZFS itself or anywhere else in the 
Solaris kernel, including device drivers, can result in erroneous data being 
written to disk. There may be a software problem, rather than a hardware 
problem, in any individual case.)

Clearly, the existence of a high error rate (say, more than one error every two 
weeks on a server pushing 100 MB/second) would point to a hardware or software 
problem; but fewer errors may simply be “normal” for standard hardware.
 
 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to