Re: [zfs-discuss] Re: zfs corrupted my data!

Toby Thain Tue, 28 Nov 2006 19:49:53 -0800


On 28-Nov-06, at 10:35 PM, Anton B. Rang wrote:

No, you still have the hardware problem.
What hardware problem?
There seems to be an unspoken assumption that any checksum error detected by ZFS is caused by a relatively high error rate in the underlying hardware.
There are at least two classes of hardware-related errors. One class are those which are genuinely being introduced at a high rate, as exemplified by the post earlier in this list about the bad FibreChannel port on a SAN. The other are those which are very rare events, for instance a radiation-induced bit-flip in SRAM. In this case, there is no “problem” as such to be repaired (well, perhaps if you live in Denver you could buy radiation shielding for your computer room ;-).
(There are also software errors. Errors in ZFS itself or anywhere else in the Solaris kernel, including device drivers, can result in erroneous data being written to disk. There may be a software problem, rather than a hardware problem, in any individual case.)
Clearly, the existence of a high error rate (say, more than one error every two weeks on a server pushing 100 MB/second) would point to a hardware or software problem; but fewer errors may simply be “normal” for standard hardware.
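
For scale, the threshold quoted above can be worked out with a quick back-of-the-envelope calculation. The sketch below is only illustrative: it assumes decimal megabytes and a server that sustains 100 MB/second for the full two weeks, and the names are made up for this example.

```python
# Rough arithmetic for "one error every two weeks at 100 MB/second".
# Assumptions (not stated in the post): 1 MB = 10**6 bytes, and the
# throughput is sustained without pause for the whole interval.
THROUGHPUT_BYTES_PER_SEC = 100 * 10**6
SECONDS_PER_TWO_WEEKS = 14 * 24 * 60 * 60


def bytes_moved(throughput_bytes_per_sec, seconds):
    """Total bytes transferred at a constant rate over the interval."""
    return throughput_bytes_per_sec * seconds


total_bytes = bytes_moved(THROUGHPUT_BYTES_PER_SEC, SECONDS_PER_TWO_WEEKS)
total_bits = total_bytes * 8

print(f"bytes moved in two weeks: {total_bytes:.3e}")
print(f"bits moved in two weeks:  {total_bits:.3e}")
```

Under those assumptions the server moves about 1.2 x 10^14 bytes (roughly 10^15 bits) in two weeks, so the quoted threshold amounts to about one detected error per 10^15 bits transferred.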

Her original configuration wasn't redundant, so she should expect this kind of manual recovery from time to time. Seems a logical conclusion to me? Or is this one of those once-in-a-lifetime strikes?


--Toby



This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


