In June of 2010, we were running Solaris AFS file servers on Solaris
with ZFS for partitions on a SATAbeast.

AFS reported I/O error from read() that were ZFS checksums.

Turned out the hardware logs on that SATAbeast were reporting problems
but would continue to serve up the bad data.

Since ZFS is doing checksums when it writes and then again when it reads,
ZFS was catching intermittent errors which other systems might not catch.

Here is a nice explanation of how and why ZFS does checksum.
It also points out other source of corruption that can occur
on a SAN.

http://blogs.sun.com/bonwick/entry/zfs_end_to_end_data

And this one that sounds a lot like our problem!!

http://blogs.sun.com/elowe/entry/zfs_saves_the_day_ta

And this is one of the reasons why ZFS is so cool :)

Yes it is cool!


_______________________________________________
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info


--

 Douglas E. Engert  <deeng...@anl.gov>
 Argonne National Laboratory
 9700 South Cass Avenue
 Argonne, Illinois  60439
 (630) 252-5444
_______________________________________________
OpenAFS-info mailing list
OpenAFS-info@openafs.org
https://lists.openafs.org/mailman/listinfo/openafs-info

Reply via email to