Hi Jakob,

2) You read it. In which case the disk can see that the checksum has failed. There's nothing it can do about this except give you a read error on that block.

A good RAID controller *may* help you with scenario (2) above; it can read the
block from a mirror or compute it from parity (etc.), and may even re-write the
correct block to the disk again to cure this single bad-block failure (by
letting the disk re-allocate the block on a write as in case (1) above).

Yes, precisely.

When I buy RAID controllers, I put this requirement directly in my bid specifications. I say something like the following:

------------------------------------------------------------------

During a READ operation, if the RAID controller finds an unreadable (uncorrectable) disk sector, then it will immediately reconstruct the missing data for that sector using redundant data from the rest of the array, and WRITE that data to the unreadable (uncorrectable) sector to force sector reallocation if needed by the disk.

In addition, the RAID controller will perform a continous or regular (at least daily) background scan of all disk sectors to identify and repair any unreadable sectors as described above.

------------------------------------------------------------------

I think it is a mistake to purchase a RAID controller without these features. The absence of these features is the main reason that I don't think Linux software RAID is very good.

Question for the Sun/ZFS experts: Does the Sun X4500 (Thumper) do what I describe above?

Cheers,
        Bruce
_______________________________________________
Beowulf mailing list, [email protected]
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to