Most ECC setups are as you describe. The memory hardware detects and corrects 
all 1-bit errors, and detects all two-bit errors on its own. What ... should 
... happen is that the OS should get an interrupt when this happens so it has 
the opportunity to note the error in logs and to higher level stuff if needed - 
map out the memory in question, call an operator, halt and catch fire, etc. But 
the hardware must have that interrupt line connected to something to even make 
this possible. And the OS doesn't have to do anything, necessarily.

Although, as mentioned, if the OS has a low level read-through-memory routine, 
it does guarantee that memory is scrubbed of one-bit error and bad pages found.
-- 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to