Frank Steinmetzger wrote:
> Am Tue, Apr 12, 2022 at 06:01:11PM -0500 schrieb Dale:
>
>>> The advantage of an integrity scheme (like ZFS or comparing with a checksum
>>> file) over your rsync approach is that you only need to read all the datas™
>>> from one drive instead of two. Plus: if rsync actually detects a change, it
>>> doesn’t know which of the two drives introduced the error. You need to find
>>> out yourself after the fact (which probably won’t be hard, but still, it’s
>>> one more manual step).
>> In this case, if something had changed, I'd have no problem manually
>> checking the file to be sure which was good and which was bad.
> Consider a big video file, which I know you like to accumulate from youtube
> and the likes. How do you find out the broken one? By watching it and trying
> to find the one image or audio frame that is garbled? The drive might return
> zeros or other garbage (bit flip) instead of actual content without SMART
> noticing it (uncorrectable error).
>

In this case, I'd likely rename one file and keep them both until I can
figure out which is good.  That said, I'd certainly keep the backup copy
because odds are, it is good since the error came well after my last
backup.  At this point tho, I don't know what file was on that bad spot. 


>> Given
>> the error is recent on my drive, I'd suspect the backups to still be a
>> good file.  For that reason, I'd suspect the backup file to be good
>> therefore not to be overwritten.  I was trying to avoid a bad file
>> replacing a good file on the backup which then destroys all good files
>> and leaves only bad ones.  This is why I like that SMART at least let me
>> know there is a problem. 
> I also tend to rely on smart, but it’s not all-knowing and probably not
> infallible.
>
>


This is very true.  I mentioned elsewhere that things like spindle motor
failure or the motor that moves the heads are usually not detectable. 
Some component failures can be detected but not all or even most from
what I've read.  Basically, the best you can hope for is SMART seeing a
bad spot on the media itself.  That it seems it can detect most of the
time. 

TL;DR next two paragraphs.  Just a interesting story along this line.  I
used to work in parts at a fortune 500 office company.  We had millions
of dollars of just computer stuff in inventory just for computers.  That
was in early 90's.  They also had copiers and their parts, paper etc
etc.  We used a NCR computer for a computer system for the whole
company.  At the end of the building was a speed bump so people wouldn't
go flying down the one lane road between the building and fence on the
property line.  One day a large truck almost empty went a little faster
than normal over the last speed bump.  It shook the building to the
point I could feel it about 150 feet away.  The computer room was like
50 feet away from that side of the building.  It seems the hard drive
felt it very well.  One, maybe more, of the head(s) got under the media
and started peeling it off the platter and made a really ugly screeching
sound.  No routine shutdown, they just pulled the plug.  As you can
imagine tho, it did no good.  Even way back then drives of that speed
were spinning fast enough.  I suspect even by the time a person could
blink it was way past fixing. 

That of course was way before SMART came along but SMART would never be
able to predict such a failure.  Even NCR said it was likely a 1 in a
million chance that the truck hits just when the head was moving over a
weak spot.  Several thousand dollars later, and a private plane bringing
in a new drive, the drive was replaced.  Of course, the idiot in charge
had no backups that were of any use.  All of them were several weeks
old, likely over a month.  Luckily he stayed far away from me for at
least a month.  Otherwise, I'd likely still be in jail, with my hands
around the neck of his corpse.  :-@

SMART isn't a sure thing but it can help in some cases which is better
than nothing at all. 

Dale

:-)  :-) 

Reply via email to