Joe Fleming wrote:
Hey all, I have a Debian box that was acting as a 4 drive RAID-5 mdadm
softraid server. I heard one of the drives making strange noises but
mdstat reported no problems with any of the drives. I decided to copy
the data off the array so I had a backup before I tried to figure out
which drive it was. Unfortunately, in the middle of copying said data,
2 of the drives dropped out at the same time. Since RAID-5 only
tolerates a single drive failure, the whole array is basically hosed
now. I've had drives drop out on me before, but never 2 at once. Sigh.
I tried to Google a little about dealing with multi-drive failures
with mdadm, but I couldn't find much in my initial looking. I'm going
to keep digging, but I thought I'd post a question to the group and
see what happens. So, is there a way to tell mdadm to "unmark" one of
the 2 drives as failed and try to bring up the array again WITHOUT
rebuilding it? I really don't think both of the drives failed on me
simultaneously and I'd like to try to return 1 of the 2 to the array
and test my theory. If I can get the array back up, I can either keep
trying to copy data off it or add a new replacement and try to
rebuild. I'm pretty much a novice with mdadm, though I don't see an option
that will let me do what I want. Can anyone offer me some advice or
point me in the right direction..... or am I just SOL?
As a side note, why can't hard drive manufacturers make drives that
last anymore? I've had like 5 drives fail on me in the last year...
WD, Seagate, Hitachi, they all suck equally! I can't find any that
last for any reasonable amount of time, and all the warranties leave
you with reman'd drives, which fail even more rapidly; some even show
up DOA. Plus, I'm not sending my unencrypted data off to some random
place! Sorry for venting, just a little ticked off at all of this.
Thanks in advance for any help.
-Joe
I've had luck in the past recovering from a multi-drive failure where
the second "failed" drive was not truly dead, but rather was dropped
because of an I/O error caused by a thermal recalibration or something
similar. The trick is to re-assemble the array with the dropped drive,
using the option that forces it NOT to try to rebuild. This used to
require several options like --really-force and --really-dangerous,
but now I think it's just something like --assemble --force /dev/md0.
This brings the array back up in its degraded (still down 1 disk)
state. If possible, replace the failed disk or copy your data off
before the other flaky drive fails.
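
For what it's worth, here's roughly the sequence I'd try. This is just
a sketch: /dev/md0 comes from your mail, but the /dev/sd[abcd]1 member
names and the /mnt/recovery mount point are placeholders for whatever
your drives and paths actually are.

  # Compare the event counters on the members; the drive that dropped
  # out first will have a lower count and is the one NOT to trust.
  mdadm --examine /dev/sda1 /dev/sdb1 /dev/sdc1 /dev/sdd1 | grep -i events

  # Stop whatever half-assembled state the array is in, then force
  # assembly from only the members you trust, leaving out the drive
  # you think is genuinely bad (sdd1 here, purely as an example).
  mdadm --stop /dev/md0
  mdadm --assemble --force /dev/md0 /dev/sda1 /dev/sdb1 /dev/sdc1

  # Confirm it came up degraded, then mount read-only and copy the
  # data off before touching anything else.
  cat /proc/mdstat
  mount -o ro /dev/md0 /mnt/recovery

Only after the data is safe would I --add a replacement drive and let
the array resync.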
---------------------------------------------------
PLUG-discuss mailing list - PLUG-discuss@lists.plug.phoenix.az.us
To subscribe, unsubscribe, or to change your mail settings:
http://lists.PLUG.phoenix.az.us/mailman/listinfo/plug-discuss