Re: Requesting migrate device options for raid5/6

Bill Davidsen Thu, 01 Nov 2007 06:41:35 -0800

Goswin von Brederlow wrote:

Hi,


I would welcome if someone could work on a new feature for raid5/6
that would allow replacing a disk in a raid5/6 with a new one without
having to degrade the array.

Consider the following situation:

raid5 md0 : sda sdb sdc

Now sda gives a "SMART - failure iminent" warning and you want to
repalce it with sdd.

% mdadm --fail /dev/md0 /dev/sda
% mdadm --remove /dev/md0 /dev/sda
% mdadm --add /dev/md0 /dev/sdd

Further consider that drive sdb will give an I/O error during resync
of the array or fail completly. The array is in degraded mode so you
experience data loss.

That's a two drive failure, so you will lose data.

But that is completly avoidable and some hardware raids support disk
migration too. Loosly speaking the kernel should do the following:

No, it's not "completly avoidable" because have described sda is readyto fail and sdb as "will give an I/O error" so if both happen at onceyou will lose data because you have no valid copy. That said, some ofwhat you describe below is possible to *reduce* the probability offailure. But if sdb is going to have i/o errors, you really need toreplace two drive :-(

See below for some thoughts.

raid5 md0 : sda sdb sdc
-> create internal raid1 or dm-mirror
raid1 mdT : sda
raid5 md0 : mdT sdb sdc
-> hot add sdd to mdT
raid1 mdT : sda sdd
raid5 md0 : mdT sdb sdc
-> resync and then drop sda
raid1 mdT : sdd
raid5 md0 : mdT sdb sdc
-> remove internal mirror

raid5 md0 : sdd sdb sdc


Thoughts?


If there were a "migrate" option, it might work something like this:

Given a migrate from sda to sdd, as you noted and raid1 between sda andsdd needs to be created, and obviously all chunks of sdd need to bemarked as needing rebuild, but in addition sda needs to be maderead-only, to minimize the i/o and to prevent any errors which mightcome from a failed write, like failed sector relocates, etc. Also, ifvalid data for a chunk is on sdd, no read would be done to sda. I thinkthere's relevant code in the "write-mostly" bits to implement keep i/oto sda to a minimum, no writes and only mandatory reads when no validchunk is on sdd yet. This is similar to recovery to a spare, save thatmost data will be valid on the failing drive and doesn't need to berecreated, only unreadable data must be done the slow way.

Care is needed for sda as well, so that if sdd fails during migrate, alast chance attempt to bring sda back to useful content can be made, I'mparanoid that way.

Assuming the migrate works correctly, sda is removed from the array, andthe superblock should be marked to reflect that. Now sdd is a part ofthe array, and assemble, at least using UUID, should work.

I personally think that a migrate capability would be vastly useful,both for handling failing drives and just moving data to a better place.As you point out, the user commands are not *quite* as robust as aninternal implementation could be, and are complex enough to invite usererror. I certainly always write down steps before doing migrate, and ifpossible do it with the system booted from a rescue media.


--
bill davidsen <[EMAIL PROTECTED]>
 CTO TMR Associates, Inc
 Doing interesting things with small computers since 1979

-
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: Requesting migrate device options for raid5/6

Reply via email to