So I started looking at the mkfs.btrfs manual page with an eye towards documenting some of its tidbits, like metadata automatically switching from dup to raid1 when more than one device is used.
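
For concreteness, the switch I mean looks like this (a sketch; the loop devices and /mnt are stand-ins, and the defaults are as I read them from the man page, so verify locally):

# single device: metadata profile defaults to dup
mkfs.btrfs -f /dev/loop0

# more than one device: metadata silently switches to raid1
mkfs.btrfs -f /dev/loop{0..1}

# either way, the profiles actually in use can be checked after mounting
mount /dev/loop0 /mnt
btrfs fi df /mnt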

In experimenting I ended up with some questions...

(1) Why is the dup profile for data restricted to a single device, and even then only in mixed mode?

Gust t # mkfs.btrfs -f /dev/loop{0..1} -d dup
Error: unable to create FS with data profile 16 (have 2 devices)

Gust t # mkfs.btrfs -f /dev/loop0 -d dup
Error: dup for data is allowed only in mixed mode
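
Presumably the invocation that error message is pointing at is the mixed-mode one, something like this (untested sketch; -M turns on mixed block groups, where data and metadata share block groups and so must share a profile):

mkfs.btrfs -f -M -d dup -m dup /dev/loop0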


(2) Why is the metadata dup profile restricted to a single device at creation time, when it will run that way just fine after a device add?

Gust t # mkfs.btrfs -f /dev/loop{0..1} -m dup
Error: unable to create FS with metadata profile 32 (have 2 devices)
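
The "runs that way just fine" sequence I have in mind, spelled out (a sketch; /mnt is a placeholder):

mkfs.btrfs -f -m dup /dev/loop0     # dup metadata on a single device: accepted
mount /dev/loop0 /mnt
btrfs device add /dev/loop1 /mnt    # two devices now, metadata still dup
btrfs fi df /mnt                    # stays dup unless converted with balance -mconvert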

(3) Why can I make a raid5 out of two devices? (I understand that we are currently just making mirrors, but the standard requires three devices in the geometry, etc., so I would expect a two-device RAID5 to be considered degraded, with all that entails. Allowing this just looks like asking for trouble once the support is finalized: suddenly a working RAID5 that's really a mirror would become something that can only be mounted with the degraded flag.)

Gust t # mkfs.btrfs -f /dev/loop{0..1} -d raid5 -m raid5
Btrfs v3.17.1
See http://btrfs.wiki.kernel.org for more information.

Performing full device TRIM (2.00GiB) ...
Turning ON incompat feature 'extref': increased hardlink limit per file to 65536
Turning ON incompat feature 'raid56': raid56 extended format
Performing full device TRIM (2.00GiB) ...
adding device /dev/loop1 id 2
fs created label (null) on /dev/loop0
        nodesize 16384 leafsize 16384 sectorsize 4096 size 4.00GiB
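
(To confirm the two-device filesystem really carries the raid5 profile rather than something quietly demoted, mount it and check:

mount /dev/loop0 /mnt
btrfs fi df /mnt

which should list Data and Metadata as RAID5.)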


(4) Same question for raid6 but with three drives instead of the mandated four.
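
By analogy, the invocation in question here would be:

mkfs.btrfs -f /dev/loop{0..2} -d raid6 -m raid6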

(5) If I can make a RAID5 or RAID6 device with one missing element, why can't I make a RAID1 out of one drive, i.e. with one missing element?

(6) If I make a RAID1 out of three devices, are there three copies of every extent, or always two copies that are semi-randomly spread across the three devices? (Likewise for more than three.)
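
One way to probe this empirically (a sketch: write a known amount of data, then compare the logical usage against the raw bytes consumed across devices; two copies should show up as ~2x, three copies as ~3x):

mkfs.btrfs -f /dev/loop{0..2} -d raid1 -m raid1
mount /dev/loop0 /mnt
dd if=/dev/zero of=/mnt/fill bs=1M count=512
sync
btrfs fi df /mnt     # logical data used
btrfs fi show /mnt   # raw bytes used per device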

---

It seems to me (very dangerous words in computer science, I know) that we need a "failed" device designator, so that a device can be in the geometry (i.e. have a device ID) without actually existing. Reads from and writes to the failed device would always return errors.

The failed device would be subject to replacement with "btrfs dev replace", and a phantom failed device could equally stand in as the replacement for a problematic device, to drop it out of an array.

EXAMPLE:
Gust t # mkfs.btrfs -f /dev/loop0 failed -d raid1 -m raid1
Btrfs v3.17.1
See http://btrfs.wiki.kernel.org for more information.

Performing full device TRIM (2.00GiB) ...
Turning ON incompat feature 'extref': increased hardlink limit per file to 65536
Processing explicitly missing device
adding device (failed) id 2 (phantom device)

mount /dev/loop0 /mountpoint

btrfs replace start 2 /dev/loop1 /mountpoint

(and so on)

Being able to "replace" a faulty device with a phantom "failed" device would nicely disambiguate the whole device add/remove versus device replace confusion.

It would make the degraded status less mysterious.

A filesystem with an explicitly failed element would also make the future roll-out of full RAID5/6 less confusing.
--