On 01/04/2014 02:18 PM, Chris Murphy wrote:
> I'm not sure what else you're referring to? (working on boot environment of btrfs)

Just the string of caveats around mounting at boot time: needing to monkey-patch 00_header to avoid the bogus sparse-file error (which, worse, tells you to press a key when pressing a key does nothing), followed by this - in my opinion completely unexpected - behavior when a disk is missing from a fault-tolerant array, which also requires monkey-patching in fstab and now elsewhere in GRUB to avoid.
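
For anyone following along at home, the degraded-array part of the monkey-patching amounts to roughly this sort of thing - a sketch only, since file names, the GRUB variable, and the grub.cfg path all vary by distro:

  # /etc/fstab: add "degraded" to the root's mount options so a raid1/raid10
  # root is still allowed to mount with a device missing
  #   ... defaults,degraded ...

  # /etc/default/grub: pass the same flag on the kernel command line so the
  # initramfs can mount the degraded root (appended to whatever is already there)
  GRUB_CMDLINE_LINUX_DEFAULT="rootflags=degraded"

  # then regenerate grub.cfg, e.g.
  grub-mkconfig -o /boot/grub/grub.cfg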

Please keep in mind - I think we got off on the wrong foot here, and I'm sorry for my part in that; it was unintentional. I *love* btrfs and think the devs are doing incredible work. I'm excited about it, and I'm aware it's not intended for production yet. However, it's just on the cusp: with distributions not only including it in their installers but a couple teetering on the fence about declaring it their next default FS (Oracle Unbreakable Linux, openSUSE, hell, even Red Hat was flirting with the idea), it seems to me some extra testing with an eye towards production isn't a bad thing. That's why I'm here - not to crap on anybody, but to get involved, hopefully helpfully.

> fs_passno is 1 which doesn't apply to Btrfs.
Again, that's the distribution's default, so the argument should be with them, not me... with that said, I'd respectfully argue that fs_passno 1 is correct for any root filesystem: if the filesystem itself declines to run an fsck, that's up to the filesystem, but it's still correct to specify fs_passno 1 for anything that's going to be mounted as root in the first place.
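
Concretely, the kind of entry I mean looks like this (the UUID is made up, obviously):

  # <filesystem>       <mountpoint> <type> <options> <dump> <passno>
  UUID=<root-fs-uuid>  /            btrfs  defaults  0      1

The sixth field just declares that the root should be checked first; if the filesystem's fsck then chooses to do nothing, that's its call, but the declaration itself is still correct.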

I'm open to hearing why that's a bad idea, if you have a specific reason?

> Well actually LVM thinp does have fast snapshots without requiring preallocation, and uses COW.

LVM's snapshots aren't very useful for me - there's a performance penalty while you have them in place, so they're best used as a transient use-then-immediately-delete feature, for instance for rsync'ing off a database binary. Until recently there also wasn't a good way to roll back an LV to a snapshot, and even now that can be pretty problematic. Finally, there's no way to pull a partial copy out of an LV snapshot and back into production, so if, e.g., you have virtual machines of significant size, you could be looking at *hours* of file copy operations to restore an individual VM out of a snapshot (if you even have the drive space available for it), as compared to btrfs's cp --reflink=always, which lets you do the same thing essentially instantaneously (sketched below).

FWIW, I think the ability to do cp --reflink=always is one of the big killer features that makes btrfs more attractive than ZFS (which, again FWIW, I have 5+ years of experience with, and which is my current primary storage system).
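
To make that concrete, the restore-one-VM case looks roughly like this on btrfs (subvolume and file names made up for illustration):

  # take a read-only snapshot of the subvolume holding the VM images
  btrfs subvolume snapshot -r /srv/vms /srv/.snapshots/vms-2014-01-04

  # later: pull a single image back out of the snapshot - only shared-extent
  # metadata gets written, so it's effectively instantaneous regardless of size
  cp --reflink=always /srv/.snapshots/vms-2014-01-04/guest1.img /srv/vms/guest1.img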

> I'm not sure what you mean by self-correcting, but if the drive reports a read error md, lvm, and Btrfs raid1+ all will get missing data from mirror/parity reconstruction, and write corrected data back to the bad sector.

You're assuming that the drive will actually *report* a read error, which frequently isn't the case. I have a production ZFS array right now on which I need to replace an Intel SSD - the SSD has thrown over 10K checksum errors in six months, with zero read or write errors. Neither hardware RAID nor mdraid nor LVM would have helped me there.

Since I started running filesystems that do block-level checksumming, I've become aware that bitrot happens - without any hardware error being thrown - FAR more often than I would have thought before I had the tools to spot it. ZFS, and now btrfs, are the only tools at hand that can actually prevent it.
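
For anyone who wants to see this on their own hardware, the counters are easy to get at (mount point and pool name are just examples):

  # btrfs: scrub verifies every block against its checksum and repairs from
  # the good copy where redundancy (raid1/raid10/dup) exists
  btrfs scrub start -B /mnt/data
  btrfs device stats /mnt/data     # per-device read/write/corruption counters

  # ZFS: the CKSUM column in zpool status is where my SSD's errors showed up
  zpool status -v tank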
