On 2017-08-14 15:54, Christoph Anton Mitterer wrote:
> On Mon, 2017-08-14 at 11:53 -0400, Austin S. Hemmelgarn wrote:
>> Quite a few applications actually _do_ have some degree of secondary verification or protection from a crash. Go look at almost any database software.
> Then please give proper references for this!
> This is from 2015, where you claimed this already; I looked up all the bigger DBs, and they either couldn't do it at all, didn't do it by default, or required application support (i.e. from the programs using the DB):
> https://www.spinics.net/lists/linux-btrfs/msg50258.html
Go look at Chrome, or Firefox, or Opera, or any other major web browser.
At minimum, they will safely bail out if they detect corruption in the
user profile, and the profile can trivially be resynced from another
system if the user has profile sync set up. Go take a look at any
enterprise database application from a reasonable company: it will
almost always support replication across systems and validate the data
it reads. Note that in both cases this isn't the same as BTRFS checking
block checksums, and I never said the application had to keep working
without issue; even BTRFS and ZFS can only provide that guarantee with
multiple devices, or with dup profiles on a single disk. But I can
count on one hand the pieces of software I've used in the last few
years that didn't at least fail gracefully when fed bad data (and
returning -EIO when a checksum fails is essentially the same thing).
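To make "fail gracefully" concrete: application-level verification needs no filesystem support at all. Here's a minimal sketch in Python (hypothetical helper names, simplified single-checksum framing; real apps would do this per-record) of an app that stores a SHA-256 alongside its data and bails out rather than running on a corrupt profile:

```python
import hashlib
import os

def write_verified(path, payload):
    """Store the hex SHA-256 of the payload in the first 64 bytes of the file."""
    digest = hashlib.sha256(payload).hexdigest().encode("ascii")
    with open(path, "wb") as f:
        f.write(digest + payload)
        f.flush()
        os.fsync(f.fileno())

def read_verified(path):
    """Return the payload, or raise ValueError if it fails verification.

    A low-level I/O error (e.g. EIO from a filesystem checksum failure)
    propagates as OSError, which the application can treat the same way.
    """
    with open(path, "rb") as f:
        stored = f.read(64).decode("ascii")
        payload = f.read()
    if hashlib.sha256(payload).hexdigest() != stored:
        raise ValueError("corrupt data file: %s" % path)
    return payload
```

An application built this way catches ValueError/OSError at startup and falls back to resyncing the profile instead of silently using bad data.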
>> It usually will not have checksumming, but it will almost always have support for a journal, which is enough to cover the particular data loss scenario we're talking about (unexpected unclean shutdown).
> I don't think that's what we're talking about:
> We're talking about people wanting checksumming to notice e.g. silent data corruption.
> The crash case is only the corner case of what happens if the data is written correctly but the csums are not.
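To be clear about what I mean by a journal covering the unclean-shutdown case, here's a minimal write-ahead sketch in Python (hypothetical class, heavily simplified; real databases journal at the page level, not whole files):

```python
import json
import os

class Journal:
    """Minimal write-ahead journal: an update is recorded durably first,
    then applied, then the journal entry is cleared. After an unclean
    shutdown, replaying any leftover entry makes the data file consistent."""

    def __init__(self, data_path):
        self.data_path = data_path
        self.journal_path = data_path + ".journal"

    def _fsync_write(self, path, data):
        with open(path, "wb") as f:
            f.write(data)
            f.flush()
            os.fsync(f.fileno())

    def commit(self, new_contents):
        # 1. Log the intended new contents durably before touching the data.
        self._fsync_write(self.journal_path,
                          json.dumps({"contents": new_contents}).encode())
        # 2. Apply the update to the data file.
        self._fsync_write(self.data_path, new_contents.encode())
        # 3. Only now discard the journal entry.
        os.remove(self.journal_path)

    def recover(self):
        # On startup: if a journal entry survived a crash, re-apply it.
        if os.path.exists(self.journal_path):
            with open(self.journal_path, "rb") as f:
                entry = json.loads(f.read())
            self._fsync_write(self.data_path, entry["contents"].encode())
            os.remove(self.journal_path)
```

A crash at any point leaves either the old contents, or a journal entry that recover() replays; it does not, by itself, detect silent corruption of data already at rest, which is Christoph's point.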
>> In my own experience, the things that use nodatacow fall into one of 4 categories:
>> 1. Cases where the data is non-critical, and data loss will be inconvenient but not fatal. Systemd journal files are a good example of this, as are web browser profiles when you're using profile sync.
> I'd guess many people would want their log files valid and complete. Same for their profiles (especially since people concerned about integrity might not want these synced to Mozilla/Google, etc.).
Agreed, but there's also the counter-argument for log files: most
people who are not running servers rarely (if ever) look at old logs,
and it's the old logs that are most likely to suffer at-rest corruption
(the longer something sits idle on media, the more likely it is to hit
a media error).
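That said, detecting at-rest corruption in rotated logs doesn't need filesystem checksumming either; a trivial sidecar-checksum sketch (hypothetical function names, Python):

```python
import hashlib

def seal(path):
    """Record a SHA-256 sidecar for a file once it has been rotated."""
    with open(path, "rb") as f:
        digest = hashlib.sha256(f.read()).hexdigest()
    with open(path + ".sha256", "w") as f:
        f.write(digest)

def check(path):
    """Return True if the file still matches its recorded checksum."""
    with open(path + ".sha256") as f:
        expected = f.read().strip()
    with open(path, "rb") as f:
        return hashlib.sha256(f.read()).hexdigest() == expected
```

A cron job sealing logs at rotation time and checking them periodically catches bitrot, though unlike BTRFS it can only detect it, not repair it.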
>> 2. Cases where the upper level can reasonably be expected to have some degree of handling, even if it's not correction. VM disk images and most database applications fall into this category.
> No. Wrong. Or prove to me that I'm wrong ;-)
> And these two (VMs, DBs) are actually *the* main cases for nodatacow.
Go install OpenSUSE in a VM and look at what filesystem it uses. Go
install Solaris in a VM; lo and behold, it uses ZFS as its root
filesystem, with no option for anything else. Go install a recent
version of Windows Server in a VM and notice that it also offers a
properly checked filesystem (ReFS). Go install FreeBSD in a VM and
notice that it provides the option (actively recommended by many people
who use FreeBSD) of installing with root on ZFS. Install Android or
Chrome OS (or AOSP or Chromium OS) in a VM, root the system, and take a
look at the storage stack: both of them use dm-verity, and Android (and
possibly Chrome OS too, I'm not 100% certain) uses per-file AEAD
through the VFS encryption API on encrypted devices. The fact that
some OSes blindly trust the underlying storage hardware is not our
issue, it's their issue, and it shouldn't be 'fixed' by BTRFS, because
it doesn't just affect their customers who run the OS in a VM on BTRFS.

As far as databases go, I know of only one piece of enterprise-level
database software that doesn't have some kind of handling for this type
of thing, and it's a horribly designed piece of software in other
respects too. Most enterprise database apps offer support for
replication, and quite a few do their own data validation when reading
from the database. And if you care about non-enterprise database apps,
then you need to worry about the edge case caused by unclean shutdown
anyway.
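As a sketch of the kind of validation I mean: many databases frame each record with a checksum tag and verify it on read. Something like this (hypothetical record format, Python; real databases checksum pages, not individual records):

```python
import hashlib
import struct

def pack_record(payload: bytes) -> bytes:
    """Frame a record as [4-byte big-endian length][payload][8-byte tag],
    where the tag is a truncated SHA-256 of the payload."""
    tag = hashlib.sha256(payload).digest()[:8]
    return struct.pack(">I", len(payload)) + payload + tag

def unpack_record(buf: bytes) -> bytes:
    """Parse a framed record, raising ValueError if validation fails."""
    (length,) = struct.unpack(">I", buf[:4])
    payload = buf[4:4 + length]
    tag = buf[4 + length:4 + length + 8]
    if hashlib.sha256(payload).digest()[:8] != tag:
        raise ValueError("record failed validation")
    return payload
```

On a validation failure the database can refuse the read, or fetch the record again from a replica, without any help from the filesystem.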
>> And I and most other sysadmins I know would prefer the opposite, with the addition of a secondary notification method. You can still hook the notification to stop the application, but you don't have to if you don't want to (and in cases 1 and 2 I listed above, you probably don't want to).
> Then I guess btrfs is generally not the right thing for such people, as in the CoW case it will also give them EIO on any corruption and their programs will fail.
For a single disk? Yes, I'd agree that BTRFS isn't the correct answer
unless you're running dup for all profiles on that single disk when you
care about data safety. Once you add another device, though, it's far
superior to regular RAID because it inherently knows which copy is
wrong.