Re: RedHat 7.4 Release Notes: "Btrfs has been deprecated" - wut?

Chris Mason Wed, 16 Aug 2017 06:13:48 -0700

On Mon, Aug 14, 2017 at 09:54:48PM +0200, Christoph Anton Mitterer wrote:

On Mon, 2017-08-14 at 11:53 -0400, Austin S. Hemmelgarn wrote:

Quite a few applications actually _do_ have some degree of secondary 
verification or protection from a crash.  Go look at almost any
database 
software.

Then please give proper references for this!


This is from 2015, where you claimed this already and I looked up all
the bigger DBs and they either couldn't do it at all, didn't to it per
default, or it required application support (i.e. from the programs
using the DB)
https://www.spinics.net/lists/linux-btrfs/msg50258.html

It usually will not have checksumming, but it will almost 
always have support for a journal, which is enough to cover the 
particular data loss scenario we're talking about (unexpected
unclean 
shutdown).


I don't think we talk about this:
We talk about people wanting checksuming to notice e.g. silent data
corruption.

The crash case is only the corner case about what happens then if data
is written correctly but csums not.

We use the crcs to catch storage gone wrong, both in terms of simplethings like cabling, bus errors, drives gone crazy or exotic problemslike every time I reboot the box a handful of sectors return EFIpartition table headers instead of the data I wrote. You don't needdata center scale for this to happen, but it does help...

So, we do catch crc errors in prod and they do keep us from replicatingbad data over good data. Some databases also crc, and all drives havecorrection bits of of some kind. There's nothing wrong with crcshappening at lots of layers.

Btrfs couples the crcs with COW because it's the least complicated wayto protect against:


* bits flipping

* IO getting lost on the way to the drive, leaving stale but valid datain place* IO from sector A going to sector B instead, overwriting valid datawith other valid data.

It's possible to protect against all three without COW, but allsolutions have their own tradeoffs and this is the setup we chose. It'seasy to trust and easy to debug and at scale that really helps.

In general, production storage environments prefer clearly definederrors when the storage has the wrong data. EIOs happen often, and youwant to be able to quickly pitch the bad data and replicate in gooddata.

My real goal is to make COW fast enough that we can leave it on for thedatabase applications too. Obviously I haven't quite finished that oneyet ;) But I'd rather keep the building block of all the other btrfsfeatures in place than try to do crcs differently.


-chris
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: RedHat 7.4 Release Notes: "Btrfs has been deprecated" - wut?

Reply via email to