On 02/12/2018 11:02 AM, Austin S. Hemmelgarn wrote:
>> I will look into that if using built-in group capacity functionality proves to be truly untenable.  Thanks!
> As a general rule, unless you really need to actively prevent a subvolume from exceeding its quota, this will be more reliable and have much less performance impact than using qgroups.

Ok ok :). I plan to go this route, but since I'll want to benchmark it either way, I'll include a qgroups-enabled configuration in the benchmark and report back.
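
For reference, here's roughly what I intend to compare; the mount point, subvolume name, and 10G limit below are made up for illustration:

    # qgroup case: enforce a hard per-subvolume cap
    btrfs quota enable /mnt/pool
    btrfs qgroup limit 10G /mnt/pool/data
    btrfs qgroup show /mnt/pool

    # qgroup-free case: no enforcement, just measure usage periodically
    btrfs filesystem du -s /mnt/pool/data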

> With qgroups involved, I really can't say for certain, as I've never done much with them myself, but based on my understanding of how it all works, I would expect multiple subvolumes with a small number of snapshots each to have fewer performance issues than a single subvolume with the same total number of snapshots.

Glad to hear that.  That was my expectation as well.
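
To make sure the benchmark reflects that layout, I'll set it up along these lines (all names below are hypothetical):

    # many subvolumes, each with only a handful of snapshots,
    # instead of one subvolume carrying every snapshot
    btrfs subvolume create /mnt/pool/vol0
    btrfs subvolume create /mnt/pool/vol1
    btrfs subvolume snapshot -r /mnt/pool/vol0 /mnt/pool/vol0.snap0
    btrfs subvolume snapshot -r /mnt/pool/vol1 /mnt/pool/vol1.snap0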

> BTRFS in general works fine at that scale, dependent of course on the level of concurrent access you need to support.  Each tree update needs to lock a bunch of things in the tree itself, and having large numbers of clients writing to the same set of files concurrently can cause lock contention issues because of this, especially if all of them are calling fsync() or fdatasync() regularly.  These issues can be mitigated by segregating workloads into their own subvolumes (each subvolume is a mostly independent filesystem tree), but it sounds like you're already doing that, so I don't think that would be an issue for you.

Hmm... I'll think harder about this. There is potential for us to artificially divide access to files across subvolumes automatically, given the way we are using BTRFS as a backing store for our parallel file system. So far, even with around 1,000 threads across about 10 machines accessing BTRFS through our parallel filesystem over the wire, we've not seen issues; if we do, I have some ways out I've not explored yet. Thanks!
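
If it does become a problem, the first escape hatch I'd try is sharding files across subvolumes, roughly like this (the shard count and paths are hypothetical):

    # spread concurrent writers across independent subvolume trees;
    # our parallel FS would route each file to a shard, e.g. by path hash
    for i in $(seq 0 7); do
        btrfs subvolume create "/mnt/pool/shard$i"
    done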

> Now, there are some other odd theoretical cases that may cause issues when dealing with really big filesystems, but they're either really specific edge cases (for example, starting with a really small filesystem and gradually scaling it up in size as it gets full) or happen at scales far larger than what you're talking about (on the order of at least double-digit petabytes).

Yeah, our use case will be in the tens to hundreds of TB for the foreseeable future, so I'm glad to hear this is relatively standard. That was my read of the situation as well.

Thanks!

ellis