On 10/22/2014 09:30 PM, Chris Murphy wrote:
> Sure. So if Btrfs is meant to address scalability, then perhaps at the moment
> it's falling short. As it's easy to add large drives and get very large
> multiple device volumes, the snapshotting needs to scale also.
> I'd say per user, it's reasonable to have 24 hourly (one snapshot per hour for
> a day), 7 daily, 4 weekly, and 12 monthly snapshots, or 47 snapshots. That's
> 47,000 snapshots if it's sane for a single Btrfs volume to host 1000 users.
> Arguably, such a system is better off with a distributed fs: Gluster FS or GFS2
> or Ceph.
Is one subvolume per user a rational expectation? Is it even
particularly smart? Doable, sure, but as a best practice it doesn't
seem that useful, because it multiplies the maintenance by the user base.
Presuming a Linux Standard Base layout (which is very presumptive),
having 47 snapshots of /home instead of 47,000 snapshots of
/home/X(1000) is just as workable, if not more so. A recursive reflink
copy of /home/X(n) from /home_Backup_date/X(n) takes only trivially
longer than re-snapshotting the individual user, as sketched below.
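That is, restoring one user out of a whole-/home snapshot is a single
reflink copy. A minimal sketch, assuming GNU coreutils and illustrative
names (/home_backup_20141022, user alice):

    # drop the damaged copy, then reflink-clone the user's tree back
    # out of the snapshot; data extents are shared, not duplicated,
    # so this is nearly instant regardless of the tree's size
    rm -rf /home/alice
    cp -a --reflink=always /home_backup_20141022/alice /home/alice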
Again, this gets into the question not of what works smoothly when
creating the snapshot but of what functions well during a restore.
People constantly create "backup solutions" without really looking at
the restore path.
I can't get anybody here to answer the question about "btrfs sub list -s /"
and setting/resetting the "snapshot" status of a subvolume. I've been
told "snapshots are subvolumes," which is fine, but since there _is_ a
classification mechanism, things get all caca if you rely on the "-s" in
your scripting and then promote a snapshot back into prime activity.
(Seriously, compare the listing with and without -s, note its natural
affinity for classifying subvolumes, then imagine the horror of needing
to take /home_backup_date and make it /home.)
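To make that concrete, a sketch with illustrative names:

    btrfs subvolume list /       # every subvolume, snapshots included
    btrfs subvolume list -s /    # only subvolumes created as snapshots

    # "promote" the backup into service; subvolumes behave as
    # directories, so a rename within the same filesystem suffices
    mv /home /home_dead
    mv /home_backup_20141022 /home

    # /home is now the live tree, yet "list -s" still classifies it
    # as a snapshot; there is no command to set or reset that status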
Similar problems obtain as soon as you consider the daunting task of
shuffling through 47,000 snapshots instead of just 47.
And if you set up each user on their own subvolume, what happens the
first time two users want to hard-link a file betwixt them? It fails,
as sketched below.
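Concretely, assuming /home/alice and /home/bob are separate subvolumes
(names hypothetical):

    ln /home/alice/shared.dat /home/bob/shared.dat
    # fails with EXDEV ("Invalid cross-device link"): hard links
    # cannot cross a subvolume boundary, even inside one btrfs volume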
Excessive segmentation of storage is an evil unto itself.
YMMV, of course.
An orthogonal example:
If you give someone six disks and tell them to make an encrypted raid6
via cryptsetup and mdadm, at least eight out of ten will encrypt the
drives and then raid the result. But it's _massively_ more efficient to
raid the drives and then encrypt the result. Why? Because writing a
block with the latter involves encrypting exactly one block. With the
former, even a healthy raid6 write means several encryptions (the data
block plus two parity blocks, each on its own dm-crypt mapping), and
a degraded array means _many_, since reconstruction drags most of the
stripe through the crypto layer.
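Spelled out as commands, a sketch (device names /dev/sd[a-f] and the
mapping names are illustrative):

    # 1) encrypt, then raid: six dm-crypt mappings sit under the
    #    array, so every member I/O is a separate encryption
    for d in a b c d e f; do
        cryptsetup luksFormat /dev/sd$d
        cryptsetup luksOpen /dev/sd$d crypt$d
    done
    mdadm --create /dev/md0 --level=6 --raid-devices=6 /dev/mapper/crypt[a-f]

    # 2) raid, then encrypt: one dm-crypt mapping sits above the
    #    array, so each block is encrypted exactly once no matter
    #    what the raid layer does underneath
    mdadm --create /dev/md0 --level=6 --raid-devices=6 /dev/sd[a-f]
    cryptsetup luksFormat /dev/md0
    cryptsetup luksOpen /dev/md0 cryptmd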
The above is a mental constraint, a mistake, that is all too common
because people expect encryption to be "better" the closer you get to
the spinning rust.
So too, people expect that segmentation is somehow better if it most
closely matches the abstract groupings (like per user), but in practical
terms it is better matched to the modality, where, for instance, all
users are one kind of thing, while all data stores are another kind of
thing.
We were just talking about putting all your VMs and larger NOCOW files
into a separate subvolume/domain because of their radically different
write behaviors. That's a sterling reason to subdivide the storage. So is
/ vs. /var vs. /home as three different domains with radically different
update profiles.
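The VM case, for instance, is a two-liner, sketched here with a
hypothetical path:

    # a dedicated subvolume keeps VM images out of /home's snapshot
    # rotation; +C on the (still empty) directory makes files
    # created inside it inherit NOCOW
    btrfs subvolume create /var/lib/vmimages
    chattr +C /var/lib/vmimages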
So while the natural impulse is to give each user its own subvolume it's
not likely to be that great an idea in practice because... um... 47,000
snapshots dude, and so on.