On 10/22/2014 01:08 PM, Zygo Blaxell wrote:
> I have datasets where I record 14000+ snapshots of filesystem directory
> trees scraped from test machines and aggregated onto a single server
> for deduplication... but I store each snapshot as a git commit, not as
> a btrfs snapshot or even subvolume.
>
> We do sometimes run queries like "in the last two years, how many times
> did $CONDITION occur?" which will scan a handful of files in all of the
> snapshots. The use case itself isn't unreasonable, although using the
> filesystem instead of a more domain-specific tool to achieve it may be.
Okay, sure. And as stated by others, there _are_ use cases that are
exceptional.
But such an archival system most likely does not _need_ to be balanced,
etc., with any frequency, or likely ever, because it isn't experiencing
churn from dynamic use.
In the world of trade-offs, trade-offs happen.
The guy who cited the 5000 snapshots said they were hourly, and taken
because he might remove an important file or something. That is _way_
more action than the feared condition warrants.
ASIDE: While fixing someone's document archive RAID device (a Sun
hardware device the size of a fridge) back in 1997 or so I discovered
that they'd disabled _all_ the hardware cache features. When asked I was
told that "the procedure for replacing a failed drive required the cache
device to be cleared by pressing the red button" and they were afraid
that, should that day come, someone would forget to press that button...
so they'd turned off the feature entirely. This is a form of
unreasonable paranoia. They were afraid that someone in the future would
not follow the directions that would be printed on both the machine and
the new drive (these were _not_ commodity parts).
When an over-abundance of caution passes beyond reasonable expectations,
performance will suffer. The system is immaterial, the rule holds.
What's worse is that it becomes very much like "security theater", only
it's "a backup show" where no actual backing up is really happening in
any useful sense. And God save you picking which version of a file was
the last "good one".
So in your use case, your git repository of snapshots isn't actually
"live" on the production server you are archiving, right?
So too, it would be reasonable to btrfs send periodic snapshots to an
archive system, retain lots and lots of them, and expect reasonable
performance of your queries.
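Something like this is what I mean (a rough sketch only; the paths, the
"archive" hostname, and the snapshot names are made up, and it assumes a
btrfs-progs with send/receive on both ends):

  # on the production box: take a read-only snapshot and ship it whole
  btrfs subvolume snapshot -r /data /data/.snap/2014-10-22
  btrfs send /data/.snap/2014-10-22 | ssh archive btrfs receive /archive/data

  # later snapshots only need to send the delta against the previous one
  btrfs subvolume snapshot -r /data /data/.snap/2014-10-23
  btrfs send -p /data/.snap/2014-10-22 /data/.snap/2014-10-23 \
      | ssh archive btrfs receive /archive/data

The archive box ends up holding lots and lots of read-only snapshots,
while the production box only has to keep the most recent one around as
the parent for the next incremental send.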
And you could expect reasonable performance in your maintenance.
But "reasonable performance" in the maintenance case is massively
different from reasonable performance in use cases. Indeed, if you try
to balance multiple terabytes of data spread across thousands of
snapshots you'll be taking a lot of time. A _perfectly_ _reasonable_ lot
of time for the operation at hand.
But if you expect to be able to do maintenance (like running btrfsck on
your production box with its 5k snapshots) in just a few minutes when
you've got logarithmic-rate metadata to shuffle through... well, good
luck with that.
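Practical aside: if someone does end up needing to balance a box like
that, balance filters can at least scope the work so you aren't
rewriting every chunk. Rough example only (the /archive mount point is
made up):

  # only rewrite data/metadata chunks that are less than half full
  btrfs balance start -dusage=50 -musage=50 /archive

  # keep an eye on it, or bail out if it's hurting the box
  btrfs balance status /archive
  btrfs balance cancel /archive

Still not "a few minutes" with thousands of snapshots, but it beats an
unfiltered full balance.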