On 2017-11-15 02:11, waxhead wrote:
> As a regular BTRFS user I can tell you that there is no such thing as hot data tracking yet. Some people seem to use bcache together with btrfs and come asking for help on the mailing list.
Bcache works fine on recent kernels; it was only with older versions that there were issues. dm-cache similarly works fine on recent versions. In both cases, though, you need to be sure you know what you're doing, otherwise you are liable to break things.
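For anyone wanting to try it, the basic bcache-under-btrfs setup looks roughly like this (the device names are placeholders for your slow disk and your SSD, adjust to taste):

    # Format the backing (slow) and cache (fast) devices in one step;
    # created together like this, they attach to each other automatically:
    make-bcache -B /dev/sdb -C /dev/sdc

    # Put btrfs on the resulting composite device, not on the raw disks:
    mkfs.btrfs /dev/bcache0
    mount /dev/bcache0 /mnt

The important part is that btrfs only ever sees /dev/bcache0; pointing tools at the raw backing device directly is one of the classic ways people break things.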

> Raid5/6 have received a few fixes recently, and it *may* soon be worth trying out raid5/6 for data while keeping metadata in raid1/10 (I would rather lose a file or two than the entire filesystem).
> I had plans to run some tests on this a while ago, but forgot about it.
> Like all good citizens, remember to keep good backups. The last time I tested raid5/6 I ran into issues quickly. For what it's worth, raid1/10 seems pretty rock solid as long as you have enough disks (hint: you need more than two for raid1 if you want to stay safe).
Parity profiles (raid5 and raid6) still have issues, although there are fewer than there were, with most of the remaining issues surrounding recovery. I would still recommend against them for production usage.
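If you do want to experiment along the lines suggested above, the split between parity data and replicated metadata is just a pair of profile options. A sketch, not an endorsement for production, with placeholder device names:

    # New volume: raid5 for data, raid1 for metadata, so a parity bug
    # is more likely to cost individual files than the whole filesystem:
    mkfs.btrfs -d raid5 -m raid1 /dev/sda /dev/sdb /dev/sdc

    # An existing mounted volume can be converted online with balance filters:
    btrfs balance start -dconvert=raid5 -mconvert=raid1 /mnt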

Simple replication (raid1) is pretty much rock solid as long as you keep on top of replacing failing hardware and aren't stupid enough to run the array degraded for any extended period of time (converting to a single-device volume is vastly preferable to leaving things running on half a volume, for multiple reasons).
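For reference, the conversion I mean looks roughly like this on a two-device raid1 that has lost a disk (the mount point and device names are placeholders):

    # Mount the surviving device degraded (note that some kernels will
    # only let you mount a degraded raid1 read-write once, so don't dawdle):
    mount -o degraded /dev/sda /mnt

    # Convert data and metadata to single-device profiles first, since
    # raid1 chunks can't be satisfied by one device:
    btrfs balance start -dconvert=single -mconvert=dup /mnt

    # Then drop the record of the missing device:
    btrfs device remove missing /mnt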

Striped replication (raid10) is generally fine, but you can get much better performance by running BTRFS with a raid1 profile on top of two MD/LVM/Hardware RAID0 volumes (BTRFS still doesn't do a very good job of parallelizing things).
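As a concrete four-disk sketch of that arrangement (device names are purely illustrative):

    # Two MD RAID0 stripes handle the parallelization...
    mdadm --create /dev/md0 --level=0 --raid-devices=2 /dev/sda /dev/sdb
    mdadm --create /dev/md1 --level=0 --raid-devices=2 /dev/sdc /dev/sdd

    # ...while btrfs raid1 across them keeps checksumming and self-repair:
    mkfs.btrfs -d raid1 -m raid1 /dev/md0 /dev/md1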

> As for dedupe, there is (to my knowledge) nothing fully automatic yet. You have to run a program to scan your filesystem, but all the deduplication is done in the kernel. duperemove seemed to work quite well when I tested it, but there may be some performance implications.
Correct, there is nothing automatic (and there are pretty significant arguments against doing automatic deduplication in most cases), but the off-line options (via the EXTENT_SAME ioctl) are reasonably reliable. Duperemove in particular does a good job, though it may take a long time for large data sets.
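A typical duperemove run looks something like this (the hashfile path and target directory are placeholders):

    # Scan recursively, hand duplicate extents to the kernel via the
    # EXTENT_SAME ioctl (-d), and keep a hash database so later runs
    # only need to re-scan files that changed:
    duperemove -dr --hashfile=/var/tmp/dedupe.hash /mnt/data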

As far as performance goes, it's no worse than having large numbers of snapshots; the issues in both cases arise from very large numbers of reflinks.

> Roy Sigurd Karlsbakk wrote:
>> Hi all
>>
>> I've been following this project on and off for quite a few years, and I wonder if anyone has looked into tiered storage on it. By tiered storage, I mean hot data living on fast storage and cold data on slow storage. I'm not talking about caching (where you just keep a copy of the hot data on the fast storage).
>>
>> And btw, how far are raid[56] and block-level dedup from something usable in production?
>>
>> Kind regards
>>
>> roy
>> --
>> Roy Sigurd Karlsbakk
>> (+47) 98013356
>> http://blogg.karlsbakk.net/
>> GPG Public key: http://karlsbakk.net/roysigurdkarlsbakk.pubkey.txt
>> --
>> The good you shall carve in stone, the ill you shall write in snow.
