Re: [zfs-discuss] ZFS Send Priority and Performance

Adam Serediuk Fri, 20 Nov 2009 12:28:44 -0800

I've done some work on such things. The difficulty in design isfiguring
out how often to do the send. You will want to balance your send time
interval with the write rate such that the send data is likely to bein the ARC.There is no magic formula, but empirically you can discover areasonable
interval.

Currently I replicate snapshots daily, the idea that I might be betteroff doing snapshots and replication hourly and potentially even morefrequent never occurred to me. I'll have to try. Surprisingly doing areplication of the entire data set (currently 13TB) actually performsbetter than the incremental, from a raw throughput point of view.

P.S. if you have atime enabled, which is the default, handlingbillions of
files will be quite a challenge.

Indeed, that was one of the very first things I tweaked and disabled.I don't know how bad it would have been with it enabled but I wasn'tabout to find out.


Thanks

On 20-Nov-09, at 11:48 AM, Richard Elling wrote:

On Nov 20, 2009, at 11:27 AM, Adam Serediuk wrote:
I have several X4540 Thor systems with one large zpool thatreplicate data to a backup host via zfs send/recv. The processworks quite well when there is little to no usage on the sourcesystems. However when the source systems are under usagereplication slows down to a near crawl. Without load replicationstreams along usually near 1 Gbps but drops down to anywherebetween 0 - 5000 Kbps while under load.
This makes it difficult to keep snapshot replication workingeffectively. It seems that the zfs_send operation is low priorityonly occurring after I/O operations have been completed.
Is there a way that I can increase the send priority to increasereplication speed?
No, unless you compile the code yourself.
Both the source and destination system are configured in one largezpool comprised of 8 raidz sets. While under load the sourcesystem does ~ 500 - 950 iops/s (from zpool iostat) with no apparenthot spots. It seems to me that the system should be able to performmuch faster. Unfortunately the data on these systems is in the formof hundreds of millions (maybe even into the billion mark now) ofvery small files, could this be a factor even with the block levelreplication occurring?
The process is currently:

zfs_send -> mbuffer -> LAN -> mbuffer -> zfs_recv
I've done some work on such things. The difficulty in design isfiguring
out how often to do the send. You will want to balance your send time
interval with the write rate such that the send data is likely to bein the ARC.There is no magic formula, but empirically you can discover areasonable
interval.
There is a lurking RFE here somewhere: it would be nice toautomatically
snapshot when some threshold of writes has occurred.
P.S. if you have atime enabled, which is the default, handlingbillions of
files will be quite a challenge.
-- richard


_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Re: [zfs-discuss] ZFS Send Priority and Performance

Reply via email to