[zfs-discuss] "zfs send..." too slow?

2009-10-25 Thread Orvar Korvar
I am trying to backup a large zfs file system to two different identical hard 
drives. I have therefore started two commands to backup "myfs" and when they 
have finished, I will backup "nextfs"

zfs send mypool/m...@now | zfs receive backupzpool1/now & zfs send 
mypool/m...@now | zfs receive backupzpool2/now ; zfs send mypool/nex...@now | 
zfs receive backupzpool3/now

in parallell. The logic is that the same file data is cached and therefore easy 
to send to each backup drive.

Should I instead have done one "zfs send..." and waited for it to complete, and 
then started the next?

It seems that "zfs send..." takes quite some time? 300GB takes 10 hours, this 
far. And I have in total 3TB to backup. This means it will take 100 hours. Is 
this normal? If I had 30TB to back up, it would take 1000 hours, which is more 
than a month. Can I speed this up?

Is rsync faster? As I have understood it, "zfs send.." gives me an exact 
replica, whereas rsync doesnt necessary do that, maybe the ACL are not 
replicated, etc. Is this correct about rsync vs "zfs send"?
-- 
This message posted from opensolaris.org
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] "zfs send..." too slow?

2009-10-25 Thread Ian Collins

Orvar Korvar wrote:


It seems that "zfs send..." takes quite some time? 300GB takes 10 hours, this 
far. And I have in total 3TB to backup. This means it will take 100 hours. Is this 
normal? If I had 30TB to back up, it would take 1000 hours, which is more than a month. 
Can I speed this up?

  
That looks very slow.  How are the pools configured?  Last time I 
migrated data between pools on the same box (an x4500), I got 80-90GB/hour.



Is rsync faster? As I have understood it, "zfs send.." gives me an exact replica, whereas 
rsync doesnt necessary do that, maybe the ACL are not replicated, etc. Is this correct about rsync 
vs "zfs send"?
  

I wouldn't bother with rsync in this situation.

--
Ian.

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] "zfs send..." too slow?

2009-10-26 Thread Albert Chin
On Sun, Oct 25, 2009 at 01:45:05AM -0700, Orvar Korvar wrote:
> I am trying to backup a large zfs file system to two different
> identical hard drives. I have therefore started two commands to backup
> "myfs" and when they have finished, I will backup "nextfs"
> 
> zfs send mypool/m...@now | zfs receive backupzpool1/now & zfs send
> mypool/m...@now | zfs receive backupzpool2/now ; zfs send
> mypool/nex...@now | zfs receive backupzpool3/now
> 
> in parallell. The logic is that the same file data is cached and
> therefore easy to send to each backup drive.
> 
> Should I instead have done one "zfs send..." and waited for it to
> complete, and then started the next?
> 
> It seems that "zfs send..." takes quite some time? 300GB takes 10
> hours, this far. And I have in total 3TB to backup. This means it will
> take 100 hours. Is this normal? If I had 30TB to back up, it would
> take 1000 hours, which is more than a month. Can I speed this up?

It's not immediately obvious what the cause is. Maybe the server running
zfs send has slow MB/s performance reading from disk. Maybe the network.
Or maybe the remote system. This might help:
  http://tinyurl.com/yl653am

-- 
albert chin (ch...@thewrittenword.com)
___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] "zfs send..." too slow?

2009-10-26 Thread Richard Elling

On Oct 25, 2009, at 1:45 AM, Orvar Korvar wrote:

I am trying to backup a large zfs file system to two different  
identical hard drives. I have therefore started two commands to  
backup "myfs" and when they have finished, I will backup "nextfs"


zfs send mypool/m...@now | zfs receive backupzpool1/now & zfs send  
mypool/m...@now | zfs receive backupzpool2/now ; zfs send mypool/ 
nex...@now | zfs receive backupzpool3/now


in parallell. The logic is that the same file data is cached and  
therefore easy to send to each backup drive.


Should I instead have done one "zfs send..." and waited for it to  
complete, and then started the next?


Parallel works, well, in parallel. Unless the changes are in the ARC,  
you
will be spending a lot of time waiting on disk. So having multiple  
sends in
parallel, in general, gains parallelism. If you only have a single  
HDD, you

might not notice much improvement, though.

It seems that "zfs send..." takes quite some time? 300GB takes 10  
hours, this far. And I have in total 3TB to backup. This means it  
will take 100 hours. Is this normal? If I had 30TB to back up, it  
would take 1000 hours, which is more than a month. Can I speed this  
up?


CR 6418042 integrated in b102 and Solaris 10 10/09 improves send  
performance.


Is rsync faster? As I have understood it, "zfs send.." gives me an  
exact replica, whereas rsync doesnt necessary do that, maybe the ACL  
are not replicated, etc. Is this correct about rsync vs "zfs send"?


I general, rsync will be slower, especially if there are millions of  
files, because it

must stat() every file to determine those that have changed.
 -- richard

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] "zfs send..." too slow?

2009-10-26 Thread Marion Hakanson
knatte_fnatte_tja...@yahoo.com said:
> Is rsync faster? As I have understood it, "zfs send.." gives me an exact
> replica, whereas rsync doesnt necessary do that, maybe the ACL are not
> replicated, etc. Is this correct about rsync vs "zfs send"? 

It is true that rsync (as of 3.0.5, anyway) does not preserve NFSv4/ZFS
ACL's.  It also cannot handle ZFS snapshots.

On the other hand, you can run multiple rsync's in parallel;  You can
only do that with zfs send/recv if you have multiple, independent ZFS
datasets that can be done in parallel.  So which one goes faster will
depend on your situation.

Regards,

Marion


___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] "zfs send..." too slow?

2009-10-26 Thread Richard Elling

On Oct 26, 2009, at 11:51 AM, Marion Hakanson wrote:


knatte_fnatte_tja...@yahoo.com said:
Is rsync faster? As I have understood it, "zfs send.." gives me an  
exact
replica, whereas rsync doesnt necessary do that, maybe the ACL are  
not

replicated, etc. Is this correct about rsync vs "zfs send"?


It is true that rsync (as of 3.0.5, anyway) does not preserve NFSv4/ 
ZFS

ACL's.  It also cannot handle ZFS snapshots.

On the other hand, you can run multiple rsync's in parallel;  You can
only do that with zfs send/recv if you have multiple, independent ZFS
datasets that can be done in parallel.  So which one goes faster will
depend on your situation.


Yes. Your configuration and intended use impacts the decision.

Also, b119 improves stat() performance, which should help rsync
and other file-based backup software.
http://bugs.opensolaris.org/bugdatabase/view_bug.do?bug_id=6775100
 -- richard

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss


Re: [zfs-discuss] "zfs send..." too slow?

2009-10-27 Thread David Dyer-Bennet

On Sun, October 25, 2009 03:45, Orvar Korvar wrote:

> It seems that "zfs send..." takes quite some time? 300GB takes 10 hours,
> this far. And I have in total 3TB to backup. This means it will take 100
> hours. Is this normal? If I had 30TB to back up, it would take 1000 hours,
> which is more than a month. Can I speed this up?

That seems pretty bad, I back up around 650GB to a USB-2 external drive
(not the world's fastest!!!) in about 7 hours, last I checked the time.
>
> Is rsync faster? As I have understood it, "zfs send.." gives me an exact
> replica, whereas rsync doesnt necessary do that, maybe the ACL are not
> replicated, etc. Is this correct about rsync vs "zfs send"?

rsync doesn't seem to cover the ACLs, last I looked closely.  Which is why
I converted my working rsync-based version to a zfs send-receive version
that only supports full backups and isn't automated yet.  Grumble.  (The
ACLs are key for in-kernel CIFS, which is what drove me here.)
-- 
David Dyer-Bennet, d...@dd-b.net; http://dd-b.net/
Snapshots: http://dd-b.net/dd-b/SnapshotAlbum/data/
Photos: http://dd-b.net/photography/gallery/
Dragaera: http://dragaera.info

___
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss