Re: Rsync Throughput issues

2001-10-09 Thread tim . conway

You might try putting the rsycd (server) on the source.  rsync as a server is really 
designed for read-only.  The read-write support is an afterthough, and very flaky.  
Martin, Tridge, Wayne:  correct me if that's not true any more.
Make sure you're not using --bwlimit=, unless you need to parcel your bandwidth (In my 
situation, I do).
Don't use -H, unless you actually use multiple hard links to files, as that adds 
tremendous overhead on large filesystems.
If the files are nfs mounted, use -W, else you will read both files completely, before 
you transfer anything... might as well just send it if datestamp or size differs.
I doubt ssh was giving you any slowdown, unless you're cpu-limited on one or the other 
box.  If you don't need the security, and policy permits, plain rsh transport might be 
a bit faster.


Tim Conway
[EMAIL PROTECTED]
303.682.4917
Philips Semiconductor - Longmont TC
1880 Industrial Circle, Suite D
Longmont, CO 80501
Available via SameTime Connect within Philips
Available as n9hmg on AIM
perl -e 'print pack(, 
19061,29556,8289,28271,29800,25970,8304,25970,27680,26721,25451,25970), .\n '
There are some who call me Tim?





Ben Ricker [EMAIL PROTECTED]@lists.samba.org on 10/08/2001 06:50:02 PM

Sent by:  [EMAIL PROTECTED]


To: [EMAIL PROTECTED]
cc:  (bcc: Tim Conway/LMT/SC/PHILIPS)
Subject:  Rsync Throughput issues
Classification:



I am using Rsync between a Redhat Linux box and an AIX RS600. We have
a about 30gb of database we need to sync to a backup server. Sounds
good, right? The problem is that Rsync is so slow when we do the
initial dump. We have files that are 1 - 5gb. It takes around 14-20
hours to Rsync the file structure, which means our daily backup will
miss that days data because we are waiting for the data to rsync.

At first we were using ssh as the agent but we thought that might be
slowing us down. We then tried to use rsync as a server and push the
data to the module/zone of the server. Consistently, it takes roughly
2 minutes to transfer 10meg of data

I thought it was a network problem so we tried ftp. A 10 mb file took
90% the time (about 20 seconds. This is on a 100bit Full Duplex
switched network with both boxes on the SAME subnet.

Am I missing something? Should I expect these speeds? Could there be
something on the App switch throttling rsync for some reason?

Any thought will help,

Ben Ricker
System Administrator
Wellinx.com









Re: Rsync Throughput issues

2001-10-09 Thread Eric Whiting

Ben Ricker wrote:
 
 I am using Rsync between a Redhat Linux box and an AIX RS600. We have
 a about 30gb of database we need to sync to a backup server. Sounds
 good, right? The problem is that Rsync is so slow when we do the
 initial dump. We have files that are 1 - 5gb. It takes around 14-20
 hours to Rsync the file structure, which means our daily backup will
 miss that days data because we are waiting for the data to rsync.

How many files are in the tree? Are all the files 1-5G? Are the source
files on new/fast disk? For older systems, the 100Mbit network is often
faster than disk. 

I have a 40G tree with 1 million files in it. I just killed the tree on
a development box and restarted a rsync to it. I think it might take 10+
hours mainly because of how many files are in the tree -- not so much
because of the total size. I'll let you know. I know it takes about 2
hours just to verify that all 1 million files are in sync even when
there are no changes to the tree. 
 
 At first we were using ssh as the agent but we thought that might be
 slowing us down. We then tried to use rsync as a server and push the
 data to the module/zone of the server. Consistently, it takes roughly
 2 minutes to transfer 10meg of data

That seems way too long. Sounds like a reverse dns lookup failure. Long
shot debug question -- Do you have dns/nis/files all working properly?
Sometimes reverse dns lookups can take a few minutes to fail or time out
and thus affect the overall speed of transfers. Repeated reverse dns
failures can add up fast.
 
 I thought it was a network problem so we tried ftp. A 10 mb file took
 90% the time (about 20 seconds. This is on a 100bit Full Duplex
 switched network with both boxes on the SAME subnet.

I think you still have other problems. A ftp of a 10Mbyte file takes my
boxes about 1s on a 100Mbit wire. 

Be sure to try ftp both get and put. Oftentimes a duplex mismatch or
speed problem will only affect traffic in one direction and not the
other. I think you might have some network issues. Do a ifconfig on the
linux box to see what sort of errors and collisions you are seeing.
 
 Am I missing something? Should I expect these speeds? Could there be
 something on the App switch throttling rsync for some reason?
 
 Any thought will help,
 
 Ben Ricker
 System Administrator
 Wellinx.com




Re[2]: Rsync Throughput issues

2001-10-09 Thread Rusty Carruth

(Courtesy copy sent directly to Ben)

Eric Whiting [EMAIL PROTECTED] wrote:
 Ben Ricker wrote:
  
  I am using Rsync between a Redhat Linux box and an AIX RS600. We have
  a about 30gb of database we need to sync to a backup server. Sounds
  good, right? The problem is that Rsync is so slow when we do the
  initial dump. We have files that are 1 - 5gb. It takes around 14-20
  hours to Rsync the file structure, which means our daily backup will
  miss that days data because we are waiting for the data to rsync.
 
 ...

I've got 2 areas that I sync across a T-1, as follows:

Number of files: 8122
Number of files transferred: 626
Total file size: 2,101,157,198 bytes
Total transferred file size: 1,855,602,142 bytes
Literal data: 2,851,745 bytes
Matched data: 1,852,750,397 bytes
File list size: 197,222
Total bytes written: 10,194,719
Total bytes read: 487,766

wrote 10,194,719 bytes  read 487,766 bytes  6,841.17 bytes/sec
total size is 2,101,157,198  speedup is 196.69

Number of files: 13,502
Number of files transferred: 0
Total file size: Total transferred file size: 0 bytes
Literal data: 0 bytes
Matched data: 0 bytes
File list size: 304636
Total bytes written: 2741
Total bytes read: 304,713

wrote 2,741 bytes  read 304,713 bytes  3,967.15 bytes/sec
total size is 

The above 2 things took 27 minutes 20 seconds total.  Remeber again,
this is across a T1 (full 1.5 Mbit T1), not 10Mbit or 100Mbit. Without
ssh or anything else.

  At first we were using ssh as the agent but we thought that might be
  slowing us down. We then tried to use rsync as a server and push the
  data to the module/zone of the server. Consistently, it takes roughly
  2 minutes to transfer 10meg of data
 
 That seems way too long. ...

  I thought it was a network problem so we tried ftp. A 10 mb file took
  90% the time (about 20 seconds. This is on a 100bit Full Duplex

20 seconds for 10 MB (I'm assuming you meant MegaByte, not MegaBit ;-) is
way slow.  ESPECIALLY for ftp. A long time ago I ran an ftp test on
a 10 Mbit link between 2 Sun Sparc2 machines.  I was able to use 99.9%
of the bandwidth (i.e. I got over 1 megaByte/sec transfer rates!). Put
another way, using 2 sparcstation 2s on a slightly occupied 10Mbit network
I should be able to transfer that file in 10 seconds.

One word of warning - FTP transfers are way efficient, and can burn all
the bandwidth up.  Most other protocols are not nearly so efficient, and
so you cannot get the speed from them.  The same machines and all worked
much slower doing rcp as transfer means (but I don't remember the numbers,
this was what, 9 years ago?).

So, whats the point?  Its this:

What's your network loading?  Have you run anything (like Sun's SE performance
toolkit (which unfortunatly only works on sun) to see if your network
is working up to snuff?

Going off in another direction: Do you have enough RAM?  Programs running
out of disk instead of out of ram are DEADLY slow...  And the amount of 
data you have to worry about is non-trivial here...  ;-)

  switched network with both boxes on the SAME subnet.
 
 I think you still have other problems. A ftp of a 10Mbyte file takes my
 boxes about 1s on a 100Mbit wire. 

probably due as much to file transfer rates as network speeds!
What happens if you do the transfer twice?  (So you use the cache
for the file data instead of having to read it from the disk -
assuming that you're not memory limited!)

 Be sure to try ftp both get and put. Oftentimes ...
 I think you might have some network issues. 

I'm going to have to agree.  DNS is one possiblility, but that slow
ftp reported tends to make me think its something else.

 Do a ifconfig on the
 linux box to see what sort of errors and collisions you are seeing.

And on the AIX box also!

What's the loading on that AIX box like?

Can you try rsync-ing between 2 linux boxes?  Between 2 aix boxes?
(First, try ftp transfers and see what kind of performance you get,
once THAT is fixed then see how well rsync works between the 'correct'
machines, before you worry about trying between other than the machines
you plan to use in the final setup)

  Am I missing something?

Probably (not much help, eh? ;-)

  Should I expect these speeds? 

No, this is slow.

  Could there be something on the App switch throttling rsync for some reason?

Umm, I think I missed something.  What is the app switch?  Unless you mean the
network switching hub - maybe.  Is is SNMP manageable?  How much bandwidth is
it currently handling?  How much CAN it handle?

One note on switching hubs - I have one at home, and I've noticed a really
weird thing - when transferring data from my laptop (10Mbit PCMCIA on a
P133 laptop) to my 'big machine' (Celeron 450MHz with 100Mbit ethernet,
lotsa ram, etc) through the switching hub I get collisions(!!!) between
the laptop and the hub.  Scratch head time, and one of these days I'm
going to have to figure out how and why...  The point is - don't assume
your hub does all its supposed to do and 

Re: Re[2]: Rsync Throughput issues

2001-10-09 Thread Ben Ricker

Thanks for the advice. I ran an rsync test, using SSH as the transport
agent, and I got the following throughput:

wrote 178895529 bytes  read 32 bytes  499011.33 bytes/sec
total size is 1075112361  speedup is 6.01

It took the same amount of time as the ftp transfer did (about 6
minutes).

So, it looks like the problem is with Rsync for AIX. We use the binary
install. Someone suggested compiling rsync from source. Has anyone else
run rsync on AIX?

Ben Ricker
System Administrator
Wellinx.com


On Tue, 2001-10-09 at 12:01, Rusty Carruth wrote:
 (Courtesy copy sent directly to Ben)
 
 Eric Whiting [EMAIL PROTECTED] wrote:
  Ben Ricker wrote:
   
   I am using Rsync between a Redhat Linux box and an AIX RS600. We have
   a about 30gb of database we need to sync to a backup server. Sounds
   good, right? The problem is that Rsync is so slow when we do the
   initial dump. We have files that are 1 - 5gb. It takes around 14-20
   hours to Rsync the file structure, which means our daily backup will
   miss that days data because we are waiting for the data to rsync.
  
  ...
 
 I've got 2 areas that I sync across a T-1, as follows:
 
 Number of files: 8122
 Number of files transferred: 626
 Total file size: 2,101,157,198 bytes
 Total transferred file size: 1,855,602,142 bytes
 Literal data: 2,851,745 bytes
 Matched data: 1,852,750,397 bytes
 File list size: 197,222
 Total bytes written: 10,194,719
 Total bytes read: 487,766
 
 wrote 10,194,719 bytes  read 487,766 bytes  6,841.17 bytes/sec
 total size is 2,101,157,198  speedup is 196.69
 
 Number of files: 13,502
 Number of files transferred: 0
 Total file size: Total transferred file size: 0 bytes
 Literal data: 0 bytes
 Matched data: 0 bytes
 File list size: 304636
 Total bytes written: 2741
 Total bytes read: 304,713
 
 wrote 2,741 bytes  read 304,713 bytes  3,967.15 bytes/sec
 total size is 
 
 The above 2 things took 27 minutes 20 seconds total.  Remeber again,
 this is across a T1 (full 1.5 Mbit T1), not 10Mbit or 100Mbit. Without
 ssh or anything else.
 
   At first we were using ssh as the agent but we thought that might be
   slowing us down. We then tried to use rsync as a server and push the
   data to the module/zone of the server. Consistently, it takes roughly
   2 minutes to transfer 10meg of data
  
  That seems way too long. ...
 
   I thought it was a network problem so we tried ftp. A 10 mb file took
   90% the time (about 20 seconds. This is on a 100bit Full Duplex
 
 20 seconds for 10 MB (I'm assuming you meant MegaByte, not MegaBit ;-) is
 way slow.  ESPECIALLY for ftp. A long time ago I ran an ftp test on
 a 10 Mbit link between 2 Sun Sparc2 machines.  I was able to use 99.9%
 of the bandwidth (i.e. I got over 1 megaByte/sec transfer rates!). Put
 another way, using 2 sparcstation 2s on a slightly occupied 10Mbit network
 I should be able to transfer that file in 10 seconds.
 
 One word of warning - FTP transfers are way efficient, and can burn all
 the bandwidth up.  Most other protocols are not nearly so efficient, and
 so you cannot get the speed from them.  The same machines and all worked
 much slower doing rcp as transfer means (but I don't remember the numbers,
 this was what, 9 years ago?).
 
 So, whats the point?  Its this:
 
 What's your network loading?  Have you run anything (like Sun's SE performance
 toolkit (which unfortunatly only works on sun) to see if your network
 is working up to snuff?
 
 Going off in another direction: Do you have enough RAM?  Programs running
 out of disk instead of out of ram are DEADLY slow...  And the amount of 
 data you have to worry about is non-trivial here...  ;-)
 
   switched network with both boxes on the SAME subnet.
  
  I think you still have other problems. A ftp of a 10Mbyte file takes my
  boxes about 1s on a 100Mbit wire. 
 
 probably due as much to file transfer rates as network speeds!
 What happens if you do the transfer twice?  (So you use the cache
 for the file data instead of having to read it from the disk -
 assuming that you're not memory limited!)
 
  Be sure to try ftp both get and put. Oftentimes ...
  I think you might have some network issues. 
 
 I'm going to have to agree.  DNS is one possiblility, but that slow
 ftp reported tends to make me think its something else.
 
  Do a ifconfig on the
  linux box to see what sort of errors and collisions you are seeing.
 
 And on the AIX box also!
 
 What's the loading on that AIX box like?
 
 Can you try rsync-ing between 2 linux boxes?  Between 2 aix boxes?
 (First, try ftp transfers and see what kind of performance you get,
 once THAT is fixed then see how well rsync works between the 'correct'
 machines, before you worry about trying between other than the machines
 you plan to use in the final setup)
 
   Am I missing something?
 
 Probably (not much help, eh? ;-)
 
   Should I expect these speeds? 
 
 No, this is slow.
 
   Could there be something on the App switch throttling rsync for some reason?
 
 Umm, I think I 

Re: Re[2]: Rsync Throughput issues

2001-10-09 Thread Ben Ricker

On Tue, 2001-10-09 at 14:51, meg wrote:
 we use rsync on aix 4.1.5 (apple hardware).  it builds very easily. 
 do you have specific questions?
 
 goodluck!

We are running on an RS6000 (H80 with single CPU and 1gb memory). We are
also running a rather large Oracle database on it. I noticed that when
rsync was running, there was a considerable I/O wait (at one point, it
was at 98%!). We installed Rsync from a binary dist, not compiled. I
assume by your use of build that you compiled from source?

Ben Ricker
System Administrator
Wellinx.com






Rsync Throughput issues

2001-10-08 Thread Ben Ricker

I am using Rsync between a Redhat Linux box and an AIX RS600. We have 
a about 30gb of database we need to sync to a backup server. Sounds 
good, right? The problem is that Rsync is so slow when we do the 
initial dump. We have files that are 1 - 5gb. It takes around 14-20 
hours to Rsync the file structure, which means our daily backup will 
miss that days data because we are waiting for the data to rsync.

At first we were using ssh as the agent but we thought that might be 
slowing us down. We then tried to use rsync as a server and push the 
data to the module/zone of the server. Consistently, it takes roughly 
2 minutes to transfer 10meg of data

I thought it was a network problem so we tried ftp. A 10 mb file took 
90% the time (about 20 seconds. This is on a 100bit Full Duplex 
switched network with both boxes on the SAME subnet.

Am I missing something? Should I expect these speeds? Could there be 
something on the App switch throttling rsync for some reason?

Any thought will help,

Ben Ricker
System Administrator
Wellinx.com