Hello!
Here's what I've done. I've loaded a brand new backup server just out
of the box, and a test host running actual RHEL4 on a previously unused
computer. In other words, 100% of the hardware (including the network
cables and switch) are completely different. I get the same error.
I have included detailed information about the systems involved at the
end of this e-mail, including the BackupPC and rsync logs. I am running
the latest BackupPC, the latest File:RsyncP and a standard Red Hat rsync
binary. Yet, I get exactly the same error.
I also decided to retest running rsync directly from the backup server,
using the command-line specified for BackupPC. It turns out actual
rsync *does* fail on the backup server. Here is the error:
BackupPC side:
# rsync --numeric-ids --perms --owner --group -D --links --hard-links
--times --block-size=2048 --recursive --one-file-system
[EMAIL PROTECTED]::ROOT/* .
rsync: read error: No route to host
rsync error: error in rsync protocol data stream (code 12) at io.c(177)
rsync: connection unexpectedly closed (1423795 bytes read so far)
rsync error: error in rsync protocol data stream (code 12) at io.c(165)
#
Host side: /var/log/rsyncd.log
2007/01/31 12:16:59 [2625] rsyncd version 2.6.3 starting, listening on
port 873
2007/01/31 12:17:16 [2627] name lookup failed for 172.28.16.32: Name or
service not known
2007/01/31 12:17:16 [2627] rsync on . from [EMAIL PROTECTED] (172.28.16.32)
2007/01/31 12:34:15 [2627] rsync: writefd_unbuffered failed to write
4096 bytes: phase "unknown" [sender]: Connection timed out (110)
2007/01/31 12:34:15 [2627] rsync error: error in rsync protocol data
stream (code 12) at io.c(909)
While it still fails, actual rsync seems to copy more data faster than
File::RsyncP. In fact, with only 500MB of data, I was actually able to
get an entire copy done once with the real rsync binary.
To eliminate the possibility of special files causing problems, and to
generate more test data, I created a test directory and cp -a /usr 4
times times into the test directory, creating 2.0GB of data (and I later
increased it to 4GB). It still fails.
I realized that I've only used my VIA-based backup servers. Different
systems *and* different operating systems, but same model of hardware.
So, I grabbed another computer (IBM NetVista), loaded it with CentOS 3.8
(just like my backup servers) and ran the above command-line rsync
against the test data. Same error.
I then booted Knoppix 3.2 with a 2.4 kernel on the NetVista and did a
command-line rsync. Same error! I then booted Knoppix 3.9 with a 2.6
kernel: same error. Knoppix 4.0.2: same error. Knoppix 5.0.1: Same
error.
In doing a bunch of tests (which are not all detailed here), I began to
think that there might be a relationship between system speed and the
amount of data that could be copied: the NetVista tended to copy more
data before the failure than the VIA does (though not always). So, I
booted a Knoppix 3.6 CD (3.2 doesn't support the NIC) on my notebook
(IBM ThinkPad R52, Pentium M 1.86GHz, 1.5GB RAM). It too failed, but
much farther: 1.3GHz. I then booted Knoppix 3.7: it failed at 700MB.
I then booted Knoppix 3.9. It failed once at 550MB. I reran it, and
it finished successfully. So, I doubled the size of the data to 4GB.
Reran it: it failed at over 1GB. It seems that my notebook will still
fail, but it's a little harder to get it to fail.
So, I am able to reproduce the error with 3 different computers, 2
completely different operating system families and a bunch of different
versions.
Now that I'm on a disposable computer, I can start making changes to the
host side.
Anything else you want to see on the server side? I'm out of ideas
there. I will start manipulating the host side when I get some more
free time.
Timothy J. Massey
====================================
Detailed Server and Host Information
====================================
Backup Server
=============
Hardware: VIA EPIA-MII 1.2GHz motherboard, 512MB RAM, -or-
IBM NetVista 6790-CAU (Intel Pentium 4 1.6GHz), 512MB RAM -or-
IBM ThinkPad R52 (Intel Pentium M 1.86GHz), 1.5GB RAM
OS: Stock CentOS 3.8
Minimal install with two additional options
Web Server (all subitems unchecked)
Windows Server (all subitems unchecked)
Stock Knoppix 3.2
Stock Knoppix 3.6
Stock Knoppix 3.7
Stock Knoppix 3.9
Stock Knoppix 4.0.2
Stock Knoppix 5.0.1
Non-BackupPC software needed by BackupPC:
CGI.pm-3.25.tar.gz
compat-libstdc++-7.3-2.96.128.i386.rpm
Encode-2.18.tar.gz
par2cmdline-0.4-alt0.M24.1.i586.rpm
perl-Archive-Zip-1.16-1.1.el3.rf.noarch.rpm
perl-Compress-Zlib-1.42-1.el3.rf.i386.rpm
perl-File-RsyncP-0.68-1.el3.rf.i386.rpm
perl-HTML-Parser-3.26-17.i386.rpm
perl-HTML-Tagset-3.03-28.noarch.rpm
perl-libwww-perl-5.65-6.noarch.rpm
perl-suidperl-5.8.0-94.EL3.i386.rpm
perl-URI-1.21-7.noarch.rpm
perl-XML-Parser-2.31-16.EL3.i386.rpm
perl-XML-RSS-1.10-1.el3.rf.noarch.rpm
BackupPC 3.0.0
Config.pl
RsyncArgs:
$Conf{RsyncArgs} = [
#
# Do not edit these!
#
'--numeric-ids',
'--perms',
'--owner',
'--group',
'-D',
'--links',
'--hard-links',
'--times',
'--block-size=2048',
'--recursive',
#
# Rsync >= 2.6.3 supports the --checksum-seed option
# which allows rsync checksum caching on the server.
# Uncomment this to enable rsync checksum caching if
# you have a recent client rsync version and you want
# to enable checksum caching.
#
#'--checksum-seed=32761',
#
# Add additional arguments here
#
'--one-file-system',
];
# uname -a:
Linux vault.example.com 2.4.21-47.EL #1 Tue Aug 1 08:56:24 EDT 2006 i686
i686 i386 GNU/Linux
# perl -V
Summary of my perl5 (revision 5.0 version 8 subversion 0) configuration:
Platform:
osname=linux, osvers=2.4.21-37.0.1.el.centos3.xfs.0smp,
archname=i386-linux-thread-multi
uname='linux sillage.bis.pasteur.fr
2.4.21-37.0.1.el.centos3.xfs.0smp #1 smp fri jan 20 10:58:28 cet 2006
i686 athlon i386 gnulinux '
config_args='-des -Doptimize=-O2 -g -pipe -march=i386 -mcpu=i686
-Dmyhostname=localhost [EMAIL PROTECTED] -Dcc=gcc -Dcf_by=Red
Hat, Inc. -Dinstallprefix=/usr -Dprefix=/usr -Darchname=i386-linux
-Dvendorprefix=/usr -Dsiteprefix=/usr
-Dotherlibdirs=/usr/lib/perl5/5.8.0 -Duseshrplib -Dusethreads
-Duseithreads -Duselargefiles -Dd_dosuid -Dd_semctl_semun -Di_db
-Ui_ndbm -Di_gdbm -Di_shadow -Di_syslog -Dman3ext=3pm -Duseperlio
-Dinstallusrbinperl -Ubincompat5005 -Uversiononly -Dpager=/usr/bin/less
-isr'
hint=recommended, useposix=true, d_sigaction=define
usethreads=define use5005threads=undef useithreads=define
usemultiplicity=define
useperlio=define d_sfio=undef uselargefiles=define usesocks=undef
use64bitint=undef use64bitall=undef uselongdouble=undef
usemymalloc=n, bincompat5005=undef
Compiler:
cc='gcc', ccflags ='-D_REENTRANT -D_GNU_SOURCE -DTHREADS_HAVE_PIDS
-DDEBUGGING -fno-strict-aliasing -I/usr/local/include
-D_LARGEFILE_SOURCE -D_FILE_OFFSET_BITS=64 -I/usr/include/gdbm',
optimize='-O2 -g -pipe -march=i386 -mcpu=i686',
cppflags='-D_REENTRANT -D_GNU_SOURCE -DTHREADS_HAVE_PIDS
-DDEBUGGING -fno-strict-aliasing -I/usr/local/include -I/usr/include/gdbm'
ccversion='', gccversion='3.2.3 20030502 (Red Hat Linux 3.2.3-54)',
gccosandvers=''
intsize=4, longsize=4, ptrsize=4, doublesize=8, byteorder=1234
d_longlong=define, longlongsize=8, d_longdbl=define, longdblsize=12
ivtype='long', ivsize=4, nvtype='double', nvsize=8, Off_t='off_t',
lseeksize=8
alignbytes=4, prototype=define
Linker and Libraries:
ld='gcc', ldflags =' -L/usr/local/lib'
libpth=/usr/local/lib /lib /usr/lib
libs=-lnsl -lgdbm -ldb -ldl -lm -lpthread -lc -lcrypt -lutil
perllibs=-lnsl -ldl -lm -lpthread -lc -lcrypt -lutil
libc=/lib/libc-2.3.2.so, so=so, useshrplib=true, libperl=libperl.so
gnulibc_version='2.3.2'
Dynamic Linking:
dlsrc=dl_dlopen.xs, dlext=so, d_dlsymun=undef, ccdlflags='-rdynamic
-Wl,-rpath,/usr/lib/perl5/5.8.0/i386-linux-thread-multi/CORE'
cccdlflags='-fPIC', lddlflags='-shared -L/usr/local/lib'
Characteristics of this binary (from libperl):
Compile-time options: DEBUGGING MULTIPLICITY USE_ITHREADS
USE_LARGE_FILES PERL_IMPLICIT_CONTEXT
Locally applied patches:
MAINT18379
Built under linux
Compiled at Jun 10 2006 13:23:46
@INC:
/usr/lib/perl5/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/5.8.0
/usr/lib/perl5/site_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/site_perl/5.8.0
/usr/lib/perl5/site_perl
/usr/lib/perl5/vendor_perl/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/vendor_perl/5.8.0
/usr/lib/perl5/vendor_perl
/usr/lib/perl5/5.8.0/i386-linux-thread-multi
/usr/lib/perl5/5.8.0
.
#
rsync copy error:
rsync --numeric-ids --perms --owner --group -D --links --hard-links
--times --block-size=2048 --recursive --one-file-system
[EMAIL PROTECTED]::ROOT/* . (wd: /data/junk)
rsync: read error: No route to host
rsync error: error in rsync protocol data stream (code 12) at io.c(177)
rsync: connection unexpectedly closed (1423795 bytes read so far)
rsync error: error in rsync protocol data stream (code 12) at io.c(165)
BackupPC Host log:
2007-01-31 13:24:55 full backup started for directory ROOT
2007-01-31 13:41:53 Got fatal error during xfer (Child exited prematurely)
2007-01-31 13:41:58 Backup aborted (Child exited prematurely)
2007-01-31 13:41:58 Saved partial dump 0
BackupPC System log:
2007-01-31 13:05:20 Reading hosts file
2007-01-31 13:05:20 BackupPC started, pid 2535
2007-01-31 13:05:20 Running BackupPC_trashClean (pid=2541)
2007-01-31 13:05:20 Next wakeup is 2007-01-31 14:00:00
2007-01-31 13:24:54 User backupuser requested backup of victim (victim)
2007-01-31 13:24:55 Started full backup on victim (pid=2583, share=ROOT)
2007-01-31 13:41:58 Backup failed on victim (Child exited prematurely)
2007-01-31 13:41:58 Running BackupPC_link victim (pid=2619)
2007-01-31 13:43:05 Finished victim (BackupPC_link victim)
dmesg is identical to before starting backup.
# tail /data/pc/victim/NewFileList
3586dfe8fe720770cd191f943f3c7e06 74544
fROOT/fusr/flib/flibopcodes-2.15.92.0.2.so
0abb6364411a0900ec408630590e55c1 87622 fROOT/fusr/flib/flibopcodes.a
2bafac95f3f9744f5c957fe005a41659 13 fROOT/fusr/flib/flibpanel.so
f711c6feebb1b89a3a75180cddd94a32 15 fROOT/fusr/flib/flibpanel.so.5
b6dd3eb54188794681f1ea5b151c6d83 49335 fROOT/fusr/flib/flibpanel.so.5.4
02d893b8ef453af2ac30a56f338ed98e 14 fROOT/fusr/flib/flibpanelw.so
08a20e152294a77cd40a0aa42a959318 16 fROOT/fusr/flib/flibpanelw.so.5
a7b670210fd963da0c79af78c770c82e 51679 fROOT/fusr/flib/flibpanelw.so.5.4
55b529fdab36a57951798940d18df1c2 23 fROOT/fusr/flib/flibparted-1.6.so.12
3fecc98d6ac16527ba5fe188232b777e 287280 fROOT/fusr/flib/flibparte#
# tail /data/pc/victim/XferLog
create d 755 0/0 4096 usr/lib/debug/usr/lib/tc
create 755 0/0 9504
usr/lib/debug/usr/lib/tc/q_netem.so.debug
create d 755 0/0 4096 usr/lib/debug/usr/sbin
create 755 0/0 15656 usr/lib/debug/usr/sbin/nstat.debug
create 755 0/0 21340 usr/lib/debug/usr/sbin/rtacct.debug
create 755 0/0 7636 usr/lib/debug/usr/sbin/rtstat.debug
create 755 0/0 77256 usr/lib/debug/usr/sbin/ss.debug
create d 755 0/0 4096 usr/lib/games
create d 755 0/0 4096 usr/lib/gcc-lib
create d 755 0/0 12288 usr/lib/gconv
#
RHEL4 Host
==========
Hardware: IBM NetVista 6790-CAU
Intel Pentium 4 1.6GHz, 512MB RAM
OS: Stock RHEL4
Minimal install with zero additional options
Non-BackupPC software needed by BackupPC:
None
# uname -a:
Linux victim.example.com 2.6.9-5.EL #1 Wed Jan 5 19:22:18 EST 2005 i686
i686 i386 GNU/Linux
# rsync --version
rsync version 2.6.3 protocol version 28
Copyright (C) 1996-2004 by Andrew Tridgell and others
<http://rsync.samba.org/>
Capabilities: 64-bit files, socketpairs, hard links, symlinks, batchfiles,
inplace, IPv6, 64-bit system inums, 64-bit internal inums
rsync comes with ABSOLUTELY NO WARRANTY. This is free software, and you
are welcome to redistribute it under certain conditions. See the GNU
General Public Licence for details.
#
rsyncd.log:
2007/01/31 12:16:59 [2625] rsyncd version 2.6.3 starting, listening on
port 873
2007/01/31 12:17:16 [2627] name lookup failed for 172.28.16.32: Name or
service not known
2007/01/31 12:17:16 [2627] rsync on . from [EMAIL PROTECTED] (172.28.16.32)
2007/01/31 12:34:15 [2627] rsync: writefd_unbuffered failed to write
4096 bytes: phase "unknown" [sender]: Connection timed out (110)
2007/01/31 12:34:15 [2627] rsync error: error in rsync protocol data
stream (code 12) at io.c(909)
dmesg is identical to before starting backup.
-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier.
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
BackupPC-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/backuppc-users
http://backuppc.sourceforge.net/