I have written an application in perl, to run multiple rsyncs, retrying those which fail. I have started getting this error on very large modules since i moved to the latest CVS version. ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ ToolSyncProgress Succeeded Getting big/tools/vlsi_local/etc from willy at 1004767211:2001:11:3:6:0:11. ToolSyncProgress Getting big1/cadappl1/hpux from willy at 1004767211:2001:11:3:6:0:11. rsync: connection unexpectedly closed (50466734 bytes read so far) rsync error: error in rsync protocol data stream (code 12) at io.c(139) ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ The script checks exit status and retries, ordinarily. However, when i get THIS error, the rsync process seems to be killing the perl process. I think it's a bug in the part of the code that cleans up the spawned processes. Here's another example, which shows that the timeout as total transfer time but is still active. It also ends by killing its calling process. The error looks different, though, but it's on a local nfs-to-nfs, instead of rsyncd-to-nfs. ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to chi-tools2 at 1004913115:2001:11:4:22:31:55. io timeout after 3600 seconds - exiting rsync error: timeout in data send/receive (code 30) at io.c(75) ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from /wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at 1004925444:2001:11:5:1:57:24 with 7680, retrying. ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to chi-tools2 at 1004925444:2001:11:5:1:57:24. io timeout after 3600 seconds - exiting rsync error: timeout in data send/receive (code 30) at io.c(75) ToolSyncProgress chinetsvr failed Replicating big1/cadappl1/hpux from /wan/chi-tools1/chi-tools1 to /wan/chi-tools2/chi-tools2 at 1004937647:2001:11:5:5:20:47 with 7680, retrying. ToolSyncProgress Replicating big1/cadappl1/hpux from chi-tools1 to chi-tools2 at 1004937647:2001:11:5:5:20:47. io timeout after 3600 seconds - exiting rsync error: timeout in data send/receive (code 30) at io.c(75) ToolSyncProgress ToolSyncProgress Replicating io timeout after 3600 seconds - exiting rsync error: timeout in data send/receive (code 30) at io.c(75) ToolSyncProgress ToolSyncProgress Replicating rsync error: received SIGUSR1 or SIGINT (code 20) at main.c(741) ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Any ideas? I can try to track the killing and timeout logic, if there's not already a fix out there that was just accidentally dropped.
Tim Conway [EMAIL PROTECTED] 303.682.4917 Philips Semiconductor - Longmont TC 1880 Industrial Circle, Suite D Longmont, CO 80501 Available via SameTime Connect within Philips, n9hmg on AIM perl -e 'print pack(nnnnnnnnnnnn, 19061,29556,8289,28271,29800,25970,8304,25970,27680,26721,25451,25970), ".\n" ' "There are some who call me.... Tim?"