On Fri, May 14, 2010 at 12:54 AM, Michael Tokarev <[email protected]> wrote:

> I add extra --link-dest argument on the receiving end, to simplify finding
> identical files.


That is not supported because it can radically change what rsync is doing.
 You may have an update in the batch file that is based on a different basis
file than the one that rsync found on the receiving side, which could
(attempt to) corrupt the file or cause other problems.  With the --hard-link
(-H) option, it can move the file's update around since it can change when
rsync figures out that a file with multiple hard links is not going to be
found elsewhere in the transfer.

For a batch update, you should have an identical set of files for the
batch-creation and batch-replay hosts.

If you're wanting to find more files to hard-link together, you should
probably do some kind of post-processing work.  For instance, in 3.1.0dev
(the upcoming release) there is a %C escape for the --out-format option that
would output the md5 sum of every file in the transfer.  If you had a list
of the current md5 sums on the disk, you could match them and hard-link them
together, updating the list with the new checksums.  Or you could create
something that scans through the parallel hierarchies looking for matching
name+mtime+size entries.

As for rsync 2 working with that idiom, it might be caused by a bug-fix in
the hard-linking code, or perhaps by some bug-fixing to the batch code, or
something similar.  Rsync 2 might just be getting lucky, or it might be
silently doing the wrong thing.

..wayne..
-- 
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html

Reply via email to