Thanks

On Tue, May 25, 2010 at 1:26 PM, Benjamin Watkins
<ben-l...@constant-technologies.com> wrote:
> On 5/25/2010 6:41 AM, Mag Gam wrote:
>>
>> I know rsync can do many things but I was wondering if anyone is using
>> it for data deduplication on a large filesystem. I have a filesystem
>> which is about 2TB and I want to make sure I don't have the same data
>> in a different place of a filesystem. Is there an algorithm for that?
>>
>
> While rsync is not an appropriate tool for this, I have successfully used
> dupseek in the past.
>
>    http://freshmeat.net/projects/dupseek/
>
> It is a perl script, so I expect you should be able to use it on any
> platform you need.  It show support for POSIX/Linux, but I expect it can run
> under Windows as well if you are comfortable with Cygwin.
>
> I'm sure there are many more tools like this.  I used this one because it
> was optimized for large files.
>
> -Ben
>
>
-- 
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html

Reply via email to