On Sep 3, 2010, at 1:02 PM, erland wrote:

> 
> andyg;574039 Wrote: 
>> MD5 is fine, the problem is the first 10000 bytes of these files are
>> identical.  I think the easiest way to deal with it is to not take the
>> bytes from the very beginning of the file but from somewhere in the
>> middle.
>> 
> It just felt strange that using a md5_size of 100 000 reports different
> number of duplicates than a setting of 500 000, shouldn't the padding be
> irrelevant when using a larger md5_size settings ?

Yeah, any false-positive duplicates need to be investigated as to why so many 
bytes are identical.
_______________________________________________
beta mailing list
beta@lists.slimdevices.com
http://lists.slimdevices.com/mailman/listinfo/beta

Reply via email to