On Sep 3, 2010, at 1:02 PM, erland wrote:

>
> andyg;574039 Wrote:
>> MD5 is fine, the problem is the first 10000 bytes of these files are
>> identical. I think the easiest way to deal with it is to not take the
>> bytes from the very beginning of the file but from somewhere in the
>> middle.
>>
> It just felt strange that using a md5_size of 100 000 reports different
> number of duplicates than a setting of 500 000, shouldn't the padding be
> irrelevant when using a larger md5_size settings ?
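
For illustration, the idea quoted above (hash a window from the middle of the file, not from the very beginning) could look roughly like the sketch below. This is not the server's actual code: the function name partial_md5 and the from_middle flag are made up for the example, and only md5_size is a name taken from this thread.

```python
import hashlib
import os


def partial_md5(path, md5_size=10000, from_middle=True):
    """Return the hex MD5 of at most md5_size bytes read from the file.

    Hypothetical helper, for illustration only. With from_middle=True the
    window is centred in the file, so identical leading bytes (padding,
    for instance) no longer decide the result.
    """
    file_size = os.path.getsize(path)
    if from_middle:
        # Centre the window; clamp to 0 when the file is smaller than md5_size.
        offset = max(0, (file_size - md5_size) // 2)
    else:
        offset = 0
    with open(path, "rb") as fh:
        fh.seek(offset)
        return hashlib.md5(fh.read(md5_size)).hexdigest()
```

Two files that share their first 10000 bytes but differ later get the same digest with from_middle=False and md5_size=10000, and different digests once the window is moved to the middle, provided the middle of the files actually differs.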

Yeah, any false-positive duplicates need to be investigated as to why so many bytes are identical.

_______________________________________________
beta mailing list
beta@lists.slimdevices.com
http://lists.slimdevices.com/mailman/listinfo/beta