>Don't worry about the performance optimizations yet, there are many
>ways to solve that, I'm sure we can get decent performance on this in
>one way or another. At the moment it's more important that we can
>guarantee the uniqueness and that it works with all file formats.

MD5 can never guarantee uniqueness.
The more audio content we checksum, the lower the chance of false duplicates,
but even checksumming the whole audio content would not rule them out entirely,
and reading all the data would be really costly in performance.
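
To put a rough number on "never guaranteed", here is a small sketch of the
usual birthday-bound estimate. It assumes MD5 behaves like a uniformly random
128-bit function and that the hashed data really differs between files; the
function name and the library size are made up for illustration. It says
nothing about deliberately constructed collisions, or about files whose
checked portion is byte-for-byte identical (a partial check cannot tell
those apart at all).

```python
from fractions import Fraction


def md5_collision_probability(num_files: int, bits: int = 128) -> float:
    """Approximate chance that any two of num_files distinct inputs
    share a digest, using the birthday bound n(n-1)/2 / 2**bits."""
    pairs = num_files * (num_files - 1) // 2
    return float(Fraction(pairs, 2 ** bits))


if __name__ == "__main__":
    # Hypothetical library of one million tracks.
    print(md5_collision_probability(10 ** 6))
```

So the chance is tiny for any realistic library, but it is never zero.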

I was exploring the idea of calculating the checksum on one block of data and,
only if that turns out to be a duplicate, recalculating the checksum over more
data.  It is a nice idea in concept, checking a small subset unless more is
necessary, but in practice I can't see how it would work.
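
For reference, the staged idea could look roughly like the sketch below. It
is only an illustration under assumptions: it hashes raw file bytes rather
than decoded audio content, the starting block size is arbitrary, and the
names checksum_prefix and staged_checksums are hypothetical, not anything
in the existing code.

```python
import hashlib
import os
from collections import defaultdict

BLOCK_SIZE = 64 * 1024  # assumed starting block size, not a tuned value


def checksum_prefix(path, num_bytes):
    """MD5 of the first num_bytes of the file (or the whole file if shorter)."""
    md5 = hashlib.md5()
    remaining = num_bytes
    with open(path, "rb") as f:
        while remaining > 0:
            chunk = f.read(min(65536, remaining))
            if not chunk:
                break
            md5.update(chunk)
            remaining -= len(chunk)
    return md5.hexdigest()


def staged_checksums(paths, block_size=BLOCK_SIZE):
    """Return {path: checksum}, reading more data only for files whose
    checksums collide on the amount of data read so far."""
    result = {}
    pending = list(paths)
    size = block_size
    while pending:
        groups = defaultdict(list)
        for path in pending:
            groups[checksum_prefix(path, size)].append(path)
        pending = []
        for digest, members in groups.items():
            longest = max(os.path.getsize(m) for m in members)
            if len(members) == 1 or size >= longest:
                # Unique so far, or every member has been read in full.
                for m in members:
                    result[m] = digest
            else:
                pending.extend(members)
        size *= 2  # double the amount of data for the next round
    return result
```

One difficulty this makes visible: the checksum a file ends up with depends
on which other files were in the set, because that decides how much data was
read.  A track added later can collide with an already stored short checksum
and force both to be recalculated, so the value is not stable per file.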
_______________________________________________
beta mailing list
beta@lists.slimdevices.com
http://lists.slimdevices.com/mailman/listinfo/beta
