Re: [Haskell] Fingerprints and hashing

2007-10-15 Thread Norman Ramsey
Simon Peyton Jones writes: > We are all familiar with the idea of an MD5 checksum, which provides a > reliable "fingerprint" for a file, usually 128 bits or so... > For various applications (including identifying common sub-expressions, and > version tracking in GHC), I'd like a Haskell libra

Re: [Haskell] Fingerprints and hashing

2007-10-12 Thread Bryan O'Sullivan
Neil Mitchell wrote: Hi Simon, We are all familiar with the idea of an MD5 checksum, which provides a reliable "fingerprint" for a file, usually 128 bits or so. If the file changes, the fingerprint is (almost) certain to do so too. There are lots of techniques: CRC, shar?, MD5, etc. I be

Re: [Haskell] Fingerprints and hashing

2007-10-11 Thread Dan Weston
2007 11:27 | To: haskell | Subject: Re: [Haskell] Fingerprints and hashing | | If compositionality is important, at least Rabin's fingerprints are | worth considering: http://citeseer.ist.psu.edu/broder93some.html | | They have the neat property that the fingerprint of a concatenation of | s

Re: [Haskell] Fingerprints and hashing

2007-10-11 Thread Dan Weston
ginal Message- | From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Lauri Alanko | Sent: 11 October 2007 11:27 | To: haskell | Subject: Re: [Haskell] Fingerprints and hashing | | If compositionality is important, at least Rabin's fingerprints are | worth conside

RE: [Haskell] Fingerprints and hashing

2007-10-11 Thread Carl Witty
On Thu, 2007-10-11 at 12:28 +0100, Simon Peyton-Jones wrote: > Interesting! The associativity property is the kind of thing I was after. > Except that I don't really care if FP(as ++ bs) = FP(as) `plusFP` FP(bs). I > only care that the latter is robust in the sense of having low probabilty of

RE: [Haskell] Fingerprints and hashing

2007-10-11 Thread Simon Peyton-Jones
paper gives no comparison with other fingerprinting schemes. Simon | -Original Message- | From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Lauri Alanko | Sent: 11 October 2007 11:27 | To: haskell | Subject: Re: [Haskell] Fingerprints and hashing | | If compositionality is impo

Re: [Haskell] Fingerprints and hashing

2007-10-11 Thread Lauri Alanko
If compositionality is important, at least Rabin's fingerprints are worth considering: http://citeseer.ist.psu.edu/broder93some.html They have the neat property that the fingerprint of a concatenation of strings can be cheaply computed from the fingerprints of the constituents. I think this effect

Re: [Haskell] Fingerprints and hashing

2007-10-11 Thread Neil Mitchell
Hi Simon, > We are all familiar with the idea of an MD5 checksum, which provides a > reliable "fingerprint" for a file, usually 128 bits or so. If the file > changes, the fingerprint is (almost) certain to do so too. There are lots of > techniques: CRC, shar?, MD5, etc. I believe the basic o

[Haskell] Fingerprints and hashing

2007-10-11 Thread Simon Peyton-Jones
Dear Haskellers Here's a challenge. We are all familiar with the idea of an MD5 checksum, which provides a reliable "fingerprint" for a file, usually 128 bits or so. If the file changes, the fingerprint is (almost) certain to do so too. There are lots of techniques: CRC, shar?, MD5, etc. Fo