Re: [Haskell] Re: Fingerprints and hashing

Jan-Willem Maessen Thu, 11 Oct 2007 18:07:27 -0700


On Oct 11, 2007, at 4:33 PM, apfelmus wrote:
...

So, the idea is to use a "local gödel numbering" and uniquely number those and only those trees that are actually constructed (no collisions, but few in number). In other words, every new tree created gets the gödel number size collection + 1 and we can simply use
  type GödelNumber = Word32
assuming that no more than 4G trees will ever be constructed.
For fast hash-consing, the collection itself is a (generalized) trie
  type Collection  = ExpF GödelNumber ~~> Maybe GödelNumber
mapping each new top of a tree (constructor + gödel numbers for already known subtrees) to either Just its gödel number or Nothing when it's not in the collection yet.
With this, CSE becomes a catamorphism that allocates new gödel numbers if necessary
...
CSE in the sense that all common subexpressions now have the same gödel number.
Of course, the drawback compared to "free-form" fingerprinting is that the fingerprinting and the collection now depend on each other. But for CSE, we have to carry the collection around anyway.
I don't know any references for that method since I came up with it myself and haven't searched around yet. Any pointers?
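
The scheme described above might be sketched roughly as follows. This is only an illustration under stated assumptions, not the code elided from the post: the two-constructor ExpF, the Exp wrapper, and the helper names intern and cse are hypothetical, an ordinary Data.Map stands in for the generalized trie, and the catamorphism is spelled out as explicit recursion.

```haskell
import qualified Data.Map.Strict as M
import           Data.Word (Word32)

-- Hypothetical expression functor; the post does not show its definition.
data ExpF a = Const Int | Add a a
  deriving (Eq, Ord, Show)

-- Ordinary trees: the fixed point of ExpF.
newtype Exp = In (ExpF Exp)

type GödelNumber = Word32

-- Assumption: a Data.Map stands in for the generalized trie.
-- M.lookup plays the role of "ExpF GödelNumber ~~> Maybe GödelNumber".
type Collection = M.Map (ExpF GödelNumber) GödelNumber

-- Number one tree top whose subtrees are already numbered.
-- A top that is not yet in the collection gets "size collection + 1".
intern :: ExpF GödelNumber -> Collection -> (GödelNumber, Collection)
intern node coll =
  case M.lookup node coll of
    Just n  -> (n, coll)
    Nothing ->
      let n = fromIntegral (M.size coll) + 1
      in  (n, M.insert node n coll)

-- Bottom-up fold over the tree, threading the collection by hand.
cse :: Exp -> Collection -> (GödelNumber, Collection)
cse (In (Const k)) coll = intern (Const k) coll
cse (In (Add l r)) coll =
  let (nl, coll1) = cse l coll
      (nr, coll2) = cse r coll1
  in  intern (Add nl nr) coll2
```

For instance, running cse on (x + x) + (x + x) with x = Const 1, starting from M.empty, leaves three entries in the collection: both occurrences of x + x are mapped to the same gödel number.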

Actually, the paper that Lauri cited in his message mentions essentially this technique; this is equivalent to the permutation T() that they impose when fingerprinting a DAG. That citation again:

    http://citeseer.ist.psu.edu/broder93some.html

But it's generally pretty well known.

-Jan



