storing the hash multiplier instead of the hash value

Andrei Alexandrescu Mon, 22 Mar 2010 10:55:19 -0700

Currently, each entry in a D hashtable stores the hash of the key for efficiency purposes.

There is a bit of redundancy: as soon as you have entered a hash bucket, you know that hashkey % tablesize is a specific number, call it k. But k is not enough information to restore the hash, so the actual hash gets stored for each entry.

I'm thinking of exploiting that redundancy by storing the hash multiplier, not the hash value. Instead of storing h in each slot, store m = h / tablesize (integer division). Then you can restore the actual h by writing:


restored_h = m * tablesize + k;

k is known for each given bucket (it's the index of the bucket) and m is what gets stored per entry.
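
To make the round trip concrete, here is a minimal sketch written in C for illustration (the arithmetic is the same in D). It assumes a 32-bit hash and integer division; the helper names bucket_index, hash_multiplier and restore_hash are hypothetical and are not taken from the D runtime.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helpers for illustration only; not the D runtime's code. */

/* Bucket index k: the remainder of the hash modulo the table size. */
static uint32_t bucket_index(uint32_t h, uint32_t tablesize) {
    return h % tablesize;
}

/* Multiplier m: the integer quotient, stored per entry instead of h. */
static uint32_t hash_multiplier(uint32_t h, uint32_t tablesize) {
    return h / tablesize;
}

/* Rebuild h from the stored m and the bucket index k.
   No overflow is possible here: the result equals the original h. */
static uint32_t restore_hash(uint32_t m, uint32_t tablesize, uint32_t k) {
    return m * tablesize + k;
}
```

For example, with h = 1000003 and tablesize = 97, the entry lands in bucket k = 30 and stores m = 10309; restore_hash(10309, 97, 30) gives back 1000003. Note that the stored m is only valid for the tablesize it was computed with, so a rehash would presumably have to restore h with the old size and split it again with the new one.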

What's the advantage of this approach? Well, the advantage is that m is a small number. Any hash function will try to disperse the hash value as much as possible across the 32 available bits. But m will be a smaller number, and therefore will be more grouped and will engender fewer false pointers.
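
To put a rough number on "small", the sketch below (again in C, assuming a 32-bit hash; the helper names max_multiplier and bit_width are hypothetical) computes the largest m any hash can produce for a given table size, and how many bits that value occupies. It only bounds the magnitude of m; whether that actually reduces false pointers in practice is the open question of this post.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helpers for illustration only. */

/* Largest multiplier m = h / tablesize that any 32-bit hash h can produce. */
static uint32_t max_multiplier(uint32_t tablesize) {
    return UINT32_MAX / tablesize;
}

/* Number of significant bits in v (0 for v == 0). */
static unsigned bit_width(uint32_t v) {
    unsigned bits = 0;
    while (v != 0) {
        bits++;
        v >>= 1;
    }
    return bits;
}
```

With tablesize = 1024, max_multiplier returns 4194303, which fits in 22 bits, so the top 10 bits of every stored m are zero. With tablesize = 97 the bound is 44278013, which fits in 26 bits. The larger the table, the smaller the stored values.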


Would this help?


Andrei
