On Mon, Apr 19, 2010 at 3:17 PM, Benson Margulies <bimargul...@gmail.com>wrote:

> That doesn't bother my intuition so much, since the two hashes are
> *different*. Or maybe I'm not following the implications of the linear
> conbination.
>

Yes, but h_1 + h_2 and h_1 + 2 h_2 are pretty similar.  That was the
surprise.  The win comes from the fact that the linear combo is way faster
than hashing from scratch.


>
> It's the conceptual model I'd like to understand here. In my
> 'understanding', bloom filters work because each hash function grabs a
> different picture of the total information content of the original key.
>

A good hash does this if you have different seeds.


>
> Generally, I am indeed feeling like a bear of rapidly shrinking brain,
> since
> that page is at pains to insist on two independent has functions.
>

For a good hash function, different seeds give you essentially independent
hashes.  Otherwise, it would be easy to recover the seed given the two
hashes which contracts "goodness" of a hash.

Reply via email to