At 10:53 PM -0500 7/16/01, Gilles Detillieux wrote:
>this, but in practise there are some hashing functions that are less
>likely to fail than others.  I like the seed for a random number idea.
>It would be worth a try.  I wonder, though, if there wouldn't be some
>other list of numbers that would have the same effect on this function.

For any set hashing function (rather than a "perfect hash" table), I 
can come up with a nasty list of numbers. >:-) I can't say that the 
current function is best for everything--and we might want to have 
multiple hash functions (e.g. one for URLs and one for regular text 
might help a lot).

I set the hash function to recognize numbers because it seemed to 
save a lot on the texts I was indexing. But I'd be glad to make them 
seeds for random numbers and then throw them through some sort of 
mangling function--but it doesn't make sense to waste too much time 
or we might as well treat them as ASCII again.

-Geoff

_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/htdig-dev

Reply via email to