I had a look at the bad_word file that came with htdig. It's very small, so
many very common words would still be indexed.

I've created a much larger list - partly based on the standard "stop words"
from SWISH-E but edited and extended. This takes into account how htdig
treats apostrophes by default.

I'm using this basic list to create site-specific lists with extra words
that occur on practically every page in a site (such as my name ;-)).

If anyone is interested in the basic list, which now contains 348 "words",
I can zip it up and post it on the  web somewhere. No private emails,
please, just post to the  list and I'll post the URL to the list.

Marjolein Katsma      [EMAIL PROTECTED]
Java Woman - http://javawoman.com/
----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the body of the message.

Reply via email to