--- Jeff Kirby wrote: [snip] > Here is a brief description of what I'm trying to > accomplish: > We have about 60,000 documents that we are indexing, > most of them have > statute numbers (similar to "356.47(b)(a)" )... > you'll notice a problem > right off the bat when looking at this... and that > is the period. Now > if I include the period and open/close paranthesis, > then I'm going to be > indexing invalid words as well...
You can modify the default "bad words" list. These are words that are excluded when digging or searching. I think the easiest way to do this is the bad_word_list config file attribute. http://www.htdig.org/attrs.html#bad_word_list Josh __________________________________ Do you Yahoo!? Yahoo! Finance Tax Center - File online. File on time. http://taxes.yahoo.com/filing.html ------------------------------------------------------- This SF.Net email is sponsored by: IBM Linux Tutorials Free Linux tutorial presented by Daniel Robbins, President and CEO of GenToo technologies. Learn everything from fundamentals to system administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general

