According to Mohler, Jeff:
> > When I use:
> >  http://gso-sparky.hq.netapp.com/form.htm
> > to search for 'bla.bla.bla' I get error:
> >  "No matches were found for 'blablabla'"
> > Note the missing dots.
> 
> There are configuration details in our lists that need searched on
> from time to time, what do I need to change to get htdig to use that
> specific string and not wipe out the dots?

Well, first of all, are you sure that stripping out the punctuation
is a problem?  Note that the same process is done during the indexing
phase, so that if there was a document that was indexed that contained
bla.bla.bla, a search for blablabla would find it!  This is only a problem
if you get a lot of false positives, i.e. if you MUST treat bla.bla.bla
and blabl.abla as different words, and you can't allow a search for one
of these to match another with different punctuation in it.

If you really need to treat the period as a significant character, i.e.
just like a letter, then you can remove it from valid_punctuation (set
valid_punctuation in your htdig.conf to something other than the built-in
default) and add the period to extra_word_characters.  If you do this,
though, the period will be treated as a letter in all contexts.  That
means that searching for the word "context" in the previous sentence would
fail because it would be indexed as "context." rather than "context".

See http://www.htdig.org/attrs.html#valid_punctuation
and http://www.htdig.org/attrs.html#extra_word_characters

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to