According to Mike:
> Thanks, I missed that one when I was looking for the answer.
>
> One other thing I can't seem to get a handle on is the amount of data
> shown between the <head><title> section.
> This would be the max_head limit, heading_factor and title factor from
> what I've learned so far. Basically, some web site authors use
> ridiculously long <title> sections which end up showing up complete in
> the search. I would like to index that information but I don't want it
> all displayed.
>
> What can you suggest as a good balance since I assume that I'll have to rebuild my
> database again as this is part of htdig?

There is no configuration attribute to limit the length of the title
field. You would have to patch the code to do this. If you patch
the code in htdig/Retriever.cc, as Geoff suggested a couple of years ago
(see http://www.mail-archive.com/[email protected]/msg06949.html),
then you will have to rebuild your database. If instead you just
patch htsearch/Display.cc to shorten the title field before displaying it
(i.e. before setting the TITLE template variable, around line 300 in
an unpatched htdig-3.1.5/htsearch/Display.cc), then you won't need
to reindex.

Which approach is best depends on what your goal is. If you want
to limit search engine spamming (stuffing the title with tons of words
to get a higher ranking), then you would have to patch htdig and reindex.
If you just want to avoid really long titles in the search results,
it may be sufficient to patch htsearch. Based on what you wrote above,
I'd guess the latter is what you want. What would be considered a good
balance is entirely subjective, so choose the length you feel is best.
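
To give a rough idea of the htsearch-side approach, here is a minimal,
standalone sketch of the kind of truncation you could apply to the title
before it is displayed. It is not the actual Display.cc code: the helper
name shorten_title and its max_len parameter are made up for illustration,
and it uses std::string, whereas ht://Dig has its own String class, so you
would have to adapt it. It also counts bytes, not characters, so a
multi-byte title could be cut in the middle of a character.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper, not part of ht://Dig: returns `title` cut down to
// at most `max_len` bytes, ending in "..." when something was cut off.
std::string shorten_title(const std::string &title, std::size_t max_len)
{
    const std::string ellipsis = "...";

    // Short enough already: leave it alone.
    if (title.length() <= max_len)
        return title;

    // No room for an ellipsis: just hard-cut.
    if (max_len <= ellipsis.length())
        return title.substr(0, max_len);

    std::size_t cut = max_len - ellipsis.length();

    // Prefer to break at the last space at or before the cut point,
    // so that a word is not chopped in half.
    std::size_t space = title.rfind(' ', cut);
    if (space != std::string::npos && space > 0)
        cut = space;

    return title.substr(0, cut) + ellipsis;
}
```

You would call something like this on the title string just before it is
assigned to the TITLE template variable, with whatever limit you settle on.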
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
_______________________________________________
htdig-general mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/htdig-general