According to Jonas Larsson:
> I have noticed a strange behaviour in how htdig
> computes the score when you search for a keyword.
>
> If I index my site in one big htdig run and then
> search for a specific keyword in the generated database
> using htsearch I get one set of "scores" for the
> documents found.
>
> If I on the other hand index different parts of my site
> with several htdig runs, merge the databases together
> into one big database and then search again for the same
> keyword using htsearch I get a different set of "scores"
> for the documents found. The score for the same document
> is often different - strange, seems incorrect.
>
> Is there a good explanation for this behaviour?
This is partly guesswork on my part, but one item that adds a lot of
weight to a page is the link description text from other documents
that link to it. When you index the site in separate parts, I think
you lose the link descriptions for links from one part to another,
because each run only sees the documents it indexes itself. That
could have an impact on scores - quite possibly a profound one.
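
As a rough illustration of the effect described above, here is a toy
model in Python. It is not htdig's actual scoring code: the two weights,
the two-page site, and the rule that a run drops descriptions for links
pointing outside its own set of documents are all assumptions made up
for the example.

```python
# Toy model only: NOT htdig's real scoring algorithm.
TEXT_WEIGHT = 1         # hypothetical weight for a keyword hit in body text
DESCRIPTION_WEIGHT = 3  # hypothetical weight for a hit in link description text

# Hypothetical two-page site: /a links to /b with the description "install guide".
SITE = {
    "/a/index.html": {
        "body": "welcome to the manual",
        "links": {"/b/install.html": "install guide"},
    },
    "/b/install.html": {
        "body": "how to install the software",
        "links": {},
    },
}


def index(urls, site):
    """Index only `urls`. Assumption: a link description is recorded
    only when the link target is part of the same run."""
    db = {u: {"body": site[u]["body"].split(), "descriptions": []} for u in urls}
    for u in urls:
        for target, text in site[u]["links"].items():
            if target in db:
                db[target]["descriptions"].extend(text.split())
    return db


def merge(*dbs):
    """Combine per-run databases; descriptions lost in a run stay lost."""
    merged = {}
    for db in dbs:
        merged.update(db)
    return merged


def score(db, keyword):
    """Weighted count of keyword hits in body text and link descriptions."""
    return {
        u: TEXT_WEIGHT * e["body"].count(keyword)
        + DESCRIPTION_WEIGHT * e["descriptions"].count(keyword)
        for u, e in db.items()
    }


# One big run over the whole site.
whole_scores = score(index(list(SITE), SITE), "install")

# Two separate runs, merged afterwards.
part_a = index(["/a/index.html"], SITE)
part_b = index(["/b/install.html"], SITE)
merged_scores = score(merge(part_a, part_b), "install")

print(whole_scores)   # {'/a/index.html': 0, '/b/install.html': 4}
print(merged_scores)  # {'/a/index.html': 0, '/b/install.html': 1}
```

In this sketch the same document scores 4 after one big run but only 1
after the merge, purely because the cross-part link description was
never recorded.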
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930