According to Jonas Larsson:
> I have noticed a strange behaviour in how htdig
> computes the score when you search for a keyword.
> 
> If I index my site in one big htdig run and then
> search for a specific keyword in the generated database
> using htsearch I get one set of "scores" for the
> documents found.
> 
> If I on the other hand index different parts of my site
> with several htdig runs, merge the databases together
> into one big database and then search again for the same
> keyword using htsearch I get a different set of "scores" 
> for the documents found. The score for the same document
> is often different - strange, seems incorrect.
> 
> Is there a good explanation to this behaviour? 

This is partly guesswork on my part, but one item that adds a lot of
weight to a page is the link description text from other documents
that link to it.  When you index the site in separate parts, I think
you'll lose the link descriptions for any links that go from one part
to another, because the linking document and its target are never seen
in the same htdig run.  That could have an impact on scores - quite
possibly a profound one.
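
To make that guess concrete, here is a toy model in Python.  It is not
htdig's actual scoring code: the weights, the two-page site, and the
index/merge/score helpers are all made up for illustration, and it
assumes a link description is only credited to its target when both
documents are handled in the same run.

```python
from collections import defaultdict

# Hypothetical weights, chosen only for illustration (not htdig's values).
BODY_WEIGHT = 1
LINK_TEXT_WEIGHT = 5

# Toy site: url -> (body text, [(target url, link description text)])
SITE = {
    "/a/index.html": ("welcome to part a", [("/b/manual.html", "htdig manual")]),
    "/b/manual.html": ("manual for the search engine", []),
}


def index(pages):
    """Build {url: {word: weight}} for one indexing run over `pages`.

    Assumption: link description text is credited to the target page
    only if the target is part of this same run.
    """
    db = {url: defaultdict(int) for url in pages}
    for url, (body, links) in pages.items():
        for word in body.split():
            db[url][word] += BODY_WEIGHT
        for target, text in links:
            if target in db:  # targets outside this run get nothing
                for word in text.split():
                    db[target][word] += LINK_TEXT_WEIGHT
    return db


def merge(*dbs):
    """Combine per-run databases; descriptions lost earlier stay lost."""
    merged = {}
    for db in dbs:
        merged.update(db)
    return merged


def score(db, url, keyword):
    """Look up the accumulated weight of `keyword` for `url`."""
    return db[url].get(keyword, 0)


one_run = index(SITE)
part_a = index({u: p for u, p in SITE.items() if u.startswith("/a/")})
part_b = index({u: p for u, p in SITE.items() if u.startswith("/b/")})
merged = merge(part_a, part_b)

print(score(one_run, "/b/manual.html", "manual"))  # body + link text: 6
print(score(merged, "/b/manual.html", "manual"))   # body only: 1
```

In this sketch the same document scores 6 for "manual" after a single
run but only 1 after separate runs and a merge, because the link from
part A to part B was never credited.  Whether the real difference is
this large depends on how heavily link descriptions are weighted in
your configuration.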

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930
