According to Geoff Hutchison:
> At 9:15 AM -0500 9/5/00, Ted Stresen-Reuter wrote:
> >If you want, go to http://www.chicagophilanthropy.com/search/ and enter the
> >word "kraft" as the search term and you'll see what I mean. I've tried
> >deleting the databases and indexing again, but I still got the same
> >results....
>
> So here's the answer. I pored through your verbose output and found
> a few links like this:
>
> href: http://www.chicagophilanthropy.com/ (Published: March 1998 Kraft
> Foods, Inc. names Amina Dickerson ...)
>
> So this is where it's getting "Kraft"--from the link text. You can
> turn this off using description_factor since it doesn't seem to be
> working very well in your case. Usually the text of links is fairly
> accurate as a description of the page (or it's so general, like
> "click here," that it's not likely to show up in searches.)
>
> In any case, the combination of this and possibly backlink_factor are
> probably the reason you're getting these "phantom" matches.

It's strange that I didn't find any documents containing links like
the one above when I searched for "kraft" on his web site. Do these
documents contain any <meta name="robots" content="noindex,follow">
tags, or does his search form use a hidden "restrict" or "exclude" field
that I didn't notice? My understanding is that link description text
is supposed to appear in the index for both the hyperlinked document,
using description_factor, and the document containing the link, using
text_factor.
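
For reference, the attributes mentioned in this thread are set in the
ht://Dig configuration file. The fragment below is only a sketch, assuming
the usual "attribute: value" syntax of htdig.conf. The zero values are
illustrative choices, not settings taken from Ted's actual configuration,
and whether the databases must be rebuilt for them to take effect depends
on the ht://Dig version in use.

```
# Illustrative htdig.conf fragment (values are assumptions, not Ted's settings)

# Weight given to link description text when scoring the document
# that the link points to. A value of 0 should keep link text such as
# "Kraft Foods, Inc. names ..." from boosting the linked-to page.
description_factor: 0

# Weight given to backlinks (links from other documents) in ranking.
# A value of 0 should remove that influence on the results.
backlink_factor: 0
```

If the phantom matches persist after these changes and a fresh index,
the link text is probably not the only source of the word.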
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930