My C++ skills are, umm, okay. :-) But thanks for saving me time, because at least I
know to start in the htsearch/Display.cc.
As for an HTML object/segment having a factor of zero, I think it would make a lot of
sense to have it drop off (if it were the only matching component to the query). I
will keep you informed of my progress or lack thereof.
[.kate]
-----Original Message-----
From: Gilles Detillieux [EMAIL PROTECTED]
Sent: Thu, 13 Sep 2001 16:31:47 -0500 (CDT)
To: [EMAIL PROTECTED]
CC: [EMAIL PROTECTED]
Subject: Re: [htdig-dev] text_factor:0 It doesn't appear to be working.
According to Katherine Porter:
> I'm having some problems with the text_factor. I set text_factor:
> 0 in my configuration, however, in the -vvvv htdig output, I'm getting
> the words showing up with their location.
As Geoff mentioned, in 3.2 the *_factor attributes only affect htsearch,
not htdig. In any case (3.1 or 3.2) a factor of 0 just drops the score
of a match to 0, putting the match at the end of the list of results, but
not dropping it off the list. I believe that's the case for both 3.1.x
and 3.2.0bx. Causing 0 scores to remove matches from the results would
be a new feature, I think, because the scores just affect rankings, and
not selection or rejection.
> By the way, I'm still using
> the htdig-3.2.0b4-072201 code base. If you think that trying a more
> recent one will fix it, let me know, but I don't remember any discussion
> regarding any code commits in this area.
Not all changes are discussed on htdig-general or htdig-dev before
being committed. However, all CVS log entries for committed changes are
posted to htdig-updates. There was a change on Sept. 4 that might have
an effect on score calculations, but nothing that I know of will cause
low or zero scores to be dropped off the result list.
> I've made hyperlinks out
> to the actual documents/config files so that it's easier to follow
> without clogging up your entire screen.
>
> config file: http://www.geocities.com/turbokate2000/sptest_sample_config.txt
>
> crawled file: http://www.geocities.com/turbokate2000/sptest.html
>
> htdig output: http://www.geocities.com/turbokate2000/sptest_output.txt
>
> When I perform htsearch on terms "body1" or "body2", I get results back,
> when I shouldn't. Any clues that help me completely ignore body text,
> and only use title, description, or keywords are appreciated.
How are your C++ skills? :-) I think this would require changes to
Display::buildMatchList(), in htsearch/Display.cc.
--
Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba Phone: (204)789-3766
Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930
___________________________________________________________________________
Visit http://www.visto.com.
Find out how companies are linking mobile users to the
enterprise with Visto.
_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev