On Sunday, July 7, 2002, at 08:20  PM, Jim Cole wrote:

>> Is it possible to configure ht://dig to return documents ONLY if there
>> are three or more occurrences of the search term (or phrase) in the
>> document?
>
> I am not aware of any configuration setting that allows this sort
> of functionality. I seem to recall reading that the databases are
> structured in a manner that makes it non-trivial to obtain
> per-document word frequencies.

This is not entirely true, but close enough. Basically, you'd still have
to retrieve all the documents and then filter the search results
carefully. You certainly could code in such a restriction (i.e. while
it's doing the scoring, it counts the number of word occurrences), but
the key word there at the moment is *code*. Since no one else has really
asked for this, it's not high on any of the active developers' lists.
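
To make the idea concrete, here is a rough sketch of that kind of
restriction done as a post-filter outside ht://dig. It is not ht://dig
code and does not touch its databases: the function names and the
URL-to-text dictionary are hypothetical, and it assumes you already have
the text of each candidate document in hand.

```python
import re


def count_occurrences(text, term):
    """Count case-insensitive, whole-word occurrences of a word or phrase."""
    pattern = r"\b" + re.escape(term) + r"\b"
    return len(re.findall(pattern, text, flags=re.IGNORECASE))


def filter_by_min_occurrences(documents, term, minimum=3):
    """Keep only documents that contain `term` at least `minimum` times.

    `documents` maps a URL to its extracted text (a hypothetical structure,
    not the ht://dig database format).
    """
    return {
        url: text
        for url, text in documents.items()
        if count_occurrences(text, term) >= minimum
    }
```

Note that this still reads every candidate document, which is the cost
described above; it only moves the counting out of the search engine.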

>> If there was a way we could limit results via the config file as above,
>> this should ensure the results are a little more relevant, without us
>> having to constantly fine-tune or list. Is there a better way to do this?
>
> Have you checked through the weight-related attributes to see if
> there is anything there that might help you? In particular, there
> are a lot of *_factor attributes that allow you to adjust the

Todd, you also mention using 3.2.0b4 snapshots. All of the scoring for 
3.2 is done in htsearch, so you can quickly and easily change the 
factors and adjust the rankings. No reindexing is needed. If you find 
weightings that you like better, we'd be interested to know them.
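
As a toy illustration of why search-time scoring means no reindexing:
if the index keeps per-field word counts and the factors are applied
only when the query runs, the same index can be re-ranked just by
changing the factors. The field names, counts, and the simple
sum-of-products formula below are assumptions for illustration, not
ht://dig's actual data layout or scoring formula.

```python
# Hypothetical per-document counts of a search word, split by where it occurred.
# These stand in for what an index might store; this is not the ht://dig format.
INDEX = {
    "http://example.com/a": {"title": 1, "heading": 0, "text": 2},
    "http://example.com/b": {"title": 0, "heading": 1, "text": 5},
}


def rank(index, factors):
    """Score each document as sum(count * factor) and sort best-first."""
    scores = {
        url: sum(count * factors.get(field, 0) for field, count in counts.items())
        for url, counts in index.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Same index, two sets of factors, two different orderings.
title_heavy = rank(INDEX, {"title": 100, "heading": 5, "text": 1})
text_heavy = rank(INDEX, {"title": 1, "heading": 5, "text": 10})
```

With the title-heavy factors the first document comes out on top; with
the text-heavy factors the second one does, and the index itself is
never rebuilt.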

Regards,

--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/



