On Jan 22, 2007, at 9:02 AM, William Morgan wrote:

> Excerpts from Marvin Humphrey's message of Fri Jan 19 23:48:36 -0800 2007:
>> The search-time benefit from using a stoplist can be substantial.
>> Search-time costs are dominated by time spent pawing through postings
>> for common terms.  Eliminating the most common terms can make a big
>> difference.
>
> I agree that common terms can really affect search time cost. I just
> don't think it's a problem.

Yes.  If your corpus is small enough and your machine is fast enough,
the absolute search-time costs of using an engine as efficient as
Ferret or KinoSearch aren't consequential.  As the corpus grows, you
have the option of trading away some relevance for speed (with a
stoplist, for example), or, in the case of KS, distributing the index
over multiple machines and aggregating search results.
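
To make the cost argument concrete, here is a minimal sketch in Python of
how a stoplist reduces the number of postings a query has to read.  It is a
toy inverted index, not Ferret's or KinoSearch's implementation; the names
build_index and postings_scanned, the sample documents, and the stoplist
are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict

def build_index(docs, stoplist=frozenset()):
    """Map each term to the list of doc ids containing it (its postings)."""
    index = defaultdict(list)
    for doc_id, text in enumerate(docs):
        # One posting per (term, document) pair; stoplisted terms are skipped.
        for term in sorted(set(text.lower().split())):
            if term not in stoplist:
                index[term].append(doc_id)
    return index

def postings_scanned(index, query):
    """Count postings read for a query: a rough proxy for search-time cost."""
    return sum(len(index.get(term, ())) for term in query.lower().split())

# Hypothetical four-document corpus.
docs = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "the bird sang in the tree",
    "a dog and a bird",
]
query = "the cat"

full = build_index(docs)
stopped = build_index(docs, stoplist=frozenset({"the", "a"}))

full_cost = postings_scanned(full, query)        # reads postings for "the" and "cat"
stopped_cost = postings_scanned(stopped, query)  # reads postings for "cat" only
print(full_cost, stopped_cost)
```

In this toy corpus the common term accounts for most of the postings read,
which is the effect described above.  The relevance cost is that a query
made up only of stoplisted terms would match nothing against the second
index.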

Marvin Humphrey
Rectangular Research
http://www.rectangular.com/


_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk
