On Jan 22, 2007, at 9:02 AM, William Morgan wrote: > Excerpts from Marvin Humphrey's message of Fri Jan 19 23:48:36 > -0800 2007: >> The search-time benefit from using a stoplist can be substantial. >> Search-time costs are dominated by time spent pawing through postings >> for common terms. Eliminating the most common terms can make a big >> difference. > > I agree that common terms can really affect search time cost. I just > don't think it's a problem.
Yes. If your corpus is small enough and your machine is fast enough, the absolute search-time costs of using an engine as efficient as Ferret or KinoSearch aren't consequential. As the corpus grows you have the option of trading away some relevance for speed, or, in the case of KS, distributing the index over multiple machines and aggregating search results. Marvin Humphrey Rectangular Research http://www.rectangular.com/ _______________________________________________ Ferret-talk mailing list [email protected] http://rubyforge.org/mailman/listinfo/ferret-talk

