Sebastiano Vigna wrote on 05/28/2006 10:39 PM:
> but we will certainly need
> some help to configure Lucene so that it works at its best.
>
> We would like to measure indexing time and query answer time
>
I'm not sure what form you would like that help to take, but here are a
couple high-level points imho:
1. Be sure a single jvm process is running to do all the benchmarks.
There have been many bogus lucene benchmarks created by using
separate command-line java invocations for each operation.
2. Don't use Hits-based search operators if you want anything other
than exactly 50 results (50 is, surprisingly, a magic number
hardwired into hits). It appears the paper referenced elsewhere
on this thread looked at recall and precision over a much larger
result set. Use a HitCollector with a TopDocs orTopFieldDocs to
collect the number of results you want without redoing the search
a bunch of times unnecessarily.
Chuck
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]