Those are nice speedups!

Did you use the 4.0 branch (ie trunk) or the bulkpostings branch for this test?

Mike

On Tue, Dec 21, 2010 at 9:59 PM, Li Li <fancye...@gmail.com> wrote:
> great improvement!
> I did a test in our data set. doc count is about 2M+ and index size
> after optimization is about 13.3GB(including fdt)
> it seems lucene4's index format is better than lucene2.9.3. and PFor
> give good results.
> Besides BlockEncoder for frq and pos. is there any other modification
> for lucene 4?
>
>       decoder    \ avg time     single word(ms)          and
> query(ms)     or query(ms)
>  VINT in lucene 2.9                   11.2
> 36.5                 38.6
>  VINT in lucene 4 branch           10.6
> 26.5                 35.4
>  PFor in lucene 4 branch             8.1
> 22.5                 30.7
> 2010/12/21 Li Li <fancye...@gmail.com>:
>>> OK we should have a look at that one still.  We need to converge on a
>>> good default codec for 4.0.  Fortunately it's trivial to take any int
>>> block encoder (fixed or variable block) and make a Lucene codec out of
>>> it!
>>
>> I suggests you not to use this one, I fixed dozens of bugs but it
>> still failed when with random tests. it's codes is hand coded rather
>> than generated by program. But we may learn something from it.
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to