[
https://issues.apache.org/jira/browse/LUCENE-3892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13291850#comment-13291850
]
Han Jiang commented on LUCENE-3892:
-----------------------------------
OK, here are the results I got when reproducing the benchmark with Mike's test script:
Indexing time:
trunk: 2396 sec
patch: 2793 sec
Search performance (QPS, higher is better):
{noformat}
Task            QPS Lucene40  StdDev Lucene40  QPS PFor  StdDev PFor  Pct diff
AndHighMed             22.76             0.54     14.68         1.00  -41% - -29%
SloppyPhrase            3.58             0.17      2.46         0.27  -41% - -19%
SpanNear                5.90             0.09      4.08         0.37  -38% - -23%
AndHighHigh            10.00             0.17      8.08         0.57  -26% - -11%
Phrase                  1.68             0.07      1.45         0.17  -27% -   0%
Respell                37.65             0.74     33.41         1.04  -15% -  -6%
Fuzzy1                 38.00             1.60     34.37         1.06  -15% -  -2%
IntNRQ                  4.27             0.33      3.87         0.19  -19% -   3%
Fuzzy2                 16.35             0.60     15.02         0.31  -13% -  -2%
Wildcard               30.24             0.57     28.24         1.85  -14% -   1%
PKLookup               85.82             5.04     83.25         2.81  -11% -   6%
Prefix3                19.20             0.40     19.19         1.46   -9% -   9%
OrHighMed               9.25             0.59      9.41         0.70  -11% -  16%
TermGroup1M            11.46             0.62     11.74         0.81   -9% -  15%
OrHighHigh              3.15             0.17      3.28         0.23   -8% -  17%
TermBGroup1M1P         19.28             0.38     20.32         1.14   -2% -  13%
TermBGroup1M            6.23             0.21      6.71         0.46   -3% -  19%
Term                   30.86             1.52     34.34         3.26   -4% -  28%
{noformat}
The benchmark was run on a 64-bit AMD server with Java 1.7.0.
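For readers wondering how to interpret the "Pct diff" range: the comment does not say how it is derived, but the numbers look consistent with a pessimistic-to-optimistic interval built from the mean QPS and standard deviation of each run. The sketch below works under that assumption. The function name `pct_diff_range` is hypothetical and is not part of the test script.

```python
def pct_diff_range(base_qps, base_std, cmp_qps, cmp_std):
    """Return (worst, best) percentage change of the patch relative to trunk.

    Assumption (inferred from the table, not stated in the comment):
      worst case = slowest plausible patch run vs. fastest plausible trunk run
      best case  = fastest plausible patch run vs. slowest plausible trunk run
    """
    worst = (cmp_qps - cmp_std) / (base_qps + base_std) - 1.0
    best = (cmp_qps + cmp_std) / (base_qps - base_std) - 1.0
    return 100.0 * worst, 100.0 * best


# AndHighMed row: Lucene40 22.76 +/- 0.54 QPS, PFor 14.68 +/- 1.00 QPS
worst, best = pct_diff_range(22.76, 0.54, 14.68, 1.00)
print("AndHighMed: %.1f%% to %.1f%%" % (worst, best))

# Term row: Lucene40 30.86 +/- 1.52 QPS, PFor 34.34 +/- 3.26 QPS
worst, best = pct_diff_range(30.86, 1.52, 34.34, 3.26)
print("Term: %.1f%% to %.1f%%" % (worst, best))
```

Because the table shows values rounded to two decimals, recomputing a row this way can differ from the printed range by about one percentage point. A range that straddles zero (for example Prefix3, -9% to 9%) means the difference is within the measured noise.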
> Add a useful intblock postings format (eg, FOR, PFOR, PFORDelta,
> Simple9/16/64, etc.)
> -------------------------------------------------------------------------------------
>
> Key: LUCENE-3892
> URL: https://issues.apache.org/jira/browse/LUCENE-3892
> Project: Lucene - Java
> Issue Type: Improvement
> Reporter: Michael McCandless
> Labels: gsoc2012, lucene-gsoc-12
> Fix For: 4.1
>
> Attachments: LUCENE-3892_pfor.patch, LUCENE-3892_pfor.patch,
> LUCENE-3892_settings.patch, LUCENE-3892_settings.patch
>
>
> On the flex branch we explored a number of possible intblock
> encodings, but for whatever reason never brought them to completion.
> There are still a number of issues opened with patches in different
> states.
> Initial results (based on prototype) were excellent (see
> http://blog.mikemccandless.com/2010/08/lucene-performance-with-pfordelta-codec.html
> ).
> I think this would make a good GSoC project.