[ 
https://issues.apache.org/jira/browse/LUCENE-2084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12794662#action_12794662
 ] 

Robert Muir commented on LUCENE-2084:
-------------------------------------

Steve, I ran the benchmark and it worked like a charm. Here are my numbers.
Some differences are: a very slow computer, newer JVM versions, etc.

Unpatched:

||Sun JVM||Language||java.text||ICU4J||KeywordAnalyzer||ICU4J Improvement||
|1.5.0_22 (32-bit)|English|31.61s|21.34s|10.66s|96%|
|1.5.0_22 (32-bit)|French|34.11s|22.59s|11.12s|100%|
|1.5.0_22 (32-bit)|German|34.59s|22.69s|10.84s|100%|
|1.5.0_22 (32-bit)|Ukrainian|50.72s|30.19s|11.09s|107%|
||Sun JVM||Language||java.text||ICU4J||KeywordAnalyzer||ICU4J Improvement||
|1.6.0_16 (32-bit)|English|26.42s|19.98s|10.11s|65%|
|1.6.0_16 (32-bit)|French|28.48s|20.14s|10.30s|85%|
|1.6.0_16 (32-bit)|German|28.41s|21.38s|9.78s|61%|
|1.6.0_16 (32-bit)|Ukrainian|40.55s|28.56s|9.70s|64%|

Patched:

||Sun JVM||Language||java.text||ICU4J||KeywordAnalyzer||ICU4J Improvement||
|1.5.0_22 (32-bit)|English|29.59s|19.33s|10.56s|117%|
|1.5.0_22 (32-bit)|French|32.38s|21.02s|10.88s|112%|
|1.5.0_22 (32-bit)|German|32.92s|20.58s|10.66s|124%|
|1.5.0_22 (32-bit)|Ukrainian|49.59s|27.05s|10.77s|138%|
||Sun JVM||Language||java.text||ICU4J||KeywordAnalyzer||ICU4J Improvement||
|1.6.0_16 (32-bit)|English|24.91s|18.28s|9.72s|77%|
|1.6.0_16 (32-bit)|French|26.30s|19.12s|10.25s|81%|
|1.6.0_16 (32-bit)|German|26.70s|19.23s|10.48s|85%|
|1.6.0_16 (32-bit)|Ukrainian|38.48s|25.66s|9.80s|81%|
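
For readers checking the "ICU4J Improvement" column: the figures in the tables above are consistent with subtracting the KeywordAnalyzer time as a baseline from both collators and then comparing the remainders. The sketch below assumes that formula (it is inferred from the numbers, not stated in the benchmark), and the class and method names are made up for illustration.

```java
final class ImprovementCalc {
    /**
     * Percent by which ICU4J beats java.text once the KeywordAnalyzer
     * baseline is subtracted from both timings (assumed formula).
     * All arguments are elapsed times in seconds.
     */
    public static long icu4jImprovementPercent(double javaText, double icu4j, double keyword) {
        double javaTextNet = javaText - keyword; // collation cost, java.text
        double icu4jNet = icu4j - keyword;       // collation cost, ICU4J
        return Math.round((javaTextNet / icu4jNet - 1.0) * 100.0);
    }

    public static void main(String[] args) {
        // Unpatched, 1.5.0_22, English row: 31.61s / 21.34s / 10.66s
        System.out.println(icu4jImprovementPercent(31.61, 21.34, 10.66) + "%"); // prints "96%"
    }
}
```

Subtracting the baseline isolates the time spent generating collation keys from the fixed cost of the rest of the analysis run.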

Comparison:

||Sun JVM||Language||java.text improvement||ICU4J improvement||
|1.5.0_22 (32-bit)|English|10.5%|22.2%|
|1.5.0_22 (32-bit)|French|7.4%|13.6%|
|1.5.0_22 (32-bit)|German|7.1%|19.9%|
|1.5.0_22 (32-bit)|Ukrainian|2.5%|17.8%|
||Sun JVM||Language||java.text improvement||ICU4J improvement||
|1.6.0_16 (32-bit)|English|7.8%|15.8%|
|1.6.0_16 (32-bit)|French|13.7%|11.4%|
|1.6.0_16 (32-bit)|German|15.3%|33.0%|
|1.6.0_16 (32-bit)|Ukrainian|8.0%|19.4%|


> remove Byte/CharBuffer wrapping for collation key generation
> ------------------------------------------------------------
>
>                 Key: LUCENE-2084
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2084
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1
>
>         Attachments: collation.benchmark.tar.bz2, LUCENE-2084.patch, 
> LUCENE-2084.patch, TopTFWikipediaWords.tar.bz2
>
>
> We can remove the overhead of ByteBuffer and CharBuffer wrapping in 
> CollationKeyFilter and ICUCollationKeyFilter.
> This patch moves the logic in IndexableBinaryStringTools into char[],int,int 
> and byte[],int,int based methods, with the previous Byte/CharBuffer methods 
> delegating to these.
> Previously, the Byte/CharBuffer methods required a backing array anyway.
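
The delegation pattern described in the issue can be sketched roughly as follows. This is not the code from the patch: the class name is hypothetical, and the encoding is a trivial stand-in (one byte to one char) rather than the real IndexableBinaryStringTools packing. It only shows the shape of the change, with an array-based core method and a Buffer-based overload that forwards to it.

```java
import java.nio.ByteBuffer;
import java.nio.CharBuffer;

final class BufferDelegationSketch {
    /**
     * Array-based core method: works directly on the caller's arrays,
     * so no ByteBuffer/CharBuffer wrappers need to be allocated.
     * Stand-in encoding (assumption): each byte becomes one char.
     */
    public static void encode(byte[] in, int inOffset, int inLength,
                              char[] out, int outOffset) {
        for (int i = 0; i < inLength; i++) {
            out[outOffset + i] = (char) (in[inOffset + i] & 0xFF);
        }
    }

    /**
     * Buffer-based overload kept for existing callers. It requires backing
     * arrays and simply delegates to the array-based method. Buffer
     * positions and limits are left unchanged in this sketch.
     */
    public static void encode(ByteBuffer in, CharBuffer out) {
        if (!in.hasArray() || !out.hasArray()) {
            throw new IllegalArgumentException("Arguments must have backing arrays");
        }
        encode(in.array(), in.arrayOffset() + in.position(), in.remaining(),
               out.array(), out.arrayOffset() + out.position());
    }
}
```

A filter that already holds a byte[] collation key and a char[] term buffer can call the array-based method directly and skip the wrapping step entirely.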


