[
https://issues.apache.org/jira/browse/LUCENE-2084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Steven Rowe updated LUCENE-2084:
--------------------------------
Attachment: collation.benchmark.tar.bz2
Atached collation.benchmark.tar.bz2, which contains stuff to run an
analysis-only contrib benchmark for the (ICU)CollationKeyAnalyzers over 4
languages: English, French, German, and Ukrainian.
Included are:
# For each language, a line-doc containing the most frequent 100K words from a
corresponding Wikipedia dump from November 2009;
# For each language, Java code for a no-argument analyzer callable from a
benchmark alg, that specializes (ICU)CollationKeyAnalyzer and uses
PerFieldAnalyzerWrapper to only run it over the line-doc body field
# A script to compile and jarify the above analyzers
# A benchmark alg running 5 iterations of 10 repetitions of analysis only over
the line-doc for each language
# A script to find the minimum elapsed time for each combination, and output
the results as a JIRA table
# A script to run the previous two scripts once for each of three JDK versions
# A script to compare the output of the above script before and after applying
the attached patch removing Char/ByteBuffer wrapping, and output the result as
a JIRA table
> remove Byte/CharBuffer wrapping for collation key generation
> ------------------------------------------------------------
>
> Key: LUCENE-2084
> URL: https://issues.apache.org/jira/browse/LUCENE-2084
> Project: Lucene - Java
> Issue Type: Improvement
> Components: contrib/*
> Reporter: Robert Muir
> Assignee: Robert Muir
> Priority: Minor
> Fix For: 3.1
>
> Attachments: collation.benchmark.tar.bz2, LUCENE-2084.patch,
> LUCENE-2084.patch
>
>
> We can remove the overhead of ByteBuffer and CharBuffer wrapping in
> CollationKeyFilter and ICUCollationKeyFilter.
> this patch moves the logic in IndexableBinaryStringTools into char[],int,int
> and byte[],int,int based methods, with the previous Byte/CharBuffer methods
> delegating to these.
> Previously, the Byte/CharBuffer methods required a backing array anyway.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]