[ 
https://issues.apache.org/jira/browse/LUCENE-2084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12794593#action_12794593
 ] 

Steven Rowe commented on LUCENE-2084:
-------------------------------------

bq. Do you think after this issue is resolved (whether it helps or doesn't 
help/won't fix either way) that we should open a separate issue to work on 
committing the benchmark so we have collation benchmarks for the future? Seems 
like it would be good to have in the future.

Yeah, I don't know quite how to do that - the custom code wrapping 
ICU/CollationKeyAnalyzer is necessary because the contrib benchmark alg format 
can only handle zero-argument analyzer constructors (or those that take Version 
arguments).  I think it would be useful to have a general-purpose alg syntax to 
handle this case without requiring custom code, and also, more generally, to 
allow for the construction of aribitrary analysis pipelines without requiring 
custom code (a la Solr schema).  The alg parsing code is a bit dense though - I 
think it could be converted to a JFlex-generated grammar to simplify this kind 
of syntax extension.

Can you think of an alternate way to package this benchmark that fits with 
current practice?

> remove Byte/CharBuffer wrapping for collation key generation
> ------------------------------------------------------------
>
>                 Key: LUCENE-2084
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2084
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1
>
>         Attachments: collation.benchmark.tar.bz2, LUCENE-2084.patch, 
> LUCENE-2084.patch, TopTFWikipediaWords.tar.bz2
>
>
> We can remove the overhead of ByteBuffer and CharBuffer wrapping in 
> CollationKeyFilter and ICUCollationKeyFilter.
> this patch moves the logic in IndexableBinaryStringTools into char[],int,int 
> and byte[],int,int based methods, with the previous Byte/CharBuffer methods 
> delegating to these.
> Previously, the Byte/CharBuffer methods required a backing array anyway.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to