[ 
https://issues.apache.org/jira/browse/SOLR-2211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12927067#action_12927067
 ] 

Tom Burton-West commented on SOLR-2211:
---------------------------------------

Sure, I'll give it a try.  I've got  large Monday morning backlog in my todo 
list today, so it will probably be towards the middle of the week.

> Create Solr FilterFactory for Lucene StandardTokenizer with  UAX#29 support
> ---------------------------------------------------------------------------
>
>                 Key: SOLR-2211
>                 URL: https://issues.apache.org/jira/browse/SOLR-2211
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 3.1
>            Reporter: Tom Burton-West
>            Priority: Minor
>
> The Lucene 3.x StandardTokenizer with UAX#29 support provides benefits for 
> non-English tokenizing.  Presently it can be invoked by using the 
> StandardTokenizerFactory and setting the Version to 3.1.  However, it would 
> be useful to be able to use the improved unicode processing without 
> necessarily including the ip address and email address processing of 
> StandardAnalyzer.   A FilterFactory that allowed the use of the 
> StandardTokenizer with UAX#29 support on its own would be useful.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to