[ 
https://issues.apache.org/jira/browse/LUCENE-5620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13974916#comment-13974916
 ] 

Ahmet Arslan commented on LUCENE-5620:
--------------------------------------

bq. but other than that I don't know which other filters would be likely 
targets for this kind of enhancement.
Please see : https://issues.apache.org/jira/browse/SOLR-5332

bq. Its too complicated for all of our analyzers to have to have 
preserveOriginal options.
I agree with this. Any token filter that modifies its input could be a 
candidate. e.g. TurkishLowerCaseFilter.

How about making these candidate filters (may be all) respect to 
KeywordAttribute. And simulating preserve original behavior with 
{{KeywordRepeatFilter}}?

{code:xml}
<filter class="solr.KeywordRepeatFilterFactory"/>
<filter class="solr.LowerCaseFilterFactory"/>
<filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
{code}


> LowerCaseFilter.preserveOriginal
> --------------------------------
>
>                 Key: LUCENE-5620
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5620
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Mike Sokolov
>         Attachments: LUCENE-5620.patch
>
>
> Following closely the model of LUCENE-5437 (which worked on 
> ASCIIFoldingFilter), this patch adds the ability to preserve the original 
> token to LowerCaseFilter.  This is useful if you want an all-lowercase search 
> term to match without regard to case, while search terms with uppercase 
> letters match in a case-sensitive manner. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to