[
https://issues.apache.org/jira/browse/LUCENE-5369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ryan McKinley updated LUCENE-5369:
----------------------------------
Attachment: LUCENE-5369-uppercase-filter.patch
Here is a patch that adds UpperCaseFilter
There are a few others out there:
http://svn.apache.org/repos/asf/uima/addons/trunk/Lucas/src/main/java/org/apache/uima/lucas/indexer/analysis/UpperCaseFilter.java
https://github.ugent.be/Universiteitsbibliotheek/lludss-solr-java/blob/master/src/main/java/lludss/solr/analysis/UpperCaseFilter.java
--------
Given that we would want to steer people to LowerCase, perhaps this should be
in a different package
I'll wait for +1 from someone who knows more about this than me :)
> Add an UpperCaseFilter
> ----------------------
>
> Key: LUCENE-5369
> URL: https://issues.apache.org/jira/browse/LUCENE-5369
> Project: Lucene - Core
> Issue Type: New Feature
> Reporter: Ryan McKinley
> Assignee: Ryan McKinley
> Priority: Minor
> Attachments: LUCENE-5369-uppercase-filter.patch
>
>
> We should offer a standard way to force upper-case tokens. I understand that
> lowercase is safer for general search quality because some uppercase
> characters can represent multiple lowercase ones.
> However, having upper-case tokens is often nice for faceting (consider
> normalizing to standard acronyms)
--
This message was sent by Atlassian JIRA
(v6.1.4#6159)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]