[ 
https://issues.apache.org/jira/browse/SOLR-248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-248:
-------------------------------

    Attachment: SOLR-248-CapitalizationFilter.patch

Implementation and test...

<filter class="solr.CapitalizationFilterFactory" onlyFirstWord="false" 
keep="and or the is my or de" maxTokenLength="40" maxWordCount="4" 
okPrefix="McK" forceFirstLetter="true" />

onlyFirstWord="false" -- this capatalizes every word

keep="and or the is my or de" -- don't change capitalization for these words

forceFirstLetter="true" -- capitalize the first letter of the Token (not word) 
even if it is in the "keep" list

maxTokenLength="40" -- if the token is longer then 40 chars, don't even try to 
capitalize it

maxWordCount="4" -- if there are more then 4 words, don't try capitalizing


> Capitalization Filter Factory
> -----------------------------
>
>                 Key: SOLR-248
>                 URL: https://issues.apache.org/jira/browse/SOLR-248
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-248-CapitalizationFilter.patch
>
>
> For tokens that are used in faceting, it is nice to have standard 
> capitalization.  
> I want "Aerial views" and "Aerial Views" to both be: "Aerial Views"

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to