[ 
https://issues.apache.org/jira/browse/SOLR-248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12498711
 ] 

Ryan McKinley commented on SOLR-248:
------------------------------------

It is a little strange, but (in my case anyway) i think it makes sense...  

I am indexing a bunch of metadata from a bunch of libraries (OAI-PMH) -- I want 
to display the data exactly as it came from the source, but for faceted 
browsing I need to normalize capitalization.

Implemented at the indexing level, I can have different values for the stored 
value and indexed terms.  Also, at the indexing level I can leverage existing 
Tokenizers and Filters to build the tokens that need capitalization -- it keeps 
all the configuration in schema.xml and lets the OAI -> solr xml be a simple 
transformation, this way whoever takes care of this need only learn solr 
configuration, not ryan+solr configuration. 

If it is not generally useful I can keep it elsewhere - that is why we have the 
nice plugin framework!



> Capitalization Filter Factory
> -----------------------------
>
>                 Key: SOLR-248
>                 URL: https://issues.apache.org/jira/browse/SOLR-248
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Ryan McKinley
>            Priority: Minor
>         Attachments: SOLR-248-CapitalizationFilter.patch
>
>
> For tokens that are used in faceting, it is nice to have standard 
> capitalization.  
> I want "Aerial views" and "Aerial Views" to both be: "Aerial Views"

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to