[ https://issues.apache.org/jira/browse/SOLR-257?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502546 ]
Mike Klaas commented on SOLR-257:
---------------------------------

The difference from generateWordParts is as follows: gWP splits adjacent tokens to see if they are both alpha regardless of how they are delimited. So if gWP=0, then PowerShot=power-shot=Power-Shot=powershot. If gWP=1 and ignoreCaseChange=1, then PowerShot=powershot, but Power-Shot=power-shot=power shot.

For us, case changes were too "weak" a delimiter, and high idf subwords were inappropriately driving up relevancy on certain docs.

> Add ability for WordDelimiterFilter to ignore case changes
> -----------------------------------------------------------
>
>                 Key: SOLR-257
>                 URL: https://issues.apache.org/jira/browse/SOLR-257
>             Project: Solr
>          Issue Type: Improvement
>          Components: update
>    Affects Versions: 1.2
>            Reporter: Mike Klaas
>         Assigned To: Mike Klaas
>            Priority: Trivial
>             Fix For: 1.3
>
>         Attachments: ignoreCaseChange.patch
>
>
> patch adds ignoreCaseChange option to WordDelimiterFilter, which I have used
> and it may be more generally useful
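
The equivalences in the comment for the gWP=1 case can be illustrated with a toy model. This is only a sketch and not the WordDelimiterFilter code or the attached patch: split_word and its ignore_case_change parameter are hypothetical names, and the model assumes that word parts are always generated, that any non-alphanumeric character is a delimiter, and that tokens are lowercased afterwards.

```python
import re


def split_word(token, ignore_case_change=False):
    """Toy model of word-part splitting (hypothetical helper, not Solr code).

    Always splits on runs of non-alphanumeric characters. Unless
    ignore_case_change is set, also splits at lower-to-upper case transitions.
    """
    # Explicit delimiters (hyphen, space, ...) always separate parts.
    parts = [p for p in re.split(r"[^A-Za-z0-9]+", token) if p]
    if not ignore_case_change:
        # Treat a lower->upper transition as a delimiter: "PowerShot" -> "Power", "Shot".
        parts = [
            sub
            for p in parts
            for sub in re.sub(r"(?<=[a-z])(?=[A-Z])", " ", p).split()
        ]
    # Lowercase so that tokens differing only in case compare equal.
    return [p.lower() for p in parts]


if __name__ == "__main__":
    for text in ("PowerShot", "Power-Shot", "power-shot", "power shot"):
        print(text, split_word(text), split_word(text, ignore_case_change=True))
```

Under these assumptions, ignore_case_change=True keeps PowerShot as the single token powershot, while Power-Shot, power-shot and power shot still split into power and shot, which is the grouping the comment describes.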