HTMLStripCharFilterFactory behavior is different in Solr4 than it was in Solr 3.x ---------------------------------------------------------------------------------
Key: LUCENE-3721 URL: https://issues.apache.org/jira/browse/LUCENE-3721 Project: Lucene - Java Issue Type: Bug Reporter: Mike Hugo Attachments: htmlstripfilter_test.patch In Solr3, using the attached configuration, HTML entities like trademark and registered were being stripped (and NOT indexed) using the HTMLStripCharFilterFactory. In Solr4 it looks like those values are still making it through to the index and are then appearing in faceted results (we'd like them not to) see http://lucene.472066.n3.nabble.com/HTMLStripCharFilterFactory-not-working-in-Solr4-td3685599.html for background possibly related to this https://issues.apache.org/jira/browse/LUCENE-3690 -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org