[ https://issues.apache.org/jira/browse/SOLR-3463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13276856#comment-13276856 ]
Steven Rowe commented on SOLR-3463: ----------------------------------- +1 Tanguy, can you add a couple more tests? You should demonstrate that the deletion of repeated characters still works (with letter chars). Also, since there are two repetition removal operations in the code, a test specific to each would be useful. > FrenchLightStemmer performs abusive compression of (arbitrary) repeated > characters in long tokens > ------------------------------------------------------------------------------------------------- > > Key: SOLR-3463 > URL: https://issues.apache.org/jira/browse/SOLR-3463 > Project: Solr > Issue Type: Improvement > Components: Schema and Analysis > Affects Versions: 3.4 > Reporter: Tanguy Moal > Priority: Minor > Attachments: SOLR-3463.patch > > > FrenchLightStemmer performs aggressive deletions on repeated character > sequences, even on numbers. > This might be unexpected during full text search. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org