EnglishPossessiveFilter should work with Unicode right single quotation mark
----------------------------------------------------------------------------
Key: LUCENE-3748
URL: https://issues.apache.org/jira/browse/LUCENE-3748
Project: Lucene - Java
Issue Type: Improvement
Components: modules/analysis
Affects Versions: 3.5, 3.4, 3.2, 3.1
Reporter: David Croley
Priority: Minor
Attachments: LucenePatch
The current EnglishPossessiveFilter (used in EnglishAnalyzer) removes
possessives using only the '\'' character (plus 's' or 'S'), but some common
systems (German?) insert the Unicode "\u2019" (RIGHT SINGLE QUOTATION MARK)
instead and this is not removed when processing UTF-8 text. I propose to change
EnglishPossesiveFilter to support '\u2019' as an alternative to '\''.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]