EnglishPossessiveFilter should work with Unicode right single quotation mark
----------------------------------------------------------------------------

                 Key: LUCENE-3748
                 URL: https://issues.apache.org/jira/browse/LUCENE-3748
             Project: Lucene - Java
          Issue Type: Improvement
          Components: modules/analysis
    Affects Versions: 3.5, 3.4, 3.2, 3.1
            Reporter: David Croley
            Priority: Minor
         Attachments: LucenePatch

The current EnglishPossessiveFilter (used in EnglishAnalyzer) removes 
possessives using only the '\'' character (plus 's' or 'S'), but some common 
systems (German?) insert the Unicode "\u2019" (RIGHT SINGLE QUOTATION MARK) 
instead and this is not removed when processing UTF-8 text. I propose to change 
EnglishPossesiveFilter to support '\u2019' as an alternative to '\''.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to