Martin Wiesner created OPENNLP-1584:
---------------------------------------

             Summary: FeatureGeneratorUtil shall detect German umlauts with dot 
as 'cp'
                 Key: OPENNLP-1584
                 URL: https://issues.apache.org/jira/browse/OPENNLP-1584
             Project: OpenNLP
          Issue Type: Improvement
          Components: Name Finder
    Affects Versions: 2.3.3
            Reporter: Martin Wiesner
            Assignee: Martin Wiesner


German names, such as Änne, Özlem, or Ümit, should be recognized in their 
abbreviated short form (Ä., Ü., Ö.) by the FeatureGeneratorUtil class. 

Atm, recognition fails, as the Pattern "capPeriod" only takes regular, 
capitalized letters into account. This can be fixed easily.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to