[ https://issues.apache.org/jira/browse/OPENNLP-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Martin Wiesner closed OPENNLP-1266. ----------------------------------- Fix Version/s: 1.9.4 Resolution: Fixed > Limit normalization regexes in UrlCharSequenceNormalizer > -------------------------------------------------------- > > Key: OPENNLP-1266 > URL: https://issues.apache.org/jira/browse/OPENNLP-1266 > Project: OpenNLP > Issue Type: Task > Reporter: Tim Allison > Assignee: Tim Allison > Priority: Major > Fix For: 1.9.4 > > > The {{MAIL_REGEX}} in UrlCharSequenceNormalizer is unbounded and requires > backtracking. In rare cases, this can cause eye-opening performance costs. > > I tested the other regexes in the other normalizers. I could be wrong, but > they don't appear to require backtracking, and there are no surprising > performance costs. -- This message was sent by Atlassian Jira (v8.20.10#820010)