[ https://issues.apache.org/jira/browse/NIFI-4095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061120#comment-16061120 ]
ASF GitHub Bot commented on NIFI-4095:
--------------------------------------

Github user alopresto commented on the issue:

    https://github.com/apache/nifi/pull/1931

    @pvillard31 merged this to master with [commit 253ea2e](https://github.com/apache/nifi/commit/253ea2e73bd271e82dcfd6c706f679ddad014101) but it did not include the magic words, so I am manually closing this PR.


> ExtractText should not require a capture group in every regular expression
> --------------------------------------------------------------------------
>
>                 Key: NIFI-4095
>                 URL: https://issues.apache.org/jira/browse/NIFI-4095
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>    Affects Versions: 1.3.0
>            Reporter: Andy LoPresto
>            Assignee: Andy LoPresto
>              Labels: extracttext, regular_expression, validation
>             Fix For: 1.4.0
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> The {{ExtractText}} processor currently validates every regular expression
> and requires that it contain "between 1 and 40 capture groups". This seems to
> be a design decision, as the values are hardcoded into the
> [validator|https://github.com/apache/nifi/blob/master/nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ExtractText.java#L262-L262],
> but there are valid regular expressions that do not need an explicit capture
> group (especially when the expression is small and the full expression is the
> desired match). This results in unnecessary duplicate matches ("some_attr"
> and "some_attr.1" being identical).
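
To illustrate the duplicate-match behaviour described in the issue, here is a minimal sketch using only java.util.regex. It is not the ExtractText implementation: the class CaptureGroupDemo, the helper extract(), and the sample input are hypothetical, and the attribute naming ("some_attr", "some_attr.1") only mimics the names quoted in the description. It shows that an expression with no explicit capture group still yields the full match as group 0, and that wrapping the whole expression in parentheses just to satisfy a "1 to 40 capture groups" rule produces a group 1 identical to group 0.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class CaptureGroupDemo {

    // Hypothetical helper, NOT NiFi code: stores the full match (group 0) under
    // the base attribute name and each explicit capture group under "<attr>.<n>".
    static Map<String, String> extract(String attr, String regex, String input) {
        Map<String, String> out = new LinkedHashMap<>();
        Matcher m = Pattern.compile(regex).matcher(input);
        if (m.find()) {
            out.put(attr, m.group(0));
            for (int i = 1; i <= m.groupCount(); i++) {
                out.put(attr + "." + i, m.group(i));
            }
        }
        return out;
    }

    // Number of explicit capture groups in a pattern (group 0 is not counted).
    static int groupCount(String regex) {
        return Pattern.compile(regex).matcher("").groupCount();
    }

    public static void main(String[] args) {
        String input = "order 12345 shipped"; // sample input, assumed for illustration

        // No explicit group: groupCount() is 0, yet the full match is available.
        System.out.println(groupCount("\\d+") + " " + extract("some_attr", "\\d+", input));

        // Parentheses added only to pass a "1..40 groups" check: group 1 duplicates group 0.
        System.out.println(groupCount("(\\d+)") + " " + extract("some_attr", "(\\d+)", input));
    }
}
```

Under these assumptions the first call yields a single entry, while the second yields two entries with the same value, which is the redundancy the issue asks to avoid.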