[ 
https://issues.apache.org/jira/browse/NIFI-4095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061120#comment-16061120
 ] 

ASF GitHub Bot commented on NIFI-4095:
--------------------------------------

Github user alopresto commented on the issue:

    https://github.com/apache/nifi/pull/1931
  
    @pvillard31 merged this to master with [commit 
253ea2e](https://github.com/apache/nifi/commit/253ea2e73bd271e82dcfd6c706f679ddad014101)
 but it did not include the magic words, so I am manually closing this PR. 


> ExtractText should not require a capture group in every regular expression
> --------------------------------------------------------------------------
>
>                 Key: NIFI-4095
>                 URL: https://issues.apache.org/jira/browse/NIFI-4095
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>    Affects Versions: 1.3.0
>            Reporter: Andy LoPresto
>            Assignee: Andy LoPresto
>              Labels: extracttext, regular_expression, validation
>             Fix For: 1.4.0
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> The {{ExtractText}} processor currently validates every regular expression 
> and requires that it contain "between 1 and 40 capture groups". This seems to 
> be a design decision, as the values are hardcoded into the 
> [validator|https://github.com/apache/nifi/blob/master/nifi-nar-bundles/nifi-standard-bundle/nifi-standard-processors/src/main/java/org/apache/nifi/processors/standard/ExtractText.java#L262-L262],
>  but there are valid regular expressions that do not need an explicit capture 
> group (especially when the expression is small and the full expression is the 
> desired match). This results in unnecessary duplicate matches ("some_attr" 
> and "some_attr.1" being identical). 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to