[ 
https://issues.apache.org/jira/browse/NIFI-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mark Payne updated NIFI-911:
----------------------------
    Description: 
Currently, the default value for the Regular Expression property is (.\*) which 
is intended to match any text. However, due to the way that the regular 
expression is interpreted, it first matches the entire content of the FlowFile 
and then matches again, this time 0 characters at the end of the content 
(because .* matches zero or more of any character).

This results in the replacement value being added to the FlowFile twice. For 
example, if the replacement value is "Hello World", the output FlowFile will 
contain the text "Hello WorldHello World".

We should instead change the regex pattern to (^.*$) in order to ensure that 
we do not duplicate the output.

  was:
Currently, the default value for the Regular Expression property is (.*) which 
is intended to match any text. However, due to the way that the regular 
expression is interpreted, it first matches the entire content of the FlowFile 
and then matches again, this time 0 characters at the end of the content 
(because .* matches zero or more of any character).

This results in the replacement value being added to the FlowFile twice. For 
example, if the replacement value is "Hello World", the output FlowFile will 
contain the text "Hello WorldHello World".

We should instead change the regex pattern to (^.*$) in order to ensure that 
we do not duplicate the output.


> ReplaceText, with default configuration, will cause the replacement to be 
> added twice
> -------------------------------------------------------------------------------------
>
>                 Key: NIFI-911
>                 URL: https://issues.apache.org/jira/browse/NIFI-911
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>            Reporter: Mark Payne
>              Labels: beginner, newbie
>             Fix For: 0.3.0
>
>
> Currently, the default value for the Regular Expression property is (.\*) 
> which is intended to match any text. However, due to the way that the regular 
> expression is interpreted, it first matches the entire content of the 
> FlowFile and then matches again, this time 0 characters at the end of the 
> content (because .* matches zero or more of any character).
> This results in the replacement value being added to the FlowFile twice. For 
> example, if the replacement value is "Hello World", the output FlowFile will 
> contain the text "Hello WorldHello World".
> We should instead change the regex pattern to (^.*$) in order to ensure 
> that we do not duplicate the output.
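
The behavior described above can be reproduced outside NiFi with plain java.util.regex. The following is only a minimal sketch: it assumes ReplaceText applies the pattern with replace-all semantics, and the class and helper names (ReplaceTextRegexDemo, replace, countMatches) are hypothetical and not taken from the NiFi code base.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class ReplaceTextRegexDemo {

    // Hypothetical helper (not NiFi code): replace every match of regex in content.
    static String replace(String regex, String content, String replacement) {
        return Pattern.compile(regex)
                .matcher(content)
                .replaceAll(Matcher.quoteReplacement(replacement));
    }

    // Hypothetical helper: count how many times the pattern matches, empty matches included.
    static int countMatches(String regex, String content) {
        Matcher m = Pattern.compile(regex).matcher(content);
        int count = 0;
        while (m.find()) {
            count++;
        }
        return count;
    }

    public static void main(String[] args) {
        String content = "some content"; // stands in for single-line FlowFile content

        // (.*) matches the whole content, then an empty string at the end.
        System.out.println(countMatches("(.*)", content));           // prints 2
        System.out.println(replace("(.*)", content, "Hello World")); // prints Hello WorldHello World

        // (^.*$) cannot match again at the end, because ^ only matches at the start.
        System.out.println(countMatches("(^.*$)", content));           // prints 1
        System.out.println(replace("(^.*$)", content, "Hello World")); // prints Hello World
    }
}
```

The sketch only covers single-line content, which is the case the example in the description uses.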



