[ https://issues.apache.org/jira/browse/NIFI-10869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Luigi De Giovanni updated NIFI-10869: ------------------------------------- Priority: Minor (was: Major) > ExtractText processor - RegEx captured twice > -------------------------------------------- > > Key: NIFI-10869 > URL: https://issues.apache.org/jira/browse/NIFI-10869 > Project: Apache NiFi > Issue Type: Improvement > Affects Versions: 1.18.0 > Reporter: Luigi De Giovanni > Priority: Minor > Attachments: image (1).png, image (2).png, image.png > > > Hi, > There is an issue with the ExtractText processor, in the attempt of creating > FlowFile attributes from FlowFile contents. > When creating a custom property with a regex value that does not contain > named groups, the value captured in the group is added as 2 different > attributes. > E.G. > ||Property Name||Property Value||Expected FlowFile Attribute||Actual outcome|| > |param.alpha|<regex for alpha>|param.alpha=alpha|param.alpha=alpha > param.alpha.1=alpha| > Reading the documentation, this might even be an expected behaviour, but if > so, it is preferable to have the captured value only as a single attribute, > without duplication. > Please see the attachment for an example. > -- This message was sent by Atlassian Jira (v8.20.10#820010)