Dear All
I need an information about Flow Files Attribute of Extract Text Processor. 
My flow is that;

1. Getting IIS Log files from Azure Blob Storage 
2. Splitting each IIS Log File to line by line with Split Text Processor. 
2.1. Line Split Count:1
2.2. Maximum Fragment Size: No value set
2.3. Header Line Count: 0
2.4. Header Line Marker Characters: No value set
2.5. Remove Trailing Newlines: True
3. Transferring new flow files which is produced by Split Text Processor to
Extract Text Processor. 
3.1. All Properties are Default
3.2. I added one RegEx in the Properties. I would like to carry on Flow
Files attributes to Syslog
3.2.1. Property Name: msg 
3.2.2. Value: (.*). 
4. Transferring all flow files where is coming from Extract Text to Put
Syslog Processor. 
4.1. All Properties are Default or configured properly for requirements
(such as IP address of the Syslog, port etc.) 
4.2. Message Body: IISHttp${msg}

When I check Flow Files Attribute from Data Provenance in the Extract Text
Processor, I see 3 attributes same each other. 
Msg: 2020-06-24 13:33:49 XXXX GET /Test/Service/test.css
YYYY 200 0 0 852 7005 921
Msg.1: 2020-06-24 13:33:49 XXXX GET /Test/Service/test.css
YYYY 200 0 0 852 7005 921
Msg.2: 2020-06-24 13:33:49 XXXX GET /Test/Service/test.css
YYYY 200 0 0 852 7005 921

How can I remove duplicate attributes from extract text output? Or I need to
use another way?
Do you have any comment or suggestion?

My environment details are below:
Apache NiFi 1.11.3
Windows Server 2016
Java JRE 1.8.0_241 (64 Bit)



--
Sent from: http://apache-nifi-users-list.2361937.n4.nabble.com/

Reply via email to