RE: ExtractText usage

2015-09-09 Thread Mark Payne
All, I did create a ticket for this: https://issues.apache.org/jira/browse/NIFI-942 And I linked it as related to NIFI-921. Thanks -Mark > From: aldrinp...@gmail.com > Date: Wed, 9 Sep 2015 03:00:18 + > Subject: Re: ExtractText usage >

RE: ExtractText usage

2015-09-09 Thread Mark Payne
ate: Wed, 9 Sep 2015 08:55:44 -0400 > Subject: Re: ExtractText usage > From: wilson...@gmail.com > To: users@nifi.apache.org > > Bryan, thank you for the template, I'll look through that today and see > if that will do the trick. The multi-line regex for capturing all

Re: ExtractText usage

2015-09-09 Thread Ryan Ward
t the impact of multiple FlowFiles but is there an >>> > accumulator that will allow me to take N lines and accumulate them into >>> > a single flow file? >>> > >>> > -Chris >>> > >>> > >>> > On Tue, Sep 8,

Re: ExtractText usage

2015-09-09 Thread Christopher Wilson
perty named "Begins with R" and set the value >> to : "R >> Then set the Match Strategy to Starts With >> And Routing Strategy to "Route each line to matching Property Name" >> >> Then, any line that begins with "R will be route

Re: ExtractText usage

2015-09-08 Thread Aldrin Piri
oute each line to matching Property Name" > > Then, any line that begins with "R will be routed to the Begins with R > relationship. > This would be a simple way to pull out any particular lines of interest in > a text file. > > I can see this being very us

RE: ExtractText usage

2015-09-08 Thread Mark Payne
"R will be routed to the Begins with R relationship. This would be a simple way to pull out any particular lines of interest in a text file. I can see this being very useful for processing log files, CSV, etc. ________ > Date: Tue, 8 Sep 2015 17:02:54 -0400 >
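
The "Starts With" routing described in this message can be pictured with a small Python sketch. It is only an illustration of the idea, not NiFi code: the function `route_lines`, the `unmatched` bucket name, and the sample data are all hypothetical, and the real processor's behaviour may differ.

```python
from collections import defaultdict


def route_lines(content, rules, unmatched="unmatched"):
    """Group lines by the first rule whose prefix they start with.

    `rules` maps a relationship name (hypothetical, e.g. "Begins with R")
    to a literal prefix. Lines that match no rule go to `unmatched`.
    """
    routed = defaultdict(list)
    for line in content.splitlines():
        for name, prefix in rules.items():
            if line.startswith(prefix):
                routed[name].append(line)
                break
        else:
            # No rule matched this line.
            routed[unmatched].append(line)
    return dict(routed)


if __name__ == "__main__":
    # Made-up CSV content with the record type in the first column.
    sample = '"R",1\n"S",2\n"R",3\n'
    print(route_lines(sample, {"Begins with R": '"R'}))
```

Each line is evaluated on its own, which is the property the thread is after: no single regex has to cover the whole file.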

Re: ExtractText usage

2015-09-08 Thread Bryan Bende
length up front - but I can see plenty of cases >>> where that's not going to be the case. >>> >>> -Chris >>> >>> On Tue, Sep 8, 2015 at 2:05 PM, Mark Payne wrote: >>> >>>> Agreed. Bryan's suggestion will give you the ability

Re: ExtractText usage

2015-09-08 Thread Christopher Wilson
e wrote: >> >>> Agreed. Bryan's suggestion will give you the ability to match each line >>> against the regex, >>> rather than trying to match the entire file. It would result in a new >>> FlowFile for each line of >>> text, though, as

Re: ExtractText usage

2015-09-08 Thread Bryan Bende
ng to match the entire file. It would result in a new >> FlowFile for each line of >> text, though, as he said. But if you need to rebuild a single file, those >> could potentially be >> merged together using a MergeContent processor, as well. >> >> ___

Re: ExtractText usage

2015-09-08 Thread Christopher Wilson
a single file, those > could potentially be > merged together using a MergeContent processor, as well. > > ________ > > Date: Tue, 8 Sep 2015 13:03:08 -0400 > > Subject: Re: ExtractText usage > > From: bbe...@gmail.com > > To: users@nifi.apache

RE: ExtractText usage

2015-09-08 Thread Mark Payne
erged together using a MergeContent processor, as well. > Date: Tue, 8 Sep 2015 13:03:08 -0400 > Subject: Re: ExtractText usage > From: bbe...@gmail.com > To: users@nifi.apache.org > > Chris, > > I think the issue is that ExtractText is

RE: ExtractText usage

2015-09-08 Thread Robinson, Richard A CTR USSOCOM HQ
something like (?m)^(\"R.*)$ robi -Original Message- From: Bryan Bende [mailto:bbe...@gmail.com] Sent: Tuesday, September 08, 2015 1:03 PM To: users@nifi.apache.org Subject: Re: ExtractText usage Chris, I think the issue is that ExtractText is not reading the file line by line
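
As a rough check of the suggested pattern, the sketch below runs `(?m)^("R.*)$` over some made-up CSV content. It uses Python's `re` module rather than the Java regex engine NiFi runs on; for a pattern this simple the two should behave the same way, but that is an assumption, and the sample data and function name are hypothetical.

```python
import re

# Hypothetical sample content; record types and values are made up.
SAMPLE = '"R",1,alpha\n"S",2,beta\n"R",3,gamma\n"T",4,delta\n'

# (?m) enables multi-line mode, so ^ and $ match at every line boundary
# instead of only at the start and end of the whole input.
PATTERN = re.compile(r'(?m)^("R.*)$')


def lines_starting_with_r(content):
    """Return every line of `content` that begins with "R (quote included)."""
    return PATTERN.findall(content)


if __name__ == "__main__":
    for line in lines_starting_with_r(SAMPLE):
        print(line)
```

Without the `(?m)` flag, `^` would anchor only at the very start of the content, so at most the first line could match.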

Re: ExtractText usage

2015-09-08 Thread Bryan Bende
Chris, I think the issue is that ExtractText is not reading the file line by line, and then applying your pattern to each line. It is applying the pattern to the whole content of the file so you would need a regex that repeated the pattern you were looking for so that it captured multiple times.
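
The difference between applying a pattern once to the whole content and applying it to each line can be sketched in Python as follows. This is only an analogy for the behaviour described above, not ExtractText's implementation; the function names, the pattern, and the sample data are assumptions made for illustration.

```python
import re

# Hypothetical sample content; record types and values are made up.
SAMPLE = '"R",1,alpha\n"S",2,beta\n"R",3,gamma\n'

# A simple capture: "R followed by the rest of the line.
PATTERN = r'("R.*)'


def match_whole_content(pattern, content):
    """Apply the pattern once to the entire content as a single string."""
    m = re.search(pattern, content)
    return [m.group(1)] if m else []


def match_each_line(pattern, content):
    """Split the content into lines and apply the pattern to each one."""
    hits = []
    for line in content.splitlines():
        m = re.search(pattern, line)
        if m:
            hits.append(m.group(1))
    return hits


if __name__ == "__main__":
    print(match_whole_content(PATTERN, SAMPLE))  # a single capture
    print(match_each_line(PATTERN, SAMPLE))      # one capture per matching line
```

A single application captures only the first occurrence, which is why the pattern would have to repeat to capture more; the per-line version finds every matching line.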

ExtractText usage

2015-09-08 Thread Christopher Wilson
I'm trying to read a directory of .csv files which have 3 different schemas/list types (not my idea). The descriptor is in the first column of the csv file. I'm reading the files in using GetFile and passing them into ExtractText, but I'm only getting the first 3 (of 8) lines matching my first re