Hello again NiFi folks,
I did not get a direct reply to my email below. However, I've since
noticed in the mailing list archive that some of you have kindly
replied, although the emails did not make it to my inbox!
I wasn't subscribed to the mailing list at the time (I am now), which I
guess is why I did not get the responses. It still seems a bit weird though... (*).
Anyway, could someone reply to the thread and include my email so I can
answer each of your comments while keeping the threading 'clean'?
Thanks!
François
*: Maybe something the admins should look into, as some people might
fire off an email to the list, see no answers and assume no one replied
to them!
On 19/10/2016 11:10, François Prunier wrote:
Hello NiFi folks,
I've built a processor that parses CSV files with headers and turns each
line into a flowfile. Each resulting flowfile has one attribute per
column: the attribute name is the column name, and its value is the
corresponding value for that line.
For example, this CSV file:
    col1,col2,col3
    a,b,c
    d,e,f
would generate two flowfiles with the following attributes:
    col1 = a
    col2 = b
    col3 = c
and
    col1 = d
    col2 = e
    col3 = f
As of now, you can configure the charset plus delimiter, quote and
escape character. It's based on the commons-csv parser.
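To make the mapping concrete, here is a rough sketch of the same idea in Python using only the standard csv module. It is not the processor's code (which is Java and based on commons-csv); the function name csv_to_attribute_maps and its parameters are made up for illustration, and each returned dict simply stands in for the attributes of one flowfile.

```python
import csv
import io


def csv_to_attribute_maps(data, charset="utf-8", delimiter=",",
                          quotechar='"', escapechar=None):
    """Return one dict (column name -> value) per data row.

    Hypothetical helper: each dict stands in for the attributes
    that one flowfile would carry.
    """
    # Decode the raw bytes with the configured charset.
    text = data.decode(charset)
    # DictReader uses the first line as the header (column names).
    reader = csv.DictReader(
        io.StringIO(text, newline=""),
        delimiter=delimiter,
        quotechar=quotechar,
        escapechar=escapechar,
    )
    return [dict(row) for row in reader]


sample = b"col1,col2,col3\na,b,c\nd,e,f\n"
for attributes in csv_to_attribute_maps(sample):
    print(attributes)
```

With the sample file above, this yields two dicts, one per line, matching the two sets of attributes listed earlier.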
It's very handy if you want to, for example, index a CSV file into
elasticsearch.
Would you guys be interested in a pull request to add this processor
to the main code base? It still needs a bit more documentation and
cleanup, which I would add, but it is already used successfully in
production.
Best regards,
--
François Prunier
Hurence - Vos experts Big Data
http://www.hurence.com
*mobile:* +33 6 38 68 60 50