[jira] [Commented] (NIFI-3503) Create a 'SplitCSV' processor

Wesley L Lawrence (JIRA) Thu, 29 Jun 2017 08:28:40 -0700

    [ https://issues.apache.org/jira/browse/NIFI-3503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16068480#comment-16068480 ]


Wesley L Lawrence commented on NIFI-3503:
-----------------------------------------

I've been using the SplitRecord processor, and I also think it's sufficient.

> Create a 'SplitCSV' processor
> -----------------------------
>
>                 Key: NIFI-3503
>                 URL: https://issues.apache.org/jira/browse/NIFI-3503
>             Project: Apache NiFi
>          Issue Type: New Feature
>            Reporter: Wesley L Lawrence
>            Priority: Minor
>
> While the 'SplitText' processor helps break up newline-separated records into
> individual files, it's not uncommon to have CSV files where records span
> multiple lines, and 'SplitText' is neither able nor meant to handle this.
> Currently, one can replace, remove, or escape newline characters that exist
> within a single CSV record by searching within quoted columns with 'ReplaceText',
> before passing the data on to 'SplitText'. However, this may not work in all
> cases, or could remove the valid newline character at the end of
> a CSV record if the regex doesn't cover every edge case.
> Having a dedicated 'SplitCSV' processor would solve this problem and be a
> simpler approach for users.
> See the following [Apache NiFi user email 
> thread|https://mail-archives.apache.org/mod_mbox/nifi-users/201702.mbox/%3CCAFuL2BbgymFXwu5fRyd8pP-zu6WkToqPE2Ek7bkyBg0_-cknqQ%40mail.gmail.com%3E]
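
To illustrate the problem described in the issue, here is a minimal sketch in Python (not NiFi code) that contrasts a naive newline split with a quote-aware CSV parse. The sample data and the function names split_by_line and split_by_record are hypothetical, chosen only for this example; it uses the standard library's csv module and does not reflect how SplitText or SplitRecord are implemented.

```python
import csv
import io

# Hypothetical sample: a header plus two records, where the first
# record has a newline inside a quoted field.
SAMPLE = 'id,comment\n1,"first line\nsecond line"\n2,plain\n'


def split_by_line(text):
    """Naive split on newlines, ignoring CSV quoting."""
    return text.splitlines()


def split_by_record(text):
    """Quote-aware split: each element is one parsed CSV record."""
    # newline="" leaves line endings untouched so the csv module
    # can handle embedded newlines itself.
    return list(csv.reader(io.StringIO(text, newline="")))


if __name__ == "__main__":
    print(len(split_by_line(SAMPLE)), "lines")
    print(len(split_by_record(SAMPLE)), "records")
```

The line-based split cuts the quoted field in two, while the CSV-aware parse keeps it as a single record, which is the behavior the issue asks for.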



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
