[ https://issues.apache.org/jira/browse/NIFI-3503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16068480#comment-16068480 ]
Wesley L Lawrence commented on NIFI-3503: ----------------------------------------- I've been using the SplitRecord processor, and I also think it's sufficient. > Create a 'SplitCSV' processor > ----------------------------- > > Key: NIFI-3503 > URL: https://issues.apache.org/jira/browse/NIFI-3503 > Project: Apache NiFi > Issue Type: New Feature > Reporter: Wesley L Lawrence > Priority: Minor > > While the 'SplitText' processor helps break up newline separated records into > individual files, it's not uncommon to have CSV files where records span > multiple lines, and 'SplitText' isn't able or meant to handle this. > Currently, one can replace, remove, or escape newline characters that exist > in a single CSV record by searching within quoted columns with 'ReplaceText', > before passing the data onto 'SplitText'. However, this may not work in all > cases, or could potentially remove the valid newline character at the end of > a CSV record, if all edge cases aren't properly covered with regex. > Having a dedicated 'SplitCSV' processor will solve this problem, and be a > simpler approach for users. > See the following [Apache NiFi user email > thread|https://mail-archives.apache.org/mod_mbox/nifi-users/201702.mbox/%3CCAFuL2BbgymFXwu5fRyd8pP-zu6WkToqPE2Ek7bkyBg0_-cknqQ%40mail.gmail.com%3E] -- This message was sent by Atlassian JIRA (v6.4.14#64029)