Ken Wu created PIG-4623:
---------------------------

             Summary: Fixed the 'new line' character inside double-quote 
causing the csv parsing failure
                 Key: PIG-4623
                 URL: https://issues.apache.org/jira/browse/PIG-4623
             Project: Pig
          Issue Type: Bug
          Components: piggybank
            Reporter: Ken Wu
            Assignee: Ken Wu




A new line character should be allowed inside a double quote as a valid csv 
document. For example, the following csv document should be treated as a SINGLE 
valid csv data

Iphone,"{ ItemName : Cheez-It
21 Ounce}",

However, the current implementation of the getNext() inside 
org.apache.pig.piggybank.storage.CSVLoader class fails to take care of this 
case and it sees two lines of data while in fact it should be treated as single 
line of data.

This pull request fixes the above issue.

(Note: here is a linke to validate whether a csv document: http://csvlint.io/)




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to