[ https://issues.apache.org/jira/browse/PIG-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14615348#comment-14615348 ]
Ken Wu commented on PIG-4623: ----------------------------- This has been fixed and the pull request is available at: https://github.com/apache/pig/pull/20 > Fixed the 'new line' character inside double-quote causing the csv parsing > failure > ---------------------------------------------------------------------------------- > > Key: PIG-4623 > URL: https://issues.apache.org/jira/browse/PIG-4623 > Project: Pig > Issue Type: Bug > Components: piggybank > Reporter: Ken Wu > Assignee: Ken Wu > Original Estimate: 24h > Remaining Estimate: 24h > > A new line character should be allowed inside a double quote as a valid csv > document. For example, the following csv document should be treated as a > SINGLE valid csv data > Iphone,"{ ItemName : Cheez-It > 21 Ounce}", > However, the current implementation of the getNext() inside > org.apache.pig.piggybank.storage.CSVLoader class fails to take care of this > case and it sees two lines of data while in fact it should be treated as > single line of data. > This pull request fixes the above issue. > (Note: here is a linke to validate whether a csv document: http://csvlint.io/) -- This message was sent by Atlassian JIRA (v6.3.4#6332)