Jonathan Packer created PIG-3141:
------------------------------------

             Summary: Giving CSVExcelStorage an option to handle header rows
                 Key: PIG-3141
                 URL: https://issues.apache.org/jira/browse/PIG-3141
             Project: Pig
          Issue Type: Improvement
          Components: piggybank
    Affects Versions: 0.11
            Reporter: Jonathan Packer
             Fix For: 0.11
         Attachments: csv.patch

Adds an argument to CSVExcelStorage to skip the header row when loading. This works properly both when multiple small files, each with its own header, are combined into one split and when a large file with a single header is divided across multiple splits.

Also fixes a few bugs in CSVExcelStorage, including PIG-2470 and a bug in which quoted fields at the end of a line were not escaped properly.

Removes the choice of delimiter, since a CSV file ought to use only a comma delimiter, hence the name.
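
To illustrate the two split cases described above, here is a minimal Python sketch of one way header skipping can be made split-safe: drop a line only when it is the first line of a chunk that starts at byte offset 0 of its file. This is not the code in csv.patch; the `Chunk` type, the `read_split` function, and the offset-based rule are assumptions made purely for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List


@dataclass
class Chunk:
    """Hypothetical stand-in for one file's byte range inside an input split."""
    path: str
    start: int        # byte offset of this chunk within its file
    lines: List[str]  # lines that begin inside this chunk


def read_split(chunks: Iterable[Chunk], skip_header: bool = True) -> Iterator[str]:
    """Yield data lines from a split, dropping one header line per file.

    Assumed rule: a line is a header only if it is the first line of a
    chunk that begins at offset 0 of its file.
    """
    for chunk in chunks:
        for index, line in enumerate(chunk.lines):
            if skip_header and chunk.start == 0 and index == 0:
                continue  # first line of a file: treat as the header
            yield line


# Case 1: two small files, each with a header, combined into one split.
combined = [
    Chunk("a.csv", 0, ["id,name", "1,x"]),
    Chunk("b.csv", 0, ["id,name", "2,y"]),
]

# Case 2: one large file with a single header, divided into two splits.
first_split = [Chunk("big.csv", 0, ["id,name", "1,x"])]
second_split = [Chunk("big.csv", 4096, ["2,y", "3,z"])]

print(list(read_split(combined)))      # ['1,x', '2,y']
print(list(read_split(first_split)))   # ['1,x']
print(list(read_split(second_split)))  # ['2,y', '3,z']
```

Keying the decision on the chunk's offset within its file, rather than on its position within the split, is what lets the same rule cover both cases: every file loses exactly one line, and a chunk that starts mid-file loses none.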