[
https://issues.apache.org/jira/browse/PIG-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dmitriy V. Ryaboy updated PIG-2143:
-----------------------------------
Attachment: PIG-2143.3.patch
Submitting a new version of the patch.
* added a bunch of javadocs
* added "-noschema" to turn off loading of schema
* removed the compression code accidentally introduced in the previous version
-- saving that for another ticket
* did NOT fix the delimiter issue. That's because we currently don't store the
delimiter when storing the schema! We just use it to delimit the header file.
There is no clean way to store this kind of information, we need to think about
the proper design for this.
> Improvements for PigStorage
> ---------------------------
>
> Key: PIG-2143
> URL: https://issues.apache.org/jira/browse/PIG-2143
> Project: Pig
> Issue Type: Improvement
> Reporter: Dmitriy V. Ryaboy
> Assignee: Dmitriy V. Ryaboy
> Fix For: 0.10
>
> Attachments: PIG-2143.2.diff, PIG-2143.3.patch, PIG-2143.diff
>
>
> I'd like to propose that we allow for a greater degree of customization in
> PigStorage.
> An incomplete list features that we might want to add:
> - flag to tell it to overwrite existing output if it exists
> - flag to tell it to compress output using gzip|bzip|lzo (currently this can
> be achieved by setting the directory name to end in .gz or .bz2, which is a
> bit awkward)
> - flag to tell it to store the schema and header (perhaps by merging in
> PigStorageSchema work?)
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira