[ https://issues.apache.org/jira/browse/PIG-2143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13065391#comment-13065391 ]
Raghu Angadi commented on PIG-2143:
-----------------------------------
PigStorageSchema is not setting the "-schema" argument.
> I am not sure that loading the schema when it was created but isn't being
> requested is a good idea.
+1 for using the existing schema. Maybe we could add an '-ignoreSchema' flag.
> If PigStorage is used with a delim on a file stored with '-schema', should it
> throw an error if the delim in the metadata file is different? Or warn and
> just use the delim specified in the metadata file?
+1 for throwing an error. If we have -ignoreSchema, it will let users handle
some rare cases.
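A rough sketch of the check I have in mind, written in Python only to keep it short (PigStorage itself is Java). Every name here is hypothetical: resolve_delimiter, DelimiterMismatchError, and the ignore_schema parameter, which stands in for the proposed '-ignoreSchema' flag. The tab default is an assumption of the sketch as well.

```python
from typing import Optional


class DelimiterMismatchError(ValueError):
    """Raised when the requested delimiter contradicts the stored metadata."""


def resolve_delimiter(requested: Optional[str],
                      stored: Optional[str],
                      ignore_schema: bool = False,
                      default: str = "\t") -> str:
    """Pick the delimiter to load with.

    requested     -- delimiter passed by the user, or None if not given
    stored        -- delimiter recorded in the metadata file, or None if absent
    ignore_schema -- stand-in for the proposed '-ignoreSchema' flag
    default       -- fallback delimiter (tab is assumed here)
    """
    # With the flag set, or with no metadata at all, the metadata is not
    # consulted: use what the user asked for, else the default.
    if ignore_schema or stored is None:
        return requested if requested is not None else default

    # Metadata exists and is honoured: a conflicting delimiter is an error
    # rather than a warning.
    if requested is not None and requested != stored:
        raise DelimiterMismatchError(
            "requested delimiter %r differs from stored delimiter %r"
            % (requested, stored))

    return stored
```

Under this sketch a user who knows the metadata is stale can still load the file by passing the flag, which covers the rare cases mentioned above.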
> Improvements for PigStorage
> ---------------------------
>
> Key: PIG-2143
> URL: https://issues.apache.org/jira/browse/PIG-2143
> Project: Pig
> Issue Type: Improvement
> Reporter: Dmitriy V. Ryaboy
> Assignee: Dmitriy V. Ryaboy
> Fix For: 0.10
>
> Attachments: PIG-2143.2.diff, PIG-2143.diff
>
>
> I'd like to propose that we allow for a greater degree of customization in
> PigStorage.
> An incomplete list of features that we might want to add:
> - flag to tell it to overwrite existing output if it exists
> - flag to tell it to compress output using gzip|bzip|lzo (currently this can
> be achieved by setting the directory name to end in .gz or .bz2, which is a
> bit awkward)
> - flag to tell it to store the schema and header (perhaps by merging in
> PigStorageSchema work?)