[
https://issues.apache.org/jira/browse/SQOOP-428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lars Francke updated SQOOP-428:
-------------------------------
Attachment: SQOOP-428.2.patch
Turns out my patch actually contains a big flaw which Tom pointed out.
Avro expects the codec to be specified as either {{null}}, {{deflate}} or
{{snappy}} and it doesn't accept full Codec class names. I've changed the code
and the documentation to point this out and also amended the test as well as
fixing those Checkstyle issues.
Updated patch as per ReviewBoard
> AvroOutputFormat doesn't support compression even though documentation claims
> it does
> -------------------------------------------------------------------------------------
>
> Key: SQOOP-428
> URL: https://issues.apache.org/jira/browse/SQOOP-428
> Project: Sqoop
> Issue Type: Bug
> Components: docs
> Affects Versions: 1.4.0-incubating
> Reporter: Lars Francke
> Priority: Minor
> Labels: avro, document
> Attachments: SQOOP-428.1.patch, SQOOP-428.2.patch
>
>
> The documentation claims that Avro files can be compressed as well:
> {quote}
> By default, data is not compressed. You can compress your data by using the
> deflate (gzip) algorithm with the -z or --compress argument, or specify any
> Hadoop compression codec using the --compression-codec argument. This applies
> to SequenceFile, text, and Avro files.
> {quote}
> This is not true as the AvroOutputFormat currently doesn't support
> compression.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira