[jira] Commented: (HADOOP-4918) Fix bzip2 work with SequenceFile

Doug Cutting (JIRA) Thu, 15 Jan 2009 09:31:24 -0800

    [ 
https://issues.apache.org/jira/browse/HADOOP-4918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12664184#action_12664184
 ]


Doug Cutting commented on HADOOP-4918:
--------------------------------------

> BZip2Codec does not work with SequenceFile right now in 0.19.  I consider 
> that to be a bug.

The standard criteria is whether it is a regression.  Did it work in a release 
prior to 0.19?  If so, then it's a regression and should be fixed in 0.19.  If 
not then it's a new feature, and should be added in 0.20.

However sometimes, as an exception, we permit fixes to all-new code, e.g., a 
new contrib module, that are not regressions, if they have zero chance of 
causing a regression anywhere else.  This patch touches only files that were 
added in 0.19, and those files were themselves an independent addition 
(http://svn.apache.org/viewvc?view=rev&revision=680802), so I see no 
possibility for this creating any regressions in 0.19 and would not oppose 
treating it as such an exception.


> Fix bzip2 work with SequenceFile
> --------------------------------
>
>                 Key: HADOOP-4918
>                 URL: https://issues.apache.org/jira/browse/HADOOP-4918
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: io
>    Affects Versions: 0.19.0, 0.20.0, 0.21.0
>            Reporter: Zheng Shao
>            Assignee: Zheng Shao
>             Fix For: 0.19.1, 0.20.0, 0.21.0
>
>         Attachments: HADOOP-4918.3.0.19.patch, HADOOP-4918.3.0.20.patch, 
> HADOOP-4918.3.patch
>
>
> Somehow bzip2 does not work with SequenceFile:
> {code}
>     String codec = "org.apache.hadoop.io.compress.BZip2Codec";
>     SequenceFile.Writer writer = SequenceFile.createWriter(fs, conf, new 
> Path(output), 
>         reader.getKeyClass(), reader.getValueClass(), CompressionType.BLOCK, 
>         (CompressionCodec)Class.forName(codec).newInstance());
> {code}
> The stack trace is here:
> {noformat}
> java.lang.UnsupportedOperationException
>         at 
> org.apache.hadoop.io.compress.BZip2Codec.getCompressorType(BZip2Codec.java:80)
>         at 
> org.apache.hadoop.io.compress.CodecPool.getCompressor(CodecPool.java:98)
>         at 
> org.apache.hadoop.io.SequenceFile$Writer.init(SequenceFile.java:914)
>         at 
> org.apache.hadoop.io.SequenceFile$BlockCompressWriter.<init>(SequenceFile.java:1198)
>         at 
> org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:401)
>         at 
> org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:329)
>         at 
> org.apache.hadoop.mapred.TestSequenceFileBZip.main(TestSequenceFileBZip.java:43)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.apache.hadoop.util.RunJar.main(RunJar.java:165)
>         at org.apache.hadoop.mapred.JobShell.run(JobShell.java:54)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79)
>         at org.apache.hadoop.mapred.JobShell.main(JobShell.java:68)
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (HADOOP-4918) Fix bzip2 work with SequenceFile

Reply via email to