Daniel Dai created PIG-4600:
-------------------------------
Summary: Delegate bzip processing to Hadoop
Key: PIG-4600
URL: https://issues.apache.org/jira/browse/PIG-4600
Project: Pig
Issue Type: Improvement
Reporter: Daniel Dai
Assignee: Daniel Dai
Fix For: 0.16.0
Pig handles bzip within its own codebase. I am not sure, but this is probably
for historical reasons: Hadoop did not support splittable bzip until 0.21
(HADOOP-4012). If so, we should remove the bzip2 module from Pig and delegate
to Hadoop.
In particular, Pig does not support bzip2 concatenation (a file made of
several bzip2 streams appended back to back). Once we delegate bzip handling
to Hadoop, this issue will be resolved.
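
To illustrate what bzip2 concatenation means, here is a minimal Python sketch
using only the standard bz2 module. It is not Pig or Hadoop code; the function
names and sample records are made up for illustration. It shows how a reader
that stops after the first stream loses the data that follows, while a reader
that keeps going recovers everything.

```python
import bz2


def first_stream_only(data: bytes) -> bytes:
    """Decode only the first bzip2 stream, as a single-stream reader would."""
    decomp = bz2.BZ2Decompressor()
    # Bytes after the end of the first stream land in decomp.unused_data
    # and are ignored here.
    return decomp.decompress(data)


def all_streams(data: bytes) -> bytes:
    """Decode every bzip2 stream in a concatenated input."""
    chunks = []
    while data:
        decomp = bz2.BZ2Decompressor()
        chunks.append(decomp.decompress(data))
        if not decomp.eof:
            raise ValueError("truncated bzip2 stream")
        # Continue with whatever follows the stream that just ended.
        data = decomp.unused_data
    return b"".join(chunks)


# Hypothetical sample records: two independently compressed parts,
# appended byte for byte (the same result as `cat a.bz2 b.bz2`).
part_a = bz2.compress(b"alice\t1\n")
part_b = bz2.compress(b"bob\t2\n")
concatenated = part_a + part_b

print(first_stream_only(concatenated))
print(all_streams(concatenated))
```

The first call returns only the record from the first stream; the second
returns both records.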