Daniel Dai created PIG-4600: ------------------------------- Summary: Delegate bzip processing to Hadoop Key: PIG-4600 URL: https://issues.apache.org/jira/browse/PIG-4600 Project: Pig Issue Type: Improvement Reporter: Daniel Dai Assignee: Daniel Dai Fix For: 0.16.0
Pig handling bzip within its own codebase. I am not very sure but that's probably due to history reason. Hadoop does not support splittable bzip until 0.21 (HADOOP-4012). If so, we shall remove the bzip2 module in Pig and delegate to Hadoop. In particular, Pig does not support bzip2 concatenation. Once we delegate bzip to Hadoop, this issue will be solved. -- This message was sent by Atlassian JIRA (v6.3.4#6332)