add error option if file-based record-readers fail to consume all input (e.g., concatenated gzip, bzip2) --------------------------------------------------------------------------------------------------------
Key: MAPREDUCE-1795 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1795 Project: Hadoop Map/Reduce Issue Type: Improvement Reporter: Greg Roelofs Assignee: Ravi Gummadi When running MapReduce with concatenated gzip files as input only the first part is read, which is confusing, to say the least. Concatenated gzip is described in http://www.gnu.org/software/gzip/manual/gzip.html#Advanced-usage and in http://www.ietf.org/rfc/rfc1952.txt. (See original report at http://www.nabble.com/Problem-with-Hadoop-and-concatenated-gzip-files-to21383097.html) -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.