[ https://issues.apache.org/jira/browse/MAPREDUCE-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Greg Roelofs updated MAPREDUCE-1795: ------------------------------------ Original Estimate: (was: 336h) Remaining Estimate: (was: 336h) Assignee: (was: Ravi Gummadi) Affects Version/s: (was: 0.20.2) > add error option if file-based record-readers fail to consume all input > (e.g., concatenated gzip, bzip2) > -------------------------------------------------------------------------------------------------------- > > Key: MAPREDUCE-1795 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1795 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Reporter: Greg Roelofs > > When running MapReduce with concatenated gzip files as input, only the first > part ("member" in gzip spec parlance, http://www.ietf.org/rfc/rfc1952.txt) is > read; the remainder is silently ignored. As a first step toward fixing that, > this issue will add a configurable option to throw an error in such cases. > MAPREDUCE-469 is the tracker for the more complete fix/feature, whenever that > occurs. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.