[ 
https://issues.apache.org/jira/browse/MAPREDUCE-4840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13508384#comment-13508384
 ] 

Mostafa Elhemali commented on MAPREDUCE-4840:
---------------------------------------------

Ah OK looks like I was the one confused. I don't believe it works, though as 
mentioned above I can't really test because of unrelated Windows problems. 
There's this code in TaskAttemptListenerImpl though:

{code}
  @Override
  public void reportNextRecordRange(TaskAttemptID taskAttemptID, Range range)
      throws IOException {
    // This is used when the feature of skipping records is enabled.

    // This call exists as a hadoop mapreduce legacy wherein all changes in
    // counters/progress/phase/output-size are reported through statusUpdate()
    // call but not the next record range information.
    throw new IOException("Not yet implemented.");
  }
{code}

So I guess the right thing to do is fix the implementation? Not sure if there's 
a JIRA tracking that.
                
> Delete dead code and deprecate public API related to skipping bad records
> -------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-4840
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-4840
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>    Affects Versions: 2.0.0-alpha
>            Reporter: Mostafa Elhemali
>            Priority: Minor
>         Attachments: MAPREDUCE-4840.patch
>
>
> It looks like the decision was made in MAPREDUCE-1932 to remove support for 
> skipping bad records rather than fix it (it doesn't work right now in trunk). 
> If that's the case then we should probably delete all the dead code related 
> to it and deprecate the public API's for it right?
> Dead code I'm talking about:
> 1. Task class: skipping, skipRanges, writeSkipRecs
> 2. MapTask class:  SkippingRecordReader inner class
> 3. ReduceTask class: SkippingReduceValuesIterator inner class
> 4. Tests: TestBadRecords
> Public API:
> 1. SkipBadRecords class

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to