Progress reported for pipes tasks is incorrect.
-----------------------------------------------

                 Key: MAPREDUCE-1073
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1073
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: pipes
            Reporter: Sreekanth Ramakrishnan


Currently in pipes, the method 
{{org.apache.hadoop.mapred.pipes.PipesMapRunner.run(RecordReader<K1, V1>, 
OutputCollector<K2, V2>, Reporter)}} does the following:
{code}
        while (input.next(key, value)) {
          downlink.mapItem(key, value);
          if(skipping) {
            downlink.flush();
          }
        }
{code}

This loop can consume all the records for the current task and take the 
reported task progress to 100% while the actual pipes application is still 
trailing behind.
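
To illustrate the gap, here is a minimal, self-contained sketch. It does not use any Hadoop classes: {{PipesProgressSketch}}, {{simulate}} and the in-memory queue standing in for the downlink are hypothetical names invented for this example. It only assumes that reported progress follows how much input the Java side has read, while the pipes application drains records at its own pace.

```java
import java.util.ArrayDeque;
import java.util.Queue;

public class PipesProgressSketch {

    /**
     * Hypothetical simulation, not Hadoop code.
     * Returns {progress based on records read, progress based on records
     * the application has actually processed}.
     */
    public static float[] simulate(int totalRecords, int processedByApp) {
        if (totalRecords <= 0) {
            throw new IllegalArgumentException("totalRecords must be positive");
        }

        // Stand-in for the downlink to the pipes application (assumption).
        Queue<Integer> downlink = new ArrayDeque<>();

        // Mirrors the loop in the issue: every record is pushed down the
        // link without waiting for the application to handle it.
        int read = 0;
        for (int record = 0; record < totalRecords; record++) {
            downlink.add(record);
            read++;
        }

        // The application has only worked through part of the queue so far.
        int processed = 0;
        while (processed < processedByApp && !downlink.isEmpty()) {
            downlink.poll();
            processed++;
        }

        return new float[] {
            (float) read / totalRecords,
            (float) processed / totalRecords
        };
    }

    public static void main(String[] args) {
        float[] progress = simulate(1000, 250);
        // prints "reported=1.0 actual=0.25"
        System.out.println("reported=" + progress[0] + " actual=" + progress[1]);
    }
}
```

In this sketch the read-side figure is already 1.0 while the application has handled only a quarter of the records, which is the mismatch described above.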
