[ https://issues.apache.org/jira/browse/MAPREDUCE-1073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dick King updated MAPREDUCE-1073: --------------------------------- Status: Open (was: Patch Available) Removed this patch to replace it with another patch that tests its functionality. > Progress reported for pipes tasks is incorrect. > ----------------------------------------------- > > Key: MAPREDUCE-1073 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1073 > Project: Hadoop Map/Reduce > Issue Type: Bug > Components: pipes > Affects Versions: 0.20.1 > Reporter: Sreekanth Ramakrishnan > Attachments: mapreduce-1073--2010-03-31.patch, > MAPREDUCE-1073_yhadoop20.patch > > > Currently in pipes, > {{org.apache.hadoop.mapred.pipes.PipesMapRunner.run(RecordReader<K1, V1>, > OutputCollector<K2, V2>, Reporter)}} we do the following: > {code} > while (input.next(key, value)) { > downlink.mapItem(key, value); > if(skipping) { > downlink.flush(); > } > } > {code} > This would result in consumption of all the records for current task and > taking task progress to 100% whereas the actual pipes application would be > trailing behind. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.