[ https://issues.apache.org/jira/browse/HADOOP-2284?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554545 ]
Amar Kamat commented on HADOOP-2284:
------------------------------------

I think the facility for _batch progress updates_ should be provided at the {{Reporter}} level rather than at the caller. That way we can set an interval, and any call to the reporter within that interval will do nothing. My guess is that the cost reported here lies in the body of {{progress}}. The check that sets the flag should be made conditional. Comments?

> BasicTypeSorterBase.compare calls progress on each compare
> ----------------------------------------------------------
>
>                 Key: HADOOP-2284
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2284
>             Project: Hadoop
>          Issue Type: Bug
>          Components: mapred
>            Reporter: Owen O'Malley
>            Assignee: Devaraj Das
>             Fix For: 0.16.0
>
>
> The inner loop of the sort is calling progress on each compare. I think it
> would make more sense to call progress in the sort rather than in the compare,
> or at most once every 10000 compares. In the performance numbers, the calls to
> progress as part of the sort are consuming 12% of the total CPU time when
> running word count under the local runner.
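
To make the interval idea from the comment concrete, here is a minimal sketch of a wrapper that forwards a progress call at most once per interval and ignores every call in between. It is an illustration only, not the Hadoop implementation or the eventual patch: the class name {{ThrottledProgress}}, the {{Runnable}} delegate standing in for the real {{progress}} body, and the injectable clock are all assumptions made for this example.

```java
import java.util.function.LongSupplier;

/** Hypothetical wrapper for illustration; not part of Hadoop's API. */
final class ThrottledProgress {
    private final Runnable delegate;    // stands in for the expensive progress() body
    private final long intervalMillis;  // minimum gap between forwarded calls
    private final LongSupplier clock;   // injectable so the behaviour can be tested
    private long lastForwarded;
    private boolean forwardedOnce = false;

    ThrottledProgress(Runnable delegate, long intervalMillis, LongSupplier clock) {
        this.delegate = delegate;
        this.intervalMillis = intervalMillis;
        this.clock = clock;
    }

    ThrottledProgress(Runnable delegate, long intervalMillis) {
        this(delegate, intervalMillis, System::currentTimeMillis);
    }

    /** Forwards to the delegate at most once per interval; otherwise does nothing. */
    void progress() {
        long now = clock.getAsLong();
        if (forwardedOnce && now - lastForwarded < intervalMillis) {
            return; // still inside the interval: skip the expensive call
        }
        forwardedOnce = true;
        lastForwarded = now;
        delegate.run();
    }
}
```

A caller in the sort could then invoke {{progress()}} on every compare without paying for the delegate each time. Reading the clock on every call has a cost of its own, so a call counter (for example, forwarding once every 10000 calls, as the issue description suggests) may be the cheaper variant in a hot loop.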