[
https://issues.apache.org/jira/browse/HADOOP-4699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12655114#action_12655114
]
Chris Douglas commented on HADOOP-4699:
---------------------------------------
bq. I see that you have removed the Debug statements introduced by Arun for
HADOOP-3647. Is it no longer required?
HADOOP-4754 was going to address this directly, but it would have conflicted
with other patches, so I abandoned it. If you think it merits a separate issue,
then I can rework both patches.
bq. And curious, did you benchmark this patch against trunk for performance?
No. It probably won't have a measurable impact.
> Change TaskTracker.MapOutputServlet to send only the IFile segment, validate
> checksum in Reduce
> -----------------------------------------------------------------------------------------------
>
> Key: HADOOP-4699
> URL: https://issues.apache.org/jira/browse/HADOOP-4699
> Project: Hadoop Core
> Issue Type: Improvement
> Reporter: Chris Douglas
> Assignee: Chris Douglas
> Fix For: 0.20.0
>
> Attachments: 4699-0.patch
>
>
> Instead of validating the checksum of the IFile segment in MapOutputServlet,
> validation may be left to the reduce. While failures may not be detected
> until late in the reduce, the throughput and CPU improvements should make up
> for it in the average case.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.