[
https://issues.apache.org/jira/browse/HADOOP-4749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zheng Shao updated HADOOP-4749:
-------------------------------
Summary: reducer should output input data size when shuffling is done
(was: reducer should output input data size and record count when shuffling is
done)
> reducer should output input data size when shuffling is done
> ------------------------------------------------------------
>
> Key: HADOOP-4749
> URL: https://issues.apache.org/jira/browse/HADOOP-4749
> Project: Hadoop Core
> Issue Type: Improvement
> Components: mapred
> Reporter: Zheng Shao
>
> Sometimes we see a single slow reducer because of the load balancing problem.
> This information will be very useful to understand how imbalanced the load is.
> Should be easy to fix I guess, since reducer should have all information
> needed at the end of the shuffling phase.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.