reducer should output input data size and record count when shuffling is done
-----------------------------------------------------------------------------
Key: HADOOP-4749
URL: https://issues.apache.org/jira/browse/HADOOP-4749
Project: Hadoop Core
Issue Type: Improvement
Components: mapred
Reporter: Zheng Shao
Sometimes we see a single slow reducer because of the load balancing problem.
This information will be very useful to understand how imbalanced the load is.
Should be easy to fix I guess, since reducer should have all information needed
at the end of the shuffling phase.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.