Jeremy,
I suppose thats doable, please file a MAPREDUCE JIRA so you can
discuss this with others on the development side as well.
I am guessing that MAX operations of most of the user-oriented data
flow front-ends such as Hive and Pig already do this efficiently, so
perhaps there hasn't been a
Done.
https://issues.apache.org/jira/browse/MAPREDUCE-4709
Thanks
J
On Fri, Oct 5, 2012 at 10:13 AM, Harsh J ha...@cloudera.com wrote:
Jeremy,
I suppose thats doable, please file a MAPREDUCE JIRA so you can
discuss this with others on the development side as well.
I am guessing that MAX
HI hadoop-users,
I'm curious if there is an implementation somewhere of a counter which
tracks the maximum of some value across all mappers or reducers?
Thanks
J
Jeremy,
Here's my shot at it (pardon the quick crappy code):
https://gist.github.com/3828246
Basically - you can achieve it in two ways:
Requirement: All tasks must increment the max designated counter
only AFTER the max has been computed (i.e. in cleanup).
1. All tasks may use same counter