Re: Counters that track the max value

2012-10-05 Thread Harsh J
Jeremy, I suppose thats doable, please file a MAPREDUCE JIRA so you can discuss this with others on the development side as well. I am guessing that MAX operations of most of the user-oriented data flow front-ends such as Hive and Pig already do this efficiently, so perhaps there hasn't been a

Re: Counters that track the max value

2012-10-05 Thread Jeremy Lewi
Done. https://issues.apache.org/jira/browse/MAPREDUCE-4709 Thanks J On Fri, Oct 5, 2012 at 10:13 AM, Harsh J ha...@cloudera.com wrote: Jeremy, I suppose thats doable, please file a MAPREDUCE JIRA so you can discuss this with others on the development side as well. I am guessing that MAX

Counters that track the max value

2012-10-03 Thread Jeremy Lewi
HI hadoop-users, I'm curious if there is an implementation somewhere of a counter which tracks the maximum of some value across all mappers or reducers? Thanks J

Re: Counters that track the max value

2012-10-03 Thread Harsh J
Jeremy, Here's my shot at it (pardon the quick crappy code): https://gist.github.com/3828246 Basically - you can achieve it in two ways: Requirement: All tasks must increment the max designated counter only AFTER the max has been computed (i.e. in cleanup). 1. All tasks may use same counter