Re: Update interval of default counters

2014-04-16 Thread Akira AJISAKA
I think the interval is hard-coded to protect the Hadoop cluster from heavy network traffic. If the value is too small, there is too much network traffic between the Map/Reduce tasks and the MRAppMaster. Please also see https://issues.apache.org/jira/browse/MAPREDUCE-4381. That's why you need to be
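
To make the traffic concern concrete, here is a rough back-of-the-envelope sketch. It is not Hadoop code: the class name, the method, and the figure of 3000 concurrently running tasks are assumptions chosen only for illustration, and it models nothing beyond "every task reports once per interval".

```java
// Hypothetical back-of-the-envelope helper, not part of Hadoop.
final class StatusUpdateRate {

    // Status updates per second arriving at one MRAppMaster, assuming every
    // running task reports exactly once per intervalMs.
    static double updatesPerSecond(int runningTasks, long intervalMs) {
        return runningTasks * 1000.0 / intervalMs;
    }

    public static void main(String[] args) {
        // Assumed job size: 3000 tasks running at the same time.
        System.out.println(updatesPerSecond(3000, 3000)); // prints 1000.0
        System.out.println(updatesPerSecond(3000, 100));  // prints 30000.0
    }
}
```

Under these assumptions, shortening the interval from 3000 ms to 100 ms multiplies the update rate seen by the MRAppMaster by 30.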

Re: Update interval of default counters

2014-04-16 Thread Dharmesh Kakadia
Hi Akira, Thanks for the quick reply. Is there any particular reason for hard-coding it? Is there a workaround? I want to read the counters at as fine a granularity as possible. Also, can you point me to the relevant source code? I am willing to take the issue and contribute if required. Thanks, Dharmesh On Wed

Re: Update interval of default counters

2014-04-16 Thread Akira AJISAKA
Moved mapreduce-dev@ to Bcc. Hi Dharmesh, The parameter sets the interval at which the MRAppMaster is polled for progress; it does not affect the Map/Reduce tasks. The tasks send their progress (including the counter information) to the MRAppMaster every 3000 milliseconds, which is hard-coded. That's why a sudden bi
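
The effect of the two intervals can be illustrated with a small simulation. This is a sketch only, not Hadoop code: the class and method names are hypothetical, and the counter is assumed to grow by one per millisecond so the staleness is easy to see. The 3000 ms and 1000 ms values are the ones mentioned in this thread.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical simulation, not part of Hadoop: a task publishes its counter
// only every reportIntervalMs, while a client polls every pollIntervalMs.
final class CounterPollingSketch {

    // Value visible to the client at time tMs: the counter as of the most
    // recent report. Assumes the true counter grows by one per millisecond.
    static long visibleValue(long tMs, long reportIntervalMs) {
        return (tMs / reportIntervalMs) * reportIntervalMs;
    }

    // Collects what a client polling at a fixed interval would observe.
    static List<Long> observe(long durationMs, long pollIntervalMs, long reportIntervalMs) {
        List<Long> seen = new ArrayList<>();
        for (long t = 0; t <= durationMs; t += pollIntervalMs) {
            seen.add(visibleValue(t, reportIntervalMs));
        }
        return seen;
    }

    public static void main(String[] args) {
        // 1000 ms client polls against 3000 ms task reports, over 9 seconds.
        System.out.println(observe(9000, 1000, 3000));
    }
}
```

In this model the client prints [0, 0, 0, 3000, 3000, 3000, 6000, 6000, 6000, 9000]: polling faster than the tasks report only repeats the last reported value, followed by a jump.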

Re: Update interval of default counters

2014-04-15 Thread Dharmesh Kakadia
Hi Akira, Thanks for the reply, but as I understand it, this is the interval for printing counters to the console. What I am trying to do is: while(!job.isComplete()){ getcounters() and do some processing on that. } This runs fine, but I get the same counter values repeatedly and then sudd
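
One way to cope with repeated values in such a loop is to process a snapshot only when it differs from the previous one. The sketch below is an assumption-laden illustration, not Hadoop code: ChangeOnlyPoller and its parameters are hypothetical, and the suppliers stand in for job.isComplete() and for reading a single value out of job.getCounters().

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.BooleanSupplier;
import java.util.function.LongConsumer;
import java.util.function.LongSupplier;

// Hypothetical helper, not part of Hadoop. In a real client, `done` would wrap
// job.isComplete() and `counter` would read one value from job.getCounters().
final class ChangeOnlyPoller {

    // Polls until done, calling onChange only when the value differs from the
    // last one seen. Returns early if the polling thread is interrupted.
    static void poll(BooleanSupplier done, LongSupplier counter,
                     LongConsumer onChange, long sleepMs) {
        long last = Long.MIN_VALUE; // sentinel: nothing seen yet
        while (!done.getAsBoolean()) {
            long now = counter.getAsLong();
            if (now != last) {
                onChange.accept(now);
                last = now;
            }
            try {
                Thread.sleep(sleepMs);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve interrupt status
                return;
            }
        }
    }

    public static void main(String[] args) {
        // Canned snapshots standing in for successive counter reads.
        long[] snapshots = {0, 0, 0, 3000, 3000, 3000, 6000};
        int[] next = {0};
        List<Long> processed = new ArrayList<>();
        poll(() -> next[0] >= snapshots.length,
             () -> snapshots[next[0]++],
             v -> processed.add(v),
             0);
        System.out.println(processed); // prints [0, 3000, 6000]
    }
}
```

Note that a loop of this shape exits as soon as the job reports completion, so the final counter values would still need to be read once after the loop.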

Re: Update interval of default counters

2014-04-15 Thread Akira AJISAKA
Moved to u...@hadoop.apache.org. You can configure the interval by setting the "mapreduce.client.progressmonitor.pollinterval" parameter. The default value is 1000 ms. For more details, please see http://hadoop.apache.org/docs/stable/hadoop-mapreduce-client/hadoop-mapreduce-client-core/mapred-defau

Update interval of default counters

2014-04-14 Thread Dharmesh Kakadia
Hi, What is the update interval of the built-in framework counters? Is it configurable? I am trying to collect very fine-grained information about job execution and am using counters for that. It would be great if someone could point me to documentation/code for it. Thanks in advance. Thanks, Dharmes