Re: Listing Hadoop Job History Statistics

2010-08-16 Thread Ranjit Mathew
[BCC-ing "general" - again.] On Tuesday 17 August 2010 07:36 AM, Scott Whitecross wrote: Thanks for the answers Doug and Arun. I'm assuming the job-history files mentioned are in ./hadoop-0.20/logs/history/done/. The files look like they were serialized by a class in Hadoop? (If I can read t

Re: Listing Hadoop Job History Statistics

2010-08-16 Thread Scott Whitecross
Thanks for the answers Doug and Arun. I'm assuming the job-history files mentioned are in ./hadoop-0.20/logs/history/done/. The files look like they were serialized by a class in Hadoop? (If I can read the files back into the appropriate class, and then dump them out into a custom format, that'

Re: Listing Hadoop Job History Statistics

2010-08-11 Thread Arun C Murthy
Moving to mapreduce-user@, bcc gene...@. There isn't a direct way. One possible option is just use the per-job job-history file which is on HDFS (See http://hadoop.apache.org/common/docs/r0.20.0/mapred_tutorial.html#Job+Submission+and+Monitoring for info on job-history). Hope that helps. A

Re: Listing Hadoop Job History Statistics

2010-08-11 Thread Doug Balog
I don't know if this is the best way, but this is how I do it. Configuration conf = new Configuration(); JobClient jobClient = new JobClient(new InetSocketAddress("jobTracker",9001),conf); jobClient.setConf(conf); // Bug in constructor, doesn't set conf. for(JobStatus js: jobClient.getAllJobs(

Listing Hadoop Job History Statistics

2010-08-11 Thread Scott Whitecross
Hi - What's the best way to list and query information on Hadoop job histories? For example, I'd like to see the job names from the past week against a Hadoop cluster I'm using. I don't see an API call or a way through the command line to pull the information. Is the best way writing a quick s