Re: Alerting

2012-12-23 Thread Mohammad Tariq
t; -Marcin >> >> ** ** >> >> *From:* Ted Dunning [mailto:tdunn...@maprtech.com] >> *Sent:* Saturday, December 22, 2012 5:12 PM >> *To:* common-u...@hadoop.apache.org >> *Subject:* Re: Alerting >> >> ** ** >> >> Also, I think tha

Re: Alerting

2012-12-23 Thread Junior Mint
t the SLA mechanism should do what’s asked for. > > ** ** > > -Marcin > > ** ** > > *From:* Ted Dunning [mailto:tdunn...@maprtech.com] > *Sent:* Saturday, December 22, 2012 5:12 PM > *To:* common-u...@hadoop.apache.org > *Subject:* Re: Alerting > > ** **

RE: Alerting

2012-12-23 Thread Marcin Mejran
: Saturday, December 22, 2012 5:12 PM To: common-u...@hadoop.apache.org Subject: Re: Alerting Also, I think that Oozie allows for timeouts in job submission. That might answer your need. On Sat, Dec 22, 2012 at 2:08 PM, Ted Dunning mailto:tdunn...@maprtech.com>> wrote: You can write a

Re: Alerting

2012-12-22 Thread Ted Dunning
Also, I think that Oozie allows for timeouts in job submission. That might answer your need. On Sat, Dec 22, 2012 at 2:08 PM, Ted Dunning wrote: > You can write a script to parse the Hadoop job list and send an alert. > > The trick of putting a retry into your workflow system is a nice one.

Re: Alerting

2012-12-22 Thread Ted Dunning
You can write a script to parse the Hadoop job list and send an alert. The trick of putting a retry into your workflow system is a nice one. If your program won't allow multiple copies to run at the same time, then if you re-invoke the program every, say, hour, then 5 retries implies that the pre

Re: Alerting

2012-12-22 Thread Nitin Pawar
you may just add an alert via email to your workflow for the failure you can try the retry with # feature tries and then send alert of job failures (we used this for jobs running for over 5 hrs and worked well for us) On Sun, Dec 23, 2012 at 2:19 AM, Mohit Anchlia wrote: > Need alerting > > > O

Re: Alerting

2012-12-22 Thread Mohit Anchlia
Need alerting On Sat, Dec 22, 2012 at 12:44 PM, Mohammad Tariq wrote: > MR web UI?Although we can't trigger anything, it provides all the info > related to the jobs. I mean it would be easier to just go there and and > have a look at everything rather than opening the shell and typing the > comm

Re: Alerting

2012-12-22 Thread Mohammad Tariq
MR web UI?Although we can't trigger anything, it provides all the info related to the jobs. I mean it would be easier to just go there and and have a look at everything rather than opening the shell and typing the command. I'm a bit lazy ;) Best Regards, Tariq +91-9741563634 https://mtariq.jux.co

Re: Alerting

2012-12-22 Thread Mohit Anchlia
Best I can find is hadoop job list so far On Sat, Dec 22, 2012 at 12:30 PM, Mohit Anchlia wrote: > What's the best way to trigger alert when jobs run for too long or have > many failures? Is there a hadoop command that can be used to perform this > activity?