t; -Marcin
>>
>> ** **
>>
>> *From:* Ted Dunning [mailto:tdunn...@maprtech.com]
>> *Sent:* Saturday, December 22, 2012 5:12 PM
>> *To:* common-u...@hadoop.apache.org
>> *Subject:* Re: Alerting
>>
>> ** **
>>
>> Also, I think tha
t the SLA mechanism should do what’s asked for.
>
> ** **
>
> -Marcin
>
> ** **
>
> *From:* Ted Dunning [mailto:tdunn...@maprtech.com]
> *Sent:* Saturday, December 22, 2012 5:12 PM
> *To:* common-u...@hadoop.apache.org
> *Subject:* Re: Alerting
>
> ** **
: Saturday, December 22, 2012 5:12 PM
To: common-u...@hadoop.apache.org
Subject: Re: Alerting
Also, I think that Oozie allows for timeouts in job submission. That might
answer your need.
On Sat, Dec 22, 2012 at 2:08 PM, Ted Dunning
mailto:tdunn...@maprtech.com>> wrote:
You can write a
nice one. If
> your program won't allow multiple copies to run at the same time, then if
> you re-invoke the program every, say, hour, then 5 retries implies that the
> previous invocation has been running for 5 hours.
>
>
> On Sat, Dec 22, 2012 at 12:49 PM, Mohit A
t the
previous invocation has been running for 5 hours.
On Sat, Dec 22, 2012 at 12:49 PM, Mohit Anchlia wrote:
> Need alerting
>
>
> On Sat, Dec 22, 2012 at 12:44 PM, Mohammad Tariq wrote:
>
>> MR web UI?Although we can't trigger anything, it provides all the info
>>
you may just add an alert via email to your workflow for the failure
you can try the retry with # feature tries and then send alert of job
failures (we used this for jobs running for over 5 hrs and worked well for
us)
On Sun, Dec 23, 2012 at 2:19 AM, Mohit Anchlia wrote:
> Need alert
Need alerting
On Sat, Dec 22, 2012 at 12:44 PM, Mohammad Tariq wrote:
> MR web UI?Although we can't trigger anything, it provides all the info
> related to the jobs. I mean it would be easier to just go there and and
> have a look at everything rather than opening the shell
MR web UI?Although we can't trigger anything, it provides all the info
related to the jobs. I mean it would be easier to just go there and and
have a look at everything rather than opening the shell and typing the
command.
I'm a bit lazy ;)
Best Regards,
Tariq
+91-9741563634
https://mtariq.jux.co
Best I can find is hadoop job list so far
On Sat, Dec 22, 2012 at 12:30 PM, Mohit Anchlia wrote:
> What's the best way to trigger alert when jobs run for too long or have
> many failures? Is there a hadoop command that can be used to perform this
> activity?
What's the best way to trigger alert when jobs run for too long or have
many failures? Is there a hadoop command that can be used to perform this
activity?
10 matches
Mail list logo