Steve Loughran created SLIDER-77:
------------------------------------

             Summary: use windows and weighted moving averages for tracking container failures
                 Key: SLIDER-77
                 URL: https://issues.apache.org/jira/browse/SLIDER-77
             Project: Slider
          Issue Type: Sub-task
          Components: appmaster
            Reporter: Steve Loughran
            Priority: Minor


Use sliding windows and/or weighted moving averages to track container failures 
over time, and only react if many are failing in a short period.

The goal is to react quickly to a sudden run of failures while still looking 
at average failure rates over time. I think separating startup failures from 
operational failures could help here. We don't want 5 failures in 5 minutes to 
be ignored just because everything worked well for the previous month.
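
A minimal sketch of how the two signals could be combined, written in Python 
for brevity rather than as Slider code. The class name, method names, and the 
default window, threshold, and half-life values are all hypothetical; the 
exponentially decayed count stands in for a weighted moving average. 
Timestamps are passed in explicitly (seconds) so the behaviour is easy to 
check.

```python
from collections import deque


class FailureTracker:
    """Hypothetical tracker; names and defaults are illustrative assumptions."""

    def __init__(self, window_seconds=300.0, burst_threshold=5,
                 half_life_seconds=3600.0):
        self.window_seconds = window_seconds
        self.burst_threshold = burst_threshold
        self.half_life_seconds = half_life_seconds
        self._recent = deque()       # failure timestamps inside the window
        self._decayed_count = 0.0    # exponentially decayed failure count
        self._last_update = None     # time the decayed count was last aged

    def _decay_to(self, now):
        # Age the long-term count: it halves every half_life_seconds.
        if self._last_update is None:
            self._last_update = now
            return
        elapsed = max(0.0, now - self._last_update)
        self._decayed_count *= 0.5 ** (elapsed / self.half_life_seconds)
        self._last_update = max(self._last_update, now)

    def _evict(self, now):
        # Drop failures that have fallen out of the sliding window.
        while self._recent and now - self._recent[0] > self.window_seconds:
            self._recent.popleft()

    def record_failure(self, now):
        self._decay_to(now)
        self._decayed_count += 1.0
        self._recent.append(now)
        self._evict(now)

    def failures_in_window(self, now):
        self._evict(now)
        return len(self._recent)

    def decayed_failures(self, now):
        self._decay_to(now)
        return self._decayed_count

    def is_burst(self, now):
        # Short-term signal: many failures within the window, whatever
        # the long-term history looks like.
        return self.failures_in_window(now) >= self.burst_threshold


# A quiet month followed by five failures one minute apart.
month = 30 * 24 * 3600.0
tracker = FailureTracker()
for i in range(5):
    tracker.record_failure(month + 60.0 * i)
burst_detected = tracker.is_burst(month + 240.0)
```

In this sketch the window check fires on the burst regardless of the quiet 
month before it, while the decayed count gives a longer-term view that fades 
gradually. Keeping one tracker instance per failure class (startup versus 
operational) would be one way to separate the two.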



--
This message was sent by Atlassian JIRA
(v6.2#6252)
