Steve Loughran created SLIDER-77:
------------------------------------
Summary: use windows and weighted moving averages for tracking
container failures
Key: SLIDER-77
URL: https://issues.apache.org/jira/browse/SLIDER-77
Project: Slider
Issue Type: Sub-task
Components: appmaster
Reporter: Steve Loughran
Priority: Minor
Use sliding windows and/or weighted moving averages to track container failures
over time, and only react if many are failing in a short period.
What we want here is to react quickly to a sudden series of failures, as
well as to look at average failure rates over time. I think separating startup
failures from operational failures could help here. We don't want 5 failures in
5 minutes to be ignored just because everything worked well for the previous
month.
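
A rough sketch of how the two mechanisms could fit together, in Python for
brevity rather than as a proposed patch. The class name FailureTracker, its
methods, and the default values (300-second window, threshold of 5, alpha of
0.2) are all hypothetical and are not taken from the Slider codebase. The
sliding window catches sudden bursts; the exponentially weighted moving
average tracks the longer-term failure rate.

```python
from collections import deque


class FailureTracker:
    """Illustrative only: sliding-window burst detection plus an EWMA rate."""

    def __init__(self, window_seconds=300.0, burst_threshold=5, alpha=0.2):
        self.window_seconds = window_seconds    # assumed: 5-minute window
        self.burst_threshold = burst_threshold  # assumed: 5 failures is "many"
        self.alpha = alpha                      # weight given to the newest interval
        self.recent = deque()                   # failure timestamps still in the window
        self.interval_count = 0                 # failures in the current interval
        self.ewma_rate = 0.0                    # smoothed failures per interval

    def _evict(self, now):
        # Drop timestamps that have fallen out of the sliding window.
        while self.recent and now - self.recent[0] > self.window_seconds:
            self.recent.popleft()

    def record_failure(self, now):
        self.recent.append(now)
        self.interval_count += 1
        self._evict(now)

    def failures_in_window(self, now):
        self._evict(now)
        return len(self.recent)

    def is_burst(self, now):
        # Depends only on recent failures, so a long quiet history cannot mask it.
        return self.failures_in_window(now) >= self.burst_threshold

    def close_interval(self):
        # Fold the finished interval's count into the moving average.
        self.ewma_rate = (self.alpha * self.interval_count
                          + (1 - self.alpha) * self.ewma_rate)
        self.interval_count = 0
        return self.ewma_rate


# One tracker per failure category keeps startup failures from being
# mixed in with operational ones.
trackers = {"startup": FailureTracker(), "operational": FailureTracker()}
```

Because is_burst() looks only at timestamps inside the window, 5 failures in
5 minutes trigger it regardless of how long the preceding quiet period was,
while ewma_rate gives the slower-moving view for comparison.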
--
This message was sent by Atlassian JIRA
(v6.2#6252)