[ 
https://issues.apache.org/jira/browse/SLIDER-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828884#comment-15828884
 ] 

Gour Saha commented on SLIDER-1188:
-----------------------------------

[~billie.rinaldi] the patch looks good to me. I remember keeping the threshold 
reasonably low to make apps fail fairly fast if the replacement AM would not 
come up. But, for long running services I think such a low threshold does not 
make sense. Hence this change will be very helpful.

> Make AM agent heartbeat loss configurable / increase defaults
> -------------------------------------------------------------
>
>                 Key: SLIDER-1188
>                 URL: https://issues.apache.org/jira/browse/SLIDER-1188
>             Project: Slider
>          Issue Type: Bug
>            Reporter: Billie Rinaldi
>            Assignee: Billie Rinaldi
>         Attachments: SLIDER-1188.1.patch
>
>
> Currently containers are marked as lost after a couple of minutes, which is 
> too sensitive for a busy cluster. We should increase the defaults and make 
> the container timeout configurable. We may also want to increase the number 
> of times the agent will retry heartbeating to the AM.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to