[ https://issues.apache.org/jira/browse/SLIDER-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15828884#comment-15828884 ]
Gour Saha commented on SLIDER-1188: ----------------------------------- [~billie.rinaldi] the patch looks good to me. I remember keeping the threshold reasonably low to make apps fail fairly fast if the replacement AM would not come up. But, for long running services I think such a low threshold does not make sense. Hence this change will be very helpful. > Make AM agent heartbeat loss configurable / increase defaults > ------------------------------------------------------------- > > Key: SLIDER-1188 > URL: https://issues.apache.org/jira/browse/SLIDER-1188 > Project: Slider > Issue Type: Bug > Reporter: Billie Rinaldi > Assignee: Billie Rinaldi > Attachments: SLIDER-1188.1.patch > > > Currently containers are marked as lost after a couple of minutes, which is > too sensitive for a busy cluster. We should increase the defaults and make > the container timeout configurable. We may also want to increase the number > of times the agent will retry heartbeating to the AM. -- This message was sent by Atlassian JIRA (v6.3.4#6332)