[ https://issues.apache.org/jira/browse/SLIDER-1239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

kyungwan nam updated SLIDER-1239:
---------------------------------
    Attachment: SLIDER-1239.patch

Patch attached. It works well on my cluster.

> support to set how many times the command script failure tolerates
> ------------------------------------------------------------------
>
>                 Key: SLIDER-1239
>                 URL: https://issues.apache.org/jira/browse/SLIDER-1239
>             Project: Slider
>          Issue Type: Improvement
>    Affects Versions: Slider 0.92
>            Reporter: kyungwan nam
>            Assignee: kyungwan nam
>         Attachments: SLIDER-1239.patch
>
>
> If slider-appmaster receives a heartbeat from slider-agent that reports
> FAILED, the failed command is retried 3 times.
> Retrying makes sense in situations such as a start command failing because
> a port has not yet been freed.
> However, if the port is assigned dynamically, failures caused by an occupied
> port are rare.
> Also, a component whose workType is BATCH (SLIDER-494) does not need to be
> retried.
> In cases like these, the user wants the container to stop quickly rather
> than retry.
> Currently, the 3 retries are hardcoded.
> It would be good to make the number of tolerated failures configurable.
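
The behaviour requested above can be illustrated with a minimal sketch. This is not Slider's code and not the contents of SLIDER-1239.patch; the class name `CommandFailureTracker`, the `max_retries` setting, and the `RETRY` / `STOP_CONTAINER` return values are all hypothetical, chosen only to show how a configurable limit could replace the hardcoded 3 retries.

```python
from dataclasses import dataclass


@dataclass
class CommandFailureTracker:
    """Counts FAILED heartbeats for one command (hypothetical helper)."""

    # Hypothetical setting; the default of 3 mirrors the hardcoded
    # retry count described in the issue.
    max_retries: int = 3
    failures: int = 0

    def on_failed_heartbeat(self) -> str:
        """Record one FAILED report and decide what to do next."""
        self.failures += 1
        if self.failures > self.max_retries:
            # Tolerance exhausted: stop the container instead of retrying.
            return "STOP_CONTAINER"
        return "RETRY"
```

Under these assumptions, a BATCH component or one with dynamically assigned ports could be configured with `max_retries=0`, so that the first FAILED heartbeat stops the container immediately, while other components keep the default of 3.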