[ https://issues.apache.org/jira/browse/OOZIE-2467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15183618#comment-15183618 ]
Rohini Palaniswamy commented on OOZIE-2467:
-------------------------------------------

bq. int zkSessionTimeout = ConfigurationService.getInt(ZK_SESSION_TIMEOUT, 300 * 1000);

It should be 300 and not 300 * 1000.

> Oozie can shutdown itself on long GC pause
> ------------------------------------------
>
>                 Key: OOZIE-2467
>                 URL: https://issues.apache.org/jira/browse/OOZIE-2467
>             Project: Oozie
>          Issue Type: Bug
>            Reporter: Purshotam Shah
>            Assignee: Purshotam Shah
>         Attachments: OOZIE-2467-V1.patch
>
>
> There are two issues.
> 1. Oozie doesn't set the ZK session timeout, so Curator uses its default
> session timeout, which is 1 min.
> If there is a GC pause of more than 1 min, or GC pauses frequent enough that
> Curator can't send heartbeats, the connection will be dropped and Oozie
> will shut itself down.
> 2. Oozie uses ExponentialBackoffRetry with initial wait = 1 sec and max
> retries = 3,
> so Oozie will only retry 3 times, with waits of 1 sec, 2 sec and 4 sec. We
> need to increase the max retries count to at least 10.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
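
To make the arithmetic in the issue and in the review comment concrete, here is a minimal sketch. It is not Oozie or Curator code: the class `BackoffSketch` and its methods are hypothetical, it assumes a plain doubling backoff (the real retry policy may add jitter or a cap), and it assumes the configured session timeout is expressed in seconds and converted to milliseconds once, which is one reading of why the default should be 300 rather than 300 * 1000.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of Oozie or Curator.
public class BackoffSketch {

    // Nominal waits for a doubling backoff: base, 2*base, 4*base, ...
    static List<Long> nominalWaitsSeconds(long baseSeconds, int maxRetries) {
        List<Long> waits = new ArrayList<>();
        for (int i = 0; i < maxRetries; i++) {
            waits.add(baseSeconds << i);
        }
        return waits;
    }

    // Total time spent waiting before the retries are exhausted.
    static long totalWaitSeconds(long baseSeconds, int maxRetries) {
        long total = 0;
        for (long wait : nominalWaitsSeconds(baseSeconds, maxRetries)) {
            total += wait;
        }
        return total;
    }

    // Assumption: the config value is in seconds and is converted exactly once.
    static long sessionTimeoutMillis(long configuredSeconds) {
        return configuredSeconds * 1000L;
    }

    public static void main(String[] args) {
        // Initial wait 1 sec, 3 retries, as described in the issue.
        System.out.println(nominalWaitsSeconds(1, 3));
        System.out.println(totalWaitSeconds(1, 3));
        // The proposed minimum of 10 retries.
        System.out.println(totalWaitSeconds(1, 10));
        // Default of 300 versus 300 * 1000 under the seconds assumption.
        System.out.println(sessionTimeoutMillis(300));
        System.out.println(sessionTimeoutMillis(300 * 1000));
    }
}
```

Under these assumptions, 3 retries cover about 7 seconds of outage in total, while 10 retries cover roughly 17 minutes. A default of 300 * 1000 read as seconds would yield a session timeout of about 83 hours instead of 5 minutes.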