[ https://issues.apache.org/jira/browse/YARN-3554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14525264#comment-14525264 ]
sandflee commented on YARN-3554:
--------------------------------

Hi [~Naganarasimha],
3 minutes seems dangerous. If the RM fails over and recovery takes several minutes, the NM may kill all containers, which is not expected in a production environment.

> Default value for maximum nodemanager connect wait time is too high
> -------------------------------------------------------------------
>
>                 Key: YARN-3554
>                 URL: https://issues.apache.org/jira/browse/YARN-3554
>             Project: Hadoop YARN
>          Issue Type: Bug
>   Affects Versions: 2.6.0
>            Reporter: Jason Lowe
>            Assignee: Naganarasimha G R
>              Labels: newbie
>         Attachments: YARN-3554-20150429-2.patch, YARN-3554.20150429-1.patch
>
>
> The default value for yarn.client.nodemanager-connect.max-wait-ms is 900000
> msec or 15 minutes, which is way too high. The default container expiry time
> from the RM and the default task timeout in MapReduce are both only 10
> minutes.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)