[ https://issues.apache.org/jira/browse/YARN-3668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14549506#comment-14549506 ]
sandflee commented on YARN-3668: -------------------------------- yes, I agree it's purely a problem of AM,but it seems a bomb in our system, maybe triggered by some factor beyond our control. and we can't afford this risk. > Long run service shouldn't be killed even if Yarn crashed > --------------------------------------------------------- > > Key: YARN-3668 > URL: https://issues.apache.org/jira/browse/YARN-3668 > Project: Hadoop YARN > Issue Type: Wish > Reporter: sandflee > > For long running service, it shouldn't be killed even if all yarn component > crashed, with RM work preserving and NM restart, yarn could take over > applications again. -- This message was sent by Atlassian JIRA (v6.3.4#6332)