[ https://issues.apache.org/jira/browse/YARN-2344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14523788#comment-14523788 ]
Junping Du commented on YARN-2344:
----------------------------------

Assigning to myself to drive it forward.

> ResourceManager should support maintenance model
> ------------------------------------------------
>
>                 Key: YARN-2344
>                 URL: https://issues.apache.org/jira/browse/YARN-2344
>             Project: Hadoop YARN
>          Issue Type: New Feature
>          Components: resourcemanager
>            Reporter: Hemanth Yamijala
>            Assignee: Junping Du
>
> We've seen scenarios when we have needed to stop the namenode for a
> maintenance activity. In such scenarios, if the jobtracker (JT) continues to
> run, jobs would fail due to initialization or task failures (due to DFS). We
> could restart the JT enabling job recovery, during such scenarios. But
> restart has proved to be a very intrusive activity, particularly if the JT is
> not at fault itself and does not require a restart. The ask is for an
> admin-controlled feature to pause the JT which would take it to a state
> somewhat analogous to the safe mode of DFS.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)