[ https://issues.apache.org/jira/browse/YARN-149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13715333#comment-13715333 ]
Karthik Kambatla commented on YARN-149: --------------------------------------- Thanks for posting this. Main comment: IIUC, the approach proposes that, right from the beginning (phase 1), each service in the RM should be aware of the Active/Standby HA states and behave accordingly (different state machines?). While starting all services immediately and waiting for transition to Active might be the correct approach eventually, for simplicity in the first implementation, should we start these services on transition to active? > ResourceManager (RM) High-Availability (HA) > ------------------------------------------- > > Key: YARN-149 > URL: https://issues.apache.org/jira/browse/YARN-149 > Project: Hadoop YARN > Issue Type: New Feature > Reporter: Harsh J > Assignee: Bikas Saha > Attachments: rm-ha-phase1-approach-draft1.pdf, > rm-ha-phase1-draft2.pdf, YARN ResourceManager Automatic > Failover-rev-07-21-13.pdf > > > This jira tracks work needed to be done to support one RM instance failing > over to another RM instance so that we can have RM HA. Work includes leader > election, transfer of control to leader and client re-direction to new leader. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira