[ https://issues.apache.org/jira/browse/YARN-149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13806917#comment-13806917 ]

Zhijie Shen commented on YARN-149:
----------------------------------

Is this a random test failure related to some ZKRMStateStore patch?

https://builds.apache.org/job/PreCommit-YARN-Build/2291//testReport/org.apache.hadoop.yarn.server.resourcemanager.recovery/TestZKRMStateStoreZKClientConnections/testZKClientDisconnectAndReconnect/

> ResourceManager (RM) High-Availability (HA)
> -------------------------------------------
>
>                 Key: YARN-149
>                 URL: https://issues.apache.org/jira/browse/YARN-149
>             Project: Hadoop YARN
>          Issue Type: New Feature
>            Reporter: Harsh J
>            Assignee: Bikas Saha
>         Attachments: rm-ha-phase1-approach-draft1.pdf, rm-ha-phase1-draft2.pdf, YARN ResourceManager Automatic Failover-rev-07-21-13.pdf, YARN ResourceManager Automatic Failover-rev-08-04-13.pdf
>
>
> This jira tracks work needed to be done to support one RM instance failing
> over to another RM instance so that we can have RM HA. Work includes leader
> election, transfer of control to leader and client re-direction to new leader.