[ https://issues.apache.org/jira/browse/HDFS-2185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Todd Lipcon updated HDFS-2185: ------------------------------ Attachment: hdfs-2185.txt Attached patch is for the HDFS side after splitting out the common components. It includes a simple unit test which makes sure failover occurs when the NNs shut themselves down. I'll continue to add test cases and also run some cluster tests while this (and its prereqs HADOOP-8206 and HADOOP-8212) are under review. > HA: HDFS portion of ZK-based FailoverController > ----------------------------------------------- > > Key: HDFS-2185 > URL: https://issues.apache.org/jira/browse/HDFS-2185 > Project: Hadoop HDFS > Issue Type: Sub-task > Components: ha > Affects Versions: HA branch (HDFS-1623) > Reporter: Eli Collins > Assignee: Todd Lipcon > Attachments: Failover_Controller.jpg, hdfs-2185.txt, hdfs-2185.txt > > > This jira is for a ZK-based FailoverController daemon. The FailoverController > is a separate daemon from the NN that does the following: > * Initiates leader election (via ZK) when necessary > * Performs health monitoring (aka failure detection) > * Performs fail-over (standby to active and active to standby transitions) > * Heartbeats to ensure the liveness > It should have the same/similar interface as the Linux HA RM to aid > pluggability. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira