[ https://issues.apache.org/jira/browse/HBASE-3142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12929829#action_12929829 ]
Jonathan Gray commented on HBASE-3142: -------------------------------------- If this had thrown an exception, he would have looked up new master location and connected there. > If a master dies and comes back up before his znode expires, the RS heartbeat > can lock up > ----------------------------------------------------------------------------------------- > > Key: HBASE-3142 > URL: https://issues.apache.org/jira/browse/HBASE-3142 > Project: HBase > Issue Type: Bug > Components: master, regionserver > Affects Versions: 0.89.20100924, 0.90.0 > Reporter: Jonathan Gray > Assignee: ryan rawson > Priority: Critical > Fix For: 0.90.0 > > > During a rolling restart, we ran into a case where a master was shutdown and > then brought back up before the znode expired. > On the RS side, while the master was down, it was getting ConnectionRefused > exceptions trying to heartbeat to what it thinks is the active master. > Once the master process comes back up, the next heartbeat done by all the RSs > just blocks indefinitely. > This is somewhat related to HBASE-3141 -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.