[ 
https://issues.apache.org/jira/browse/HBASE-3142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12929829#action_12929829
 ] 

Jonathan Gray commented on HBASE-3142:
--------------------------------------

If the heartbeat had thrown an exception, the RS would have looked up the new 
master location and connected there.
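
To illustrate the behaviour described in the comment above, here is a minimal 
sketch, not the actual HBase code or the fix applied for this issue. It 
assumes the heartbeat call can be bounded by a timeout so that a blocked call 
is treated like a thrown exception and triggers a fresh master lookup. The 
names HeartbeatSketch, MasterRpc, heartbeatOnce and lookupMaster are 
hypothetical.

```java
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;
import java.util.function.Supplier;

final class HeartbeatSketch {

    /** Hypothetical stand-in for the RS-to-master heartbeat RPC. */
    interface MasterRpc {
        void heartbeat(String masterAddress) throws Exception;
    }

    /**
     * Sends one heartbeat to cachedMaster, waiting at most timeoutMs.
     * Returns the address to use for the next heartbeat: the cached one on
     * success, or a freshly looked-up one if the call failed or timed out.
     * lookupMaster is an assumed hook (for example, reading the master znode).
     */
    static String heartbeatOnce(String cachedMaster,
                                MasterRpc rpc,
                                Supplier<String> lookupMaster,
                                long timeoutMs) {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        Future<Object> call = pool.submit(() -> {
            rpc.heartbeat(cachedMaster);
            return null;
        });
        try {
            call.get(timeoutMs, TimeUnit.MILLISECONDS);
            return cachedMaster;               // heartbeat succeeded
        } catch (TimeoutException | ExecutionException e) {
            // A blocked call is handled the same way as a thrown exception:
            // give up on the cached address and look the master up again.
            call.cancel(true);
            return lookupMaster.get();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve interrupt status
            return cachedMaster;
        } finally {
            // Interrupts the worker thread; real blocking socket I/O may
            // additionally need a socket-level timeout to unblock.
            pool.shutdownNow();
        }
    }
}
```

With a bound like this, a heartbeat that hangs against the restarted master 
falls through to the lookup path instead of blocking indefinitely.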

> If a master dies and comes back up before his znode expires, the RS heartbeat 
> can lock up
> -----------------------------------------------------------------------------------------
>
>                 Key: HBASE-3142
>                 URL: https://issues.apache.org/jira/browse/HBASE-3142
>             Project: HBase
>          Issue Type: Bug
>          Components: master, regionserver
>    Affects Versions: 0.89.20100924, 0.90.0
>            Reporter: Jonathan Gray
>            Assignee: ryan rawson
>            Priority: Critical
>             Fix For: 0.90.0
>
>
> During a rolling restart, we ran into a case where a master was shut down and 
> then brought back up before the znode expired.
> On the RS side, while the master was down, it was getting ConnectionRefused 
> exceptions trying to heartbeat to what it thinks is the active master.
> Once the master process comes back up, the next heartbeat done by all the RSs 
> just blocks indefinitely.
> This is somewhat related to HBASE-3141.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
