[ https://issues.apache.org/jira/browse/HBASE-5916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13267232#comment-13267232 ]
stack commented on HBASE-5916: ------------------------------ Well, thats useful, right? Its useful in case where a regionserver crashes and a new one comes up fast, before the original regionserver's znode has expired in zk. We shouldn't remove it. On startup, you should not get this exception unless you have a condition like that described above where there was a regionserver on same host and port registered previously in the master and then a new regionserver comes in w/ same host and port but with different startcode? > RS restart just before master intialization we make the cluster non operative > ----------------------------------------------------------------------------- > > Key: HBASE-5916 > URL: https://issues.apache.org/jira/browse/HBASE-5916 > Project: HBase > Issue Type: Bug > Affects Versions: 0.92.1, 0.94.0 > Reporter: ramkrishna.s.vasudevan > Assignee: ramkrishna.s.vasudevan > Priority: Critical > Fix For: 0.94.1 > > > Consider a case where my master is getting restarted. RS that was alive when > the master restart started, gets restarted before the master initializes the > ServerShutDownHandler. > {code} > serverShutdownHandlerEnabled = true; > {code} > In this case when the RS tries to register with the master, the master will > try to expire the server but the server cannot be expired as still the > serverShutdownHandler is not enabled. > This case may happen when i have only one RS gets restarted or all the RS > gets restarted at the same time.(before assignRootandMeta). > {code} > LOG.info(message); > if (existingServer.getStartcode() < serverName.getStartcode()) { > LOG.info("Triggering server recovery; existingServer " + > existingServer + " looks stale, new server:" + serverName); > expireServer(existingServer); > } > {code} > If another RS is brought up then the cluster comes back to normalcy. > May be a very corner case. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira