[ https://issues.apache.org/jira/browse/HBASE-4124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13089890#comment-13089890 ]
gaojinchao commented on HBASE-4124: ----------------------------------- RS isn't dead. I can reproduce and verify it. ZK status has changed before adding to RIT set. You can look the function processDeadServers. That is the reason why a region is assigned twice. // If region was in transition (was in zk) force it offline for reassign try { //Process with existing RS shutdown code boolean assign = ServerShutdownHandler.processDeadRegion(regionInfo, result, this, this.catalogTracker); if (assign) { ZKAssign.createOrForceNodeOffline(watcher, regionInfo, master.getServerName()); } > ZK restarted while assigning a region, new active HM re-assign it but the RS > warned 'already online on this server'. > -------------------------------------------------------------------------------------------------------------------- > > Key: HBASE-4124 > URL: https://issues.apache.org/jira/browse/HBASE-4124 > Project: HBase > Issue Type: Bug > Components: master > Reporter: fulin wang > Assignee: gaojinchao > Fix For: 0.90.5 > > Attachments: HBASE-4124_Branch90V1_trial.patch, > HBASE-4124_Branch90V2.patch, log.txt > > Original Estimate: 0.4h > Remaining Estimate: 0.4h > > ZK restarted while assigning a region, new active HM re-assign it but the RS > warned 'already online on this server'. > Issue: > The RS failed besause of 'already online on this server' and return; The HM > can not receive the message and report 'Regions in transition timed out'. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira