[ https://issues.apache.org/jira/browse/HBASE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13555213#comment-13555213 ]
Himanshu Vashishtha commented on HBASE-2611:
--------------------------------------------

bq. But what can happen is that the region server who wins the race to take over the dead region server's queues could die before it even manages to call multi.

I don't follow the question. How can a region server win the race before calling multi? If region server "A" fails, *all* of the other region servers will call multi to do the failover, and only one (let's say "B") will succeed. If B then also dies (after it has succeeded in transferring the queues, from ZooKeeper's perspective), the region server doing the failover for B will also process A's znodes, since they now belong to B. Therefore, I don't see why we need a retry. Did I miss anything?

> Handle RS that fails while processing the failure of another one
> ----------------------------------------------------------------
>
>                 Key: HBASE-2611
>                 URL: https://issues.apache.org/jira/browse/HBASE-2611
>             Project: HBase
>          Issue Type: Sub-task
>          Components: Replication
>            Reporter: Jean-Daniel Cryans
>            Assignee: Himanshu Vashishtha
>             Fix For: 0.94.5
>
>         Attachments: HBase-2611-upstream-v1.patch, HBASE-2611-v2.patch
>
>
> HBASE-2223 doesn't manage region servers that fail while doing the transfer
> of HLogs queues from other region servers that failed. Devise a reliable way
> to do it.
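
The failover argument in the comment above can be illustrated with a small in-memory model. This is only a sketch under stated assumptions: `FakeZk`, `multi_claim`, `fail_over`, and the queue-naming scheme are hypothetical names invented for illustration, and none of this is the ZooKeeper or HBase API. It assumes that a claim is all-or-nothing, which is the property the comment relies on when it refers to multi.

```python
class FakeZk:
    """Tiny in-memory stand-in for the replication znodes (NOT the ZooKeeper API)."""

    def __init__(self, queues):
        # Maps a live region server name to the list of queue names it owns.
        self.queues = {rs: list(qs) for rs, qs in queues.items()}

    def multi_claim(self, dead, claimer):
        """Atomically move every queue of `dead` under `claimer`.

        Returns True if this caller won the race, and False if the dead
        server's entry is already gone (another server moved it) or the
        claimer itself is no longer registered.
        """
        if dead not in self.queues or claimer not in self.queues:
            return False  # all-or-nothing: nothing changes on failure
        # Hypothetical naming: append the dead server's name to each queue.
        moved = [f"{q}-{dead}" for q in self.queues.pop(dead)]
        self.queues[claimer].extend(moved)
        return True


def fail_over(zk, dead, survivors):
    """Every survivor attempts the claim; return the names of those that succeeded."""
    return [rs for rs in survivors if zk.multi_claim(dead, rs)]


zk = FakeZk({"A": ["peer1"], "B": ["peer1"], "C": ["peer1"]})

# A fails: B and C both try, and exactly one of them can succeed.
winners_a = fail_over(zk, "A", ["B", "C"])

# B fails after its claim succeeded: whoever fails over B also receives
# the queues B had inherited from A.
winners_b = fail_over(zk, "B", ["C"])
```

In this model the queues that originally belonged to A end up under C without any explicit retry, because they were already part of B's state when B failed. The sketch does not model a server that dies before it issues its claim.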