[ https://issues.apache.org/jira/browse/HBASE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13554588#comment-13554588 ]
Chris Trezzo commented on HBASE-2611: ------------------------------------- Also, I don't think your manual test described above hits this corner case. You need at least two region server failures for this to happen. For example, region server "A" fails, region server "B" races and wins the failover of A, and then region server B fails before it finishes copying A's queue to it's own queue. Then when someone picks up B, A's original queue will not get completely replicated. Thanks for working on this though! It is a tricky one. > Handle RS that fails while processing the failure of another one > ---------------------------------------------------------------- > > Key: HBASE-2611 > URL: https://issues.apache.org/jira/browse/HBASE-2611 > Project: HBase > Issue Type: Sub-task > Components: Replication > Reporter: Jean-Daniel Cryans > Assignee: Himanshu Vashishtha > Fix For: 0.94.5 > > Attachments: HBase-2611-upstream-v1.patch, HBASE-2611-v2.patch > > > HBASE-2223 doesn't manage region servers that fail while doing the transfer > of HLogs queues from other region servers that failed. Devise a reliable way > to do it. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira