[jira] [Commented] (HBASE-2611) Handle RS that fails while processing the failure of another one

Himanshu Vashishtha (JIRA) Wed, 16 Jan 2013 09:06:20 -0800

    [ 
https://issues.apache.org/jira/browse/HBASE-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13555213#comment-13555213
 ]


Himanshu Vashishtha commented on HBASE-2611:
--------------------------------------------

bq. But what can happen is that the region server who wins the race to take 
over the dead region server's queues could die before it even manages to call 
multi.
I'm not following the question. How can a regionserver win the race before 
calling multi? If regionserver "A" fails, *all* surviving regionservers will 
call multi to do the failover, and only one (let's say "B") will succeed. Now, 
if B also dies in the meantime (after it has succeeded in transferring the 
queues from zk's perspective), the regionserver doing the failover for B will 
also process A's znodes (as they are under B now). Therefore, I don't see that 
we really need a retry. 
Did I miss anything?
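
To make the argument concrete, here is a minimal in-memory sketch of the 
scenario. It is not the patch and does not use the ZooKeeper or HBase APIs: 
the class QueueFailoverSketch, its methods, and the "-<deadServer>" suffix 
are hypothetical names chosen for illustration, and a synchronized method 
stands in for the all-or-nothing multi.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** In-memory stand-in for the replication znodes; not the HBase or ZooKeeper API. */
public class QueueFailoverSketch {
    // server name -> queue ids currently stored under that server's (simulated) znode
    private final Map<String, List<String>> queuesByServer = new HashMap<>();

    public synchronized void addQueue(String server, String queueId) {
        queuesByServer.computeIfAbsent(server, s -> new ArrayList<>()).add(queueId);
    }

    /**
     * Models one all-or-nothing multi: move every queue of deadServer under
     * claimer and delete deadServer's entry. Returns false if another server
     * has already claimed the queues.
     */
    public synchronized boolean claimQueues(String deadServer, String claimer) {
        List<String> queues = queuesByServer.remove(deadServer);
        if (queues == null) {
            return false; // lost the race: nothing left to claim
        }
        List<String> mine = queuesByServer.computeIfAbsent(claimer, s -> new ArrayList<>());
        for (String q : queues) {
            mine.add(q + "-" + deadServer); // suffix records the previous owner (assumed naming)
        }
        return true;
    }

    public synchronized List<String> queuesOf(String server) {
        return new ArrayList<>(queuesByServer.getOrDefault(server, new ArrayList<>()));
    }

    public static void main(String[] args) {
        QueueFailoverSketch zk = new QueueFailoverSketch();
        zk.addQueue("A", "peer1");
        zk.addQueue("B", "peer1");

        // A dies: B and C both attempt the claim, only the first one succeeds.
        System.out.println("B claims A: " + zk.claimQueues("A", "B"));
        System.out.println("C claims A: " + zk.claimQueues("A", "C"));

        // B dies after its claim succeeded: C's failover of B also picks up A's queue.
        System.out.println("C claims B: " + zk.claimQueues("B", "C"));
        System.out.println("C now owns: " + zk.queuesOf("C"));
    }
}
```

In this model, once B's claim has succeeded, A's queue lives under B, so 
whoever later claims B's queues inherits it without any retry logic.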

                
> Handle RS that fails while processing the failure of another one
> ----------------------------------------------------------------
>
>                 Key: HBASE-2611
>                 URL: https://issues.apache.org/jira/browse/HBASE-2611
>             Project: HBase
>          Issue Type: Sub-task
>          Components: Replication
>            Reporter: Jean-Daniel Cryans
>            Assignee: Himanshu Vashishtha
>             Fix For: 0.94.5
>
>         Attachments: HBase-2611-upstream-v1.patch, HBASE-2611-v2.patch
>
>
> HBASE-2223 doesn't manage region servers that fail while doing the transfer 
> of HLogs queues from other region servers that failed. Devise a reliable way 
> to do it.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
