[jira] [Commented] (ZOOKEEPER-1032) speed up recovery from leader failure

Mahadev konar (JIRA) Sun, 27 Mar 2011 23:18:48 -0700

    [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13011920#comment-13011920
 ]


Mahadev konar commented on ZOOKEEPER-1032:
------------------------------------------

added to 3.4 for tracking.

jiangwen,
  good suggestions. Are you working on any of these? 1) is a little tricky 
since we cannot proceed without having a recoverable state on the disk, so 
asynchronous snapshot is not an option. 


> speed up recovery from leader failure
> -------------------------------------
>
>                 Key: ZOOKEEPER-1032
>                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1032
>             Project: ZooKeeper
>          Issue Type: Improvement
>          Components: server
>            Reporter: jiangwen wei
>             Fix For: 3.4.0
>
>
> when the number of nodes is large, it may take a long time to recover from 
> leader failure
> there are some points to improve:
> 1. Follower should take snapshot asynchronously when follower up to date
> 2. Currently Leader/Follower will clear the DataTree on leader failures, and 
> then restore it from a snapshot and transaction logs. DataTree should not be 
> cleared, only restore it from transaction logs.
> 3. FileTxnLog should store recently transaction logs in memory, so when 
> DataTree is not behind the transaction logs a lot, the transaction logs in 
> memory can be used to restore DataTree.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (ZOOKEEPER-1032) speed up recovery from leader failure

Reply via email to