[ https://issues.apache.org/jira/browse/ZOOKEEPER-1032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13011981#comment-13011981 ]
Flavio Junqueira commented on ZOOKEEPER-1032: --------------------------------------------- Thanks for responding. I don't understand your answer for point (2), so I'll have to review the code to try to understand what you're proposing. > speed up recovery from leader failure > ------------------------------------- > > Key: ZOOKEEPER-1032 > URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1032 > Project: ZooKeeper > Issue Type: Improvement > Components: server > Reporter: jiangwen wei > Fix For: 3.4.0 > > > when the number of nodes is large, it may take a long time to recover from > leader failure > there are some points to improve: > 1. Follower should take snapshot asynchronously when follower up to date > 2. Currently Leader/Follower will clear the DataTree on leader failures, and > then restore it from a snapshot and transaction logs. DataTree should not be > cleared, only restore it from transaction logs. > 3. FileTxnLog should store recently transaction logs in memory, so when > DataTree is not behind the transaction logs a lot, the transaction logs in > memory can be used to restore DataTree. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira