[ https://issues.apache.org/jira/browse/SOLR-8227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14995021#comment-14995021 ]
Yonik Seeley commented on SOLR-8227: ------------------------------------ There is coordination between replica and leader for recovery to happen seamlessly... The replica tells the leader it is going to recover, so the leader starts forwarding updates to the replica (which the replica starts buffering), and *then* does a hard commit that the replica can use to copy from. This ensures overlap so no updates are lost. Introducing a 3rd party into this would really complicate things, and it's not clear how we would maintain that same overlap to ensure no updates are lost. > Recovering replicas should be able to recover from any active replica > --------------------------------------------------------------------- > > Key: SOLR-8227 > URL: https://issues.apache.org/jira/browse/SOLR-8227 > Project: Solr > Issue Type: Improvement > Reporter: Varun Thacker > > Currently when a replica goes into recovery it uses the leader to recover. It > first tries to do a PeerSync. If thats not successful it does a > replication. Most of the times it ends up doing a full replication because > segment merging, autoCommits causing segments to be formed differently on the > replicas ( We should explore improving that in another issue ) . > But when many replicas are recovering and hitting the leader, the leader can > become a bottleneck. Since Solr is a CP system , we should be able to recover > from any of the 'active' replicas instead of just the leader. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org