Re: Replication opens new SegmentReader for all segments which deem unnecessary

Shawn Heisey Wed, 23 Nov 2022 11:27:25 -0800

On 11/23/22 11:49, Patson Luk wrote:

We are testing multiple replica setup here (1 NRT + 1 PULL) and noticed
that CPU consumption for replication is unreasonably high. Profiling shows
that `SolrCore#openNewSearcher` triggered from `IndexFetcher` takes much
more CPU time than the same method triggered from regular commits.

NB: A collection with 1 NRT replica and 1 PULL replica is not faulttolerant. In the event you lose the NRT replica, the PULL replicacannot become leader, so even if the collection stays online for queries(and I am not sure that it would), it will not be possible to updateit. If you want full fault tolerance, you need at least two replicasthat are either NRT or TLOG, and the rest can be PULL.

Debugging shows that when `SolrCore#openNewSearcher` is triggered from
`IndexFetcher`, it opens a new `SegmentReader` for every single fragment
for the updated collection. As a new `IndexWriter`, which keeps a
`ReaderPool`, is instantiated for each replication. And such pool is not
reused nor previous segment readers are carried over.

I suspect that in the case of a commit on NRT, Lucene can re-useSegmentReader instances for segments that did not change, because all ofthat is entirely at the Lucene level, so the new Lucene searcher knowsabout existing segments in the old Lucene searcher.

But with replication (which TLOG when not leader and PULL replicasutilize), the files are handled at the Solr level, and then the index ispassed to the Lucene level. Solr does not know that file x is relatedto an existing segment, because all that is handled at the Lucenelevel. Replication *CAN* involve replacing every single file in theindex ... so I am pretty sure that Solr must ask Lucene to load thenewly replicated index from scratch and not use the existing Lucenesearcher, and that means that Lucene must create all new SegmentReaderinstances. I don't think Solr can safely ask Lucene to re-use its oldsearcher on a replicated index. Even if that is possible, I imaginethat implementing it would take some very involved code that might breakwith every new Lucene version that Solr upgrades to.

Details in this ticket https://issues.apache.org/jira/browse/SOLR-16560.

This should have been discussed on the users mailing list beforecreating the issue. I know you are alluding to potential changes toSolr code, but it's not yet time for a dev list discussion on this.


Thanks,
Shawn


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: Replication opens new SegmentReader for all segments which deem unnecessary

Reply via email to