Re: firstSearcher queries to warmup a brand new core

Shawn Heisey Mon, 08 Nov 2021 14:45:20 -0800

On 11/8/21 2:05 PM, Nick Vladiceanu wrote:

Ok, makes sense. However, when the core is initially created, the data is not 
yet there. Running the firstSearcher queries against empty index won’t have any 
beneficial effects when it comes to cache warming. Is there any way to open the 
first searcher after the data is pulled from the leader, and therefore, run the 
warmup queries? What’s the point of opening the first searcher when initially 
the core is created, if there is no data?

For the purposes of things like warming queries, the searcher isn'taware that the index is empty when it starts. It just knows when it isthe first searcher, and when that is the case, it runs any configuredfirstSearcher queries. Making it aware of something like that for thepurposes of avoiding such queries is possible, but that would add a lotof complexity. Bugs are more likely as the code gets more complex. AndI would strongly argue that any benefits of added complexity in such animportant piece of code do not outweigh the risks.

If this really concerns you, just have your indexing software reload thecore after the index is built, so firstSearcher queries are executedagain. If the list of firstSearcher queries takes a long time to run,just set useColdSearcher to true, and the searcher will be made readyfor queries before the warming queries are executed. I don't rememberwhere in solrconfig.xml that config is.

What I will generally recommend that people do is define a set ofqueries in firstSearcher that will do initial warming on a completelycold index, set useColdSearcher to true, and mostly rely on cacheautowarming after that. If cache autowarming doesn't do a good job,then there are some possible remedies:


1) Add more memory so the OS can cache the index better.
2) Change the cache autowarming config.
3) Define some queries in newSearcher.
  newSearcher is usually a smaller list than firstSearcher.

A lot of performance issues are cured by adding more memory so the OScan cache the index better. Good index caching is critical to gettinggood performance out of any Lucene based software, which includes Solr. Note that I am not talking about heap size -- I am talking about memorythat is not allocated to any program.

When building the list of firstSearcher queries, the idea is to beginthe process of populating the OS disk cache and Solr's caches, not torun every possible query variation users are likely to create. If youend up with more than a handful of queries in that list, it's probablytoo long.


Thanks,
Shawn

Re: firstSearcher queries to warmup a brand new core

Reply via email to