Slow searching limited but high rows across many shards all with high hits

Per Steffensen Thu, 13 Nov 2014 03:00:33 -0800

Hi

* Using Solr 4.4.0

* Up to 1000 shards total - spread across about 20-40 Solr servers on 20-40 machines

Searching "limited but high rows across many shards all with high hits" is slow

E.g.
* Query from outside client: q=content:something&rows=1000
* Resulting in sub-requests to each shard, something along these lines
** 1) q=content:something&rows=1000&fl=id,score
** 2) Request the full documents with ids in the global top-1000, found among the top-1000 from each shard
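
The two-phase flow above can be sketched roughly as follows. This is a simplified in-memory model in Python, not Solr's actual implementation; the function names (shard_top_k, merge_global_top_k, group_ids_by_shard) and the toy shard data are hypothetical and only serve to illustrate the id/score pass followed by the full-document fetch.

```python
import heapq
from collections import defaultdict


def shard_top_k(shard_docs, k):
    """Phase 1 on one shard: return the k best (score, doc_id) pairs."""
    pairs = ((score, doc_id) for doc_id, score in shard_docs.items())
    return heapq.nlargest(k, pairs)


def merge_global_top_k(per_shard_results, k):
    """Coordinator: merge all per-shard top-k lists into one global top-k."""
    candidates = (
        (score, doc_id, shard)
        for shard, results in per_shard_results.items()
        for score, doc_id in results
    )
    return heapq.nlargest(k, candidates)


def group_ids_by_shard(global_top):
    """Phase 2 planning: which ids must be fetched in full from which shard."""
    wanted = defaultdict(list)
    for _score, doc_id, shard in global_top:
        wanted[shard].append(doc_id)
    return dict(wanted)


# Hypothetical toy data: shard name -> {doc id: score}
shards = {
    "shard1": {"a": 0.9, "b": 0.4, "c": 0.1},
    "shard2": {"d": 0.8, "e": 0.7, "f": 0.2},
}
rows = 3

# Phase 1: every shard returns its own top `rows` (id, score) entries.
phase1 = {name: shard_top_k(docs, rows) for name, docs in shards.items()}
# The coordinator keeps only the global top `rows` of all returned entries.
global_top = merge_global_top_k(phase1, rows)
# Phase 2: full documents are requested only for the ids that survived.
fetch_plan = group_ids_by_shard(global_top)
```

Note that in this model every shard has to produce and ship `rows` entries in phase 1, even though most of them are thrown away by the merge.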


Interpretation
* "limited but high rows" means 1000 in the example above
* "many shards" means 200-1000 in our case
* "all with high hits" means that each of the shards has a significant number of hits on the query (q-param)
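
To put rough numbers on this, here is a small back-of-the-envelope calculation in Python. It assumes (consistent with "all with high hits") that every shard actually returns the full rows entries in the first sub-request; the helper names are hypothetical.

```python
def phase1_candidates(num_shards, rows):
    """Total (id, score) entries sent to the coordinator in the first phase."""
    return num_shards * rows


def discarded_fraction(num_shards, rows):
    """Share of first-phase entries that do not make the global top `rows`."""
    total = phase1_candidates(num_shards, rows)
    return (total - rows) / total


for num_shards in (200, 1000):
    total = phase1_candidates(num_shards, 1000)
    share = discarded_fraction(num_shards, 1000)
    print(f"{num_shards} shards: {total} candidates, {share:.1%} discarded")
```

With rows=1000 this gives 200,000 candidate entries for 200 shards and 1,000,000 for 1000 shards, of which only 1000 end up in the final result.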

Doing such a query on our system takes between 5 minutes and 1 hour, depending on a lot of things. We have profiled and built our own PoC solution that brings the response time down to between 5 seconds and 1 minute (about a factor of 60 faster), without requiring nearly as many resources from the system while performing the search. Of course we want a solution that can go into production. We have to either mature our PoC solution and use that, or adopt an existing solution from the newest Solr release.

Do any of you know whether there is a solution to this "problem" in the newest Solr release?


Regards, Per Steffensen


