Varun Thacker created SOLR-12674:
------------------------------------

             Summary: RollupStream should not use the HashQueryParser for 1 worker
                 Key: SOLR-12674
                 URL: https://issues.apache.org/jira/browse/SOLR-12674
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
            Reporter: Varun Thacker


Let's say I have a dataset of 100M documents.

After applying a filter, the number of documents found would be 5k, so it's tiny.

If I do a search and a rollup stream, the query returns in the 200ms range.

But if by mistake I add the "partitionKeys" param to the search stream, the hash 
query parser is invoked, which runs on the entire document set, and the query 
time spikes up to 7 seconds.

If we aren't providing a parallel stream, we should ignore the partitionKeys 
param.

Sample Query:
{code:java}
rollup(search(gettingstarted,q="*:*",fl="id",sort="id desc",partitionKeys="id"),over="id"){code}
Because of the partitionKeys the underlying query formed is:
{code:java}
params={q=*:*&distrib=false&fl=id&sort=id+desc&partitionKeys=id&fq={!hash+workers%3D1+worker%3D0}&wt=json&version=2.2} hits=2 status=0 QTime=30{code}
This is a dummy dataset, so ignore the hits and QTime, but this query 
certainly doesn't need to add the hash query parser filter clause for workers=1.
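
One way the proposed behaviour could look is sketched below. This is only an illustration, not the actual Solr code: the class PartitionFilterSketch and the method shardParams are hypothetical names, and the real stream classes build their shard requests differently. The sketch adds the {!hash} filter clause seen in the log line above only when partitionKeys is set and more than one worker is in use.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch, not Solr's implementation: decides whether the
// {!hash} filter query should be added to the per-shard request params.
final class PartitionFilterSketch {

    // Returns a copy of the stream params as they would be sent to a shard.
    // numWorkers and workerId are assumed to come from the enclosing
    // parallel stream (1 and 0 when there is no parallel stream).
    static Map<String, String> shardParams(Map<String, String> streamParams,
                                           int numWorkers, int workerId) {
        Map<String, String> params = new LinkedHashMap<>(streamParams);
        params.put("distrib", "false");

        String partitionKeys = params.get("partitionKeys");
        // With a single worker every document belongs to that worker, so the
        // hash filter does no useful partitioning and is skipped.
        if (partitionKeys != null && numWorkers > 1) {
            params.put("fq", "{!hash workers=" + numWorkers
                    + " worker=" + workerId + "}");
        }
        return params;
    }

    public static void main(String[] args) {
        Map<String, String> p = new LinkedHashMap<>();
        p.put("q", "*:*");
        p.put("fl", "id");
        p.put("sort", "id desc");
        p.put("partitionKeys", "id");

        // One worker: no fq entry is expected in the result.
        System.out.println(shardParams(p, 1, 0).get("fq"));
        // Four workers: the hash filter for worker 2 is expected.
        System.out.println(shardParams(p, 4, 2).get("fq"));
    }
}
```

The sketch leaves the partitionKeys param itself in place and only drops the filter clause; removing the param entirely for a single worker would be another option.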



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
