proposal on distributed search
------------------------------
Key: SOLR-1524
URL: https://issues.apache.org/jira/browse/SOLR-1524
Project: Solr
Issue Type: Sub-task
Components: clients - java
Reporter: johnson.hong
Hi,all.
some days ago,I put a question that "why it gets slower while keep on
increasing the start value accross distributed search ?".
And one reply by Shalin Shekhar Mangar was "distributed search fetches
start+rows documents from each shard in order to correctly merge the results".
After this I read the source code,and I found query across distributed
search would be separated into nshards' query.
Each shard query will be excute as follows:
1.get ids of matched documents into DocList //take little time
2.get all documents by id which result in step 1. //take little time
3.write all the documents found to binary string
4.parse the binary string back to SolrDocumentList // step 4 take
almost all the time used
From above,I propose :It is not necessary to execute step 3 and step 4 even
step 2,the ids found is enough to merge results,isn't it?
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.