[ https://issues.apache.org/jira/browse/SOLR-303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12611372#action_12611372 ]

Yonik Seeley commented on SOLR-303:
-----------------------------------

{quote}
http://localhost:8983/solr/select?shards=[4 shards]&q=*:*&start=5000&rows=1000
Seems to request &rows=6000 from all the shards?
{quote}

It's a feature.

To retrieve documents 5000-6000, one must find the first 6000 documents, then 
take the last 1000.
Since it's possible that all of the top 6000 documents come from a single shard, 
the top 6000 documents must be collected from each shard and merged.
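
As a rough illustration of that merge step (a sketch only, not Solr's actual 
code; the name merge_page and the (score, doc_id) tuples are made up for this 
example), assume each shard returns its hits sorted best-first and truncated 
to start+rows:

```python
import heapq
from itertools import islice


def merge_page(shard_results, start, rows):
    """Merge per-shard hit lists and return the requested page.

    shard_results: one list per shard of (score, doc_id) tuples, sorted by
    descending score, each holding that shard's top (start + rows) hits.
    """
    # heapq.merge expects ascending input, so order by negated score.
    merged = heapq.merge(*shard_results, key=lambda hit: -hit[0])
    return list(islice(merged, start, start + rows))


# Hypothetical data: every hit on shard a outscores every hit on shard b.
shard_a = [(100 - i, "a%d" % i) for i in range(10)]
shard_b = [(50 - i, "b%d" % i) for i in range(10)]

# start=3, rows=3: each shard has to contribute its top 6.
page = merge_page([shard_a[:6], shard_b[:6]], start=3, rows=3)
```

Here page holds a3, a4 and a5. If each shard had only been asked for its top 3, 
the same merge would have returned b0, b1 and b2 instead, which is why the 
per-shard request grows with start.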

There are alternatives:
1) Optimistically request fewer than 6000 documents per shard and re-query if 
the guess turns out to be too low
2) Add an optional mode that treats documents across shards in the same 
position as equal, so if you had 10 shards, you would simply get the top 100 
docs starting at 500.  This might be OK for some applications.
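
Neither alternative is implemented in the patch as far as this comment says; 
the following is only a sketch of how they might look. The fetch callback, 
the doubling of the guess, and both function names are assumptions made for 
illustration, and ties in score are resolved arbitrarily:

```python
import heapq
from itertools import islice


def optimistic_page(fetch, num_shards, start, rows, guess):
    """Alternative 1: ask each shard for `guess` hits, re-query if too few.

    fetch(shard, n) is a hypothetical callback returning that shard's top n
    hits as (score, doc_id) tuples, best first.
    """
    needed = start + rows
    if needed == 0:
        return []
    n = max(1, min(guess, needed))
    while True:
        results = [fetch(shard, n) for shard in range(num_shards)]
        merged = list(
            islice(heapq.merge(*results, key=lambda hit: -hit[0]), needed)
        )
        # A shard that returned exactly n hits may be holding back more,
        # none of them scoring higher than the last hit it returned.
        tails = [r[-1][0] for r in results if len(r) == n]
        if n >= needed:
            safe = True  # same as the pessimistic request, always correct
        elif len(merged) < needed:
            safe = not tails  # short page is fine only if no shard was cut off
        else:
            cutoff = merged[-1][0]
            safe = all(tail <= cutoff for tail in tails)
        if safe:
            return merged[start:needed]
        n = min(n * 2, needed)  # assumed back-off policy: double the guess


def per_shard_window(start, rows, num_shards):
    """Alternative 2: split the window evenly, returning (start, rows) per shard."""
    return start // num_shards, -(-rows // num_shards)  # rows rounded up
```

With start=5000, rows=1000 and 10 shards, per_shard_window gives (500, 100), 
matching the numbers above. optimistic_page only saves work when hits are 
spread fairly evenly; when one shard holds all the top hits it ends up asking 
for start+rows anyway, after extra round trips.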

In general, search engines are optimized for retrieving the top 10 of something, 
and bad at retrieving 10 results starting at a large offset.  Limit the depth 
people can page, or restructure queries to avoid the latter case.

> Distributed Search over HTTP
> ----------------------------
>
>                 Key: SOLR-303
>                 URL: https://issues.apache.org/jira/browse/SOLR-303
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>            Reporter: Sharad Agarwal
>            Assignee: Yonik Seeley
>             Fix For: 1.3
>
>         Attachments: distributed.patch, distributed.patch, distributed.patch, 
> distributed.patch, distributed.patch, distributed.patch, distributed.patch, 
> distributed.patch, distributed.patch, distributed.patch, distributed.patch, 
> distributed.patch, distributed_add_tests_for_intended_behavior.patch, 
> distributed_facet_count_bugfix.patch, distributed_pjaol.patch, 
> fedsearch.patch, fedsearch.patch, fedsearch.patch, fedsearch.patch, 
> fedsearch.patch, fedsearch.patch, fedsearch.patch, fedsearch.stu.patch, 
> fedsearch.stu.patch, shards_qt.patch, solr-dist-faceting-non-ascii-all.patch
>
>
> Searching over multiple shards and aggregating results.
> Motivated by http://wiki.apache.org/solr/DistributedSearch

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
