Some people in the Elasticsearch community use random scoring [1] to sample a subset of documents from the search results. Maybe something similar could be implemented for Solr?

There are probably more efficient sampling solutions than this one, but it is likely more straightforward to implement.
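To make the idea concrete, here is a minimal Python sketch of sampling by seeded random score. It is not Solr or Elasticsearch code: the function names `random_score` and `sample_results`, the SHA-256 hashing, and the default seed are all assumptions chosen for illustration. Each matching document gets a repeatable pseudo-random score derived from its id and a seed, and only documents scoring below the requested fraction are kept.

```python
import hashlib


def random_score(doc_id: str, seed: str) -> float:
    """Map (seed, doc_id) to a repeatable pseudo-random score in [0, 1).

    Hypothetical helper: hashing the id with a seed stands in for the
    per-document random score that a search engine would compute.
    """
    digest = hashlib.sha256(f"{seed}:{doc_id}".encode("utf-8")).digest()
    # Use the first 8 bytes as an unsigned integer and scale it into [0, 1).
    return int.from_bytes(digest[:8], "big") / 2**64


def sample_results(doc_ids, fraction, seed="42"):
    """Keep the matching documents whose score falls below `fraction`.

    `doc_ids` is the list of ids returned by the query; `fraction` is the
    share of the result set to keep (0.5 keeps roughly half).
    """
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    return [d for d in doc_ids if random_score(d, seed) < fraction]
```

Calling `sample_results(hits, 0.5)` on the ids matched by a query returns roughly half of them, and the same seed always yields the same sample, so facets and stats computed on it are reproducible. The sample size is only approximately the requested fraction, since each document is kept or dropped independently.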

[1] https://www.elastic.co/guide/en/elasticsearch/guide/current/random-scoring.html

--
Renaud Delbru

On 27/09/16 15:57, googoo wrote:
Hi,

Is it possible to sample based on the search result?
For example, run the query first, and the search result returns 1 million documents.
With random sampling, 50% (500K) of the documents are returned for facets and stats.

The sampling needs to be based on the search result.

Thanks,
Yongtao



--
View this message in context: 
http://lucene.472066.n3.nabble.com/how-to-sampling-search-result-tp4298269.html
Sent from the Solr - User mailing list archive at Nabble.com.

