I have an Elasticsearch index with more than 20 million documents, which I'm 
using to perform GeoBoundingBox filters combined with other queries.
I'd like to have, for each search, a random sample from my results.
After reading the documentation, I tried using the RandomScoreFunction, 
which gives a random score to each result from my queries.
The problem is performance. Applying this to around 10 million 
documents (which can happen sometimes) takes more than
10 seconds. The ES cluster has 6 servers, and 10 seconds is 
quite a long time for the kind of application I want.
Is there any way to get a consistent sample from my results, preferably 
one with a good geographical distribution?
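
To make the setup concrete, here is a minimal sketch of the kind of request 
described above, written as a Python function that builds the search body. 
It is an illustration, not the exact query in use: the field name `location`, 
the helper name `build_random_sample_query`, the seed, and the coordinates 
are all assumptions, and the `bool`/`filter` wrapper and `random_score` 
options differ between Elasticsearch versions.

```python
import json


def build_random_sample_query(top_left, bottom_right, seed, size=100,
                              geo_field="location"):
    """Build a search body that scores bounding-box hits randomly.

    top_left and bottom_right are (lat, lon) tuples. geo_field is a
    hypothetical geo_point field name; replace it with the real mapping.
    """
    return {
        "size": size,
        "query": {
            "function_score": {
                # Restrict the candidate set to the bounding box first.
                "query": {
                    "bool": {
                        "filter": {
                            "geo_bounding_box": {
                                geo_field: {
                                    "top_left": {
                                        "lat": top_left[0],
                                        "lon": top_left[1],
                                    },
                                    "bottom_right": {
                                        "lat": bottom_right[0],
                                        "lon": bottom_right[1],
                                    },
                                }
                            }
                        }
                    }
                },
                # A fixed seed keeps the random ordering repeatable.
                "random_score": {"seed": seed},
                # Use only the random value as the final score.
                "boost_mode": "replace",
            }
        },
    }


if __name__ == "__main__":
    body = build_random_sample_query((40.73, -74.1), (40.01, -71.12), seed=42)
    print(json.dumps(body, indent=2))
```

The random score still has to be computed for every matching document, 
which is why the cost grows with the size of the result set.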
