On 5/5/13 7:48 PM, Mingfeng Yang wrote:
Dear Solr Users,

Does anyone know what is the best way to iterate through each document in a
Solr index with billion entries?

I tried to use  select?q=*:*&start=xx&rows=500  to get 500 docs each time
and then change start value, but it got very slow after getting through
about 10 million docs.

Thanks,
Ming-

You need to use a unique and stable sort key and get documents > sortkey. For example, if you have a unique key, retrieve documents ordered by the unique key, and for each batch get documents > max (key) from the previous batch

-Mike

Reply via email to