Bram Van Dam [bram.van...@intix.eu] wrote:

[Solr cursors]

> Oh thanks, that's a pretty interesting read. The scale we're
> investigating is several orders of magnitude larger than what was tested
> there, so I'm still a bit worried.

The beauty of the cursor is that it is has little to no overhead, relative to a 
standard top-X sorted search. A standard search uses a sliding window over the 
full result set, as does a cursor-search. Same amount of work. It is just a 
question of limits for the window.

> The largest index I currently have access to is
> about a billion documents in size. Paging there is a nightmare, but the
> Solr version is too old to support cursors so I'm afraid I can't offer
> any useful data.

Non-cursor paging in Solr uses a sliding window sort with a heap that contains 
all documents up to the paging number. A heap is a very fine thing for sliding 
window sort, as long as it is small. But performance drops to horrible levels 
when it gets large as it is extremely RAM-cache unfriendly.

> Does anyone have any performance data on multi-billion-document indexes?

Sorry, no. I could do a test on our 7 billion documents index, but it would 
have to wait until the end of January.

>Nobody will hit next 499 times, but a lot of our users skip to the last
> page quite often. Maybe I should make *that* as hard as possible. Hmm.

Issue a search with sort in reverse order, then reverse the returned list of 
documents?

- Toke Eskildsen

Reply via email to