Re: Automatic index balancing plugin or other solution?

2014-04-09 Thread joergpra...@gmail.com
The number of documents is not relevant to search time. The important factors are the type of query, shard size, the number of unique terms (the dictionary size), the number of segments, network latency, disk drive latency, ... Maybe you mean equal distribution of docs with same

Automatic index balancing plugin or other solution?

2014-04-08 Thread Josh Harrison
I have heard that, ideally, you want a similar number of documents per shard for optimal search times. Is that correct? My data volumes are all over the place, from 100k to tens of millions in a week. I'm thinking about a river plugin that could: Take a mapping object as a