The number of documents is not relevant to the search time.
Important factors for search time are the type of query, shard size, the
number of unique terms (the dictionary size), the number of segments,
network latency, disk drive latency, ...
Maybe you mean equal distribution of docs with same
I have heard that ideally, you want to have a similar number of documents
per shard for optimal search times, is that correct?
I have data volumes that are just all over the place, from 100k to tens of
millions in a week.
I'm thinking about a river plugin that could:
Take a mapping object as a