On 3/6/2018 11:53 AM, Nawab Zada Asad Iqbal wrote:
> I have 117 shards and I tried to use document ids from zero to 116. I find
> that the distribution is very uneven, e.g., the largest bucket receives
> a total of 5 documents, and around 38 shards will be empty. Is it expected?

With such a small data set, this fits what I would expect.
Choosing buckets by hashing (which is what compositeId does) is not
perfect, but if you send it thousands or millions of documents, it will
be *generally* balanced.
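
If you want to see the effect for yourself, here is a rough sketch. It is
not Solr's code: it uses SHA-256 from the Python standard library as a
stand-in hash, and the function names are made up for illustration. It
gives each shard an equal slice of a 32-bit hash range and counts how many
ids land in each slice. With n ids spread at random over n buckets, about
1/e (roughly 37%) of the buckets are expected to stay empty, which for 117
shards is around 43, so the numbers you saw are in the normal range.

```python
import hashlib
from collections import Counter


def bucket_counts(num_docs, num_shards):
    """Hash the ids "0" .. str(num_docs - 1) and return docs per shard."""
    counts = Counter()
    for doc_id in range(num_docs):
        # Stand-in hash (an assumption, not what Solr uses): take the
        # first 4 bytes of SHA-256 as an unsigned 32-bit value.
        digest = hashlib.sha256(str(doc_id).encode("utf-8")).digest()
        h = int.from_bytes(digest[:4], "big")
        # Split the 32-bit range into num_shards equal, contiguous slices.
        shard = (h * num_shards) >> 32
        counts[shard] += 1
    return [counts[s] for s in range(num_shards)]


def summarize(counts):
    """Return the number of empty shards and the extreme bucket sizes."""
    return {
        "empty": counts.count(0),
        "smallest": min(counts),
        "largest": max(counts),
    }


if __name__ == "__main__":
    for num_docs in (117, 117_000):
        print(num_docs, summarize(bucket_counts(num_docs, 117)))
```

Running it with 117 ids and then with 117,000 ids should show the same
pattern: many empty shards in the first case, and bucket sizes that sit
fairly close to the average in the second.
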
Thanks,
Shawn