On 3/6/2018 11:53 AM, Nawab Zada Asad Iqbal wrote:
I have 117 shards and i tried to use document ids from zero to 116. I find
that the distribution is very uneven, e.g., the largest bucket receives
total 5 documents; and around 38 shards will be empty.  Is it expected?

With such a small data set, this fits what I would expect.

Choosing buckets by hashing (which is what compositeId does) is not perfect, but if you send it thousands or millions of documents, it will be *generally* balanced.

Thanks,
Shawn

Reply via email to