Hi,
There is a 12-node cluster, still stuck on 1.0.8.
All nodes in the cluster ring are balanced.
Using random partitioner.
All CFs use compression.
Data size on nodes varies from 40G to 75G.
This variance is not due to the bigger nodes having more uncompacted
SSTables than the others.
The biggest CFs mostly have the exact same row keys and just store
different data, so the data for a given key should end up on the same
node for each of these CFs.
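(Just to spell out the assumption behind that, here is a rough sketch of
how I understand RandomPartitioner key placement; the key below is made
up, and I'm glossing over the exact token range:)

import hashlib

def token(row_key: bytes) -> int:
    # RandomPartitioner derives the token from MD5 of the raw key,
    # so it depends only on the key, never on the CF.
    return int.from_bytes(hashlib.md5(row_key).digest(), "big")

# Same key -> same token -> same node, for every CF storing that key.
print(token(b"user:12345"))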
The key estimate for each of these biggest CFs on the nodes with the
larger data size is almost twice the estimate on the nodes with the
smallest data size, i.e. proportional to the data size on the node.
These CFs have about 50-100 million rows per node.
I can't understand how it is statistically possible that, with the
random partitioner, some nodes end up with 2x more keys than others
when there are 50-100 million keys per node.
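Here is the quick back-of-the-envelope simulation I used to sanity-check
that intuition (all assumptions mine: 12 equal token ranges, uniform
MD5 hashing, made-up key strings and sample size):

import collections, hashlib

NODES = 12
SAMPLE = 1_000_000  # sampled keys; the real cluster has 50-100M per node

counts = collections.Counter()
for i in range(SAMPLE):
    # model: uniform 128-bit MD5 tokens split into 12 equal ranges
    tok = int.from_bytes(hashlib.md5(b"key:%d" % i).digest(), "big")
    counts[tok * NODES >> 128] += 1

spread = max(counts.values()) / min(counts.values())
print("max/min keys per range: %.4f" % spread)  # comes out very close to 1.0
# Analytically, the relative stddev of keys per range is sqrt((1-p)/(N*p))
# with p = 1/12; at 50M keys per node that is ~0.01%, so a 2x skew in key
# counts can't plausibly come from the hashing itself.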
Any ideas how this is possible?
Anything else I can check?
tnx
Alex