I also run repair, compact and cleanup every couple of days, and I have daily restarts scheduled in crontab. It doesn't hurt, and it keeps a node from becoming unresponsive after many days of operation, which has happened before. Older files get cleaned up on restart.

It doesn't take long to shut down and restart a node, so if there is enough replication in the cluster it's not an issue.

Maxim


On 1/25/2012 1:13 PM, Karl Hiramoto wrote:
On 01/25/12 16:09, R. Verlangen wrote:
Hi there,

I'm currently running a 2-node cluster for some small projects that might need to scale up in the future; that's why we chose Cassandra. The actual problem is that hard drive usage on one of the nodes keeps growing.

For example:
- after a fresh restart ~ 10GB
- after a couple of days running ~ 60GB

I know that Cassandra uses lots of disk space, but is this still normal? I'm running Cassandra 0.8.7.



I run 9 nodes with Cassandra 0.7.8 and we see the same behaviour, but we keep it under control by running this sequence:

nodetool repair
nodetool compact
nodetool cleanup
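
The sequence above could be wrapped in a small script so the three steps always run in the same order and stop at the first failure. This is only a sketch: the function names, the HOST and DRY_RUN variables, and the dry-run default are assumptions for illustration, not something taken from this thread.

```shell
#!/bin/sh
# Hypothetical wrapper around the repair/compact/cleanup sequence.
# HOST and DRY_RUN are illustrative assumptions; adjust to your setup.
HOST="${HOST:-localhost}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, 0 = actually run them

run_step() {
    # $1 is the nodetool subcommand: repair, compact or cleanup
    if [ "$DRY_RUN" = "1" ]; then
        echo "nodetool -h $HOST $1"
    else
        nodetool -h "$HOST" "$1" || return 1
    fi
}

maintain() {
    # Run the steps in the order given above; stop at the first failure.
    for step in repair compact cleanup; do
        run_step "$step" || { echo "step failed: $step" >&2; return 1; }
    done
}

maintain
```

With the default DRY_RUN=1 the script only prints the three commands, so it can be checked before it is pointed at a live node. Setting DRY_RUN=0 runs them for real, for example from a cron job every couple of days.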

IIRC, according to the 1.0.x changelog, this disk usage is supposed to be improved.


--
Karl
