I also do repair, compact and cleanup every couple of days, and also
have daily restarts on
crontab. It doesn't hurt and I avoid having a node becoming unresponsive
after many days
of operation, that has happened before. Older files get cleaned up on
restart.
It doesn't take long to shut down and restart a node,
so if there is enough replication in the cluster it's not any issue.
Maxim
On 1/25/2012 1:13 PM, Karl Hiramoto wrote:
On 01/25/12 16:09, R. Verlangen wrote:
Hi there,
I'm currently running a 2-node cluster for some small projects that
might need to scale-up in the future: that's why we chose Cassandra.
The actual problem is that one of the node's harddrive usage keeps
growing.
For example:
- after a fresh restart ~ 10GB
- after a couple of days running ~ 60GB
I know that Cassandra uses lots of diskspace but is this still
normal? I'm running cassandra 0.8.7
I run 9 nodes with cassandra 0.7.8 and we see this same behaviour,
but we keep it under control by doing the sequence:
nodetool repair
nodetool compact
nodetool cleanup
According to the 1.0.x changelog IIRC this disk usage is supposed to
be improved.
--
Karl