Hi all,

What is the recommended strategy for backing up the data stored inside
cassandra?

I realized that Cass. is a distributed database, and with a decent
replication factor, backups are "already done" in some sense.  But, as a
relatively new user, I'm always concerned that the data is only within the
system and not stored *anywhere* else.

In an earlier email in the list, the recommendation was:

Until tickets 193 and 520 are done, the easiest thing is to copy all
the sstables from the other nodes that have replicas for the ranges it
is responsible for (e.g. for replication factor of 3 on rack unaware
partitioner, the nodes before it and the node after it on the right
would suffice), and then run nodeprobe cleanup to clear out the
excess.

Is this still the recommended approach?  If I backed up the files in
DataDirectories/*, is it possible to restore a node using those files?
 (That is, bring up a new node, copy the backed up files from the
crashed node onto the new node, then have the new node join the
cluster?)


Thanks

Dave Viner

Reply via email to