Hi all, What is the recommended strategy for backing up the data stored inside cassandra?
I realized that Cass. is a distributed database, and with a decent replication factor, backups are "already done" in some sense. But, as a relatively new user, I'm always concerned that the data is only within the system and not stored *anywhere* else. In an earlier email in the list, the recommendation was: Until tickets 193 and 520 are done, the easiest thing is to copy all the sstables from the other nodes that have replicas for the ranges it is responsible for (e.g. for replication factor of 3 on rack unaware partitioner, the nodes before it and the node after it on the right would suffice), and then run nodeprobe cleanup to clear out the excess. Is this still the recommended approach? If I backed up the files in DataDirectories/*, is it possible to restore a node using those files? (That is, bring up a new node, copy the backed up files from the crashed node onto the new node, then have the new node join the cluster?) Thanks Dave Viner