Cleanup is a great way to free up disk space. Just note you might run into https://issues.apache.org/jira/browse/CASSANDRA-9036 if you use a version older than 2.0.15.
On Thu, Feb 14, 2019 at 10:20 AM Oleksandr Shulgin < oleksandr.shul...@zalando.de> wrote: > On Wed, Feb 13, 2019 at 6:47 PM Jeff Jirsa <jji...@gmail.com> wrote: > >> Depending on how bad data resurrection is, you should run it for any host >> that loses a range. In vnodes, that's usually all hosts. >> >> Cleanup with LCS is very cheap. Cleanup with STCS/TWCS is a bit more work. >> > > Wait, doesn't cleanup just rewrite every SSTable one by one? Why would > compaction strategy matter? Do you mean that after cleanup STCS may pick > some resulting tables to re-compact them due to the min/max size > difference, which would not be the case with LCS? > > >> If you're just TTL'ing all data, it may not be worth the effort. >> > > Indeed, but in our case the main reason to scale out is that the nodes are > running out of disk space, so we really want to get rid of the extra copies. > > -- > Alex > >