Although it's not exactly the ability to list specific SSTables, the ability
to only compact specific CFs will be in upcoming releases:

https://issues.apache.org/jira/browse/CASSANDRA-1812

- Tyler

On Wed, Jan 5, 2011 at 7:46 PM, Edward Capriolo <edlinuxg...@gmail.com>wrote:

> On Wed, Jan 5, 2011 at 4:31 PM, Jonathan Ellis <jbel...@gmail.com> wrote:
> > Pretty sure there's logic in there that says "don't bother compacting
> > a single sstable."
> >
> > On Wed, Jan 5, 2011 at 2:26 PM, shimi <shim...@gmail.com> wrote:
> >> How does minor compaction is triggered? Is it triggered Only when a new
> >> SStable is added?
> >>
> >> I was wondering if triggering a compaction
> with minimumCompactionThreshold
> >> set to 1 would be useful. If this can happen I assume it will do
> compaction
> >> on files with similar size and remove deleted rows on the rest.
> >> Shimi
> >> On Tue, Jan 4, 2011 at 9:56 PM, Peter Schuller <
> peter.schul...@infidyne.com>
> >> wrote:
> >>>
> >>> > I don't have a problem with disk space. I have a problem with the
> data
> >>> > size.
> >>>
> >>> [snip]
> >>>
> >>> > Bottom line is that I want to reduce the number of requests that goes
> to
> >>> > disk. Since there is enough data that is no longer valid I can do it
> by
> >>> > reclaiming the space. The only way to do it is by running Major
> >>> > compaction.
> >>> > I can wait and let Cassandra do it for me but then the data size will
> >>> > get
> >>> > even bigger and the response time will be worst. I can do it manually
> >>> > but I
> >>> > prefer it to happen in the background with less impact on the system
> >>>
> >>> Ok - that makes perfect sense then. Sorry for misunderstanding :)
> >>>
> >>> So essentially, for workloads that are teetering on the edge of cache
> >>> warmness and is subject to significant overwrites or removals, it may
> >>> be beneficial to perform much more aggressive background compaction
> >>> even though it might waste lots of CPU, to keep the in-memory working
> >>> set down.
> >>>
> >>> There was talk (I think in the compaction redesign ticket) about
> >>> potentially improving the use of bloom filters such that obsolete data
> >>> in sstables could be eliminated from the read set without
> >>> necessitating actual compaction; that might help address cases like
> >>> these too.
> >>>
> >>> I don't think there's a pre-existing silver bullet in a current
> >>> release; you probably have to live with the need for
> >>> greater-than-theoretically-optimal memory requirements to keep the
> >>> working set in memory.
> >>>
> >>> --
> >>> / Peter Schuller
> >>
> >>
> >
> >
> >
> > --
> > Jonathan Ellis
> > Project Chair, Apache Cassandra
> > co-founder of Riptano, the source for professional Cassandra support
> > http://riptano.com
> >
>
> I was wording if it made sense to have a JMX operation that can
> compact a list of tables by file name. This opens it up for power
> users to have more options then compact entire keyspace.
>

Reply via email to