[ https://issues.apache.org/jira/browse/CASSANDRA-1041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12863553#action_12863553 ]
Jonathan Ellis commented on CASSANDRA-1041: ------------------------------------------- You don't want to skip large files in major compactions, since the definition of major is "compact everything so it is safe to remove tombstones." (See http://wiki.apache.org/cassandra/DistributedDeletes and http://wiki.apache.org/cassandra/MemtableSSTable). Major compactions are not run automatically so if you don't want that going on just don't invoke nodeprobe compact. Minor compaction are batched up according to sstable size and only similar sizes are merged. Here again ignoring larger sstables indefinitely is a bad idea, because the number of sstables you have to merge rows from for reads is unbounded. Better solution: expose CompactionManager.minimumCompactionThreshold and CM.maximumCompactionThreshold in the configuration file, so you can tune minor compactions to require more or less sstables in the same "bucket" before compacting. > Skip large size (Configurable) SSTable in minor or/and major compaction > ----------------------------------------------------------------------- > > Key: CASSANDRA-1041 > URL: https://issues.apache.org/jira/browse/CASSANDRA-1041 > Project: Cassandra > Issue Type: New Feature > Components: Core > Reporter: Schubert Zhang > Priority: Minor > Attachments: CASSANDRA-1041-0.6.1.patch, CASSANDRA-1041-0.6.patch > > > When the SSTable files are large enough, such as 100GB, the compaction > (include minor and major) cost is big (disk IO, CPU, memory), etc. > In some applications, we accept not compcating all SSTables to the final very > large ones. > This feature provide two optional configurable attributes > MinorCompactSkipInGB and MajorCompactSkipInGB for each ColumnFamily. > The optional MinorCompactSkipInGB attribute specifies the maximum size of > SSTables which will be compcated in minor-compaction. The SSTables larger > than MinorCompactSkipInGB will be skipped. The optional MajorCompactSkipInGB > attribute is same for major-compaction. > The default of these attributes are 0, means do not skip, just as current > 0.6.1. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.