[ https://issues.apache.org/jira/browse/CASSANDRA-9013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jeremy Hanna updated CASSANDRA-9013: ------------------------------------ Labels: dtcs (was: ) > Add new option making DTCS unify larger time windows sooner > ----------------------------------------------------------- > > Key: CASSANDRA-9013 > URL: https://issues.apache.org/jira/browse/CASSANDRA-9013 > Project: Cassandra > Issue Type: Improvement > Reporter: Björn Hegerfors > Assignee: Björn Hegerfors > Priority: Minor > Labels: dtcs > Attachments: cassandra-2.0-CASSANDRA-9013.txt > > > In my very long post on CASSANDRA-6602, I mentioned a more aggressive > windowing strategy, which looks for opportunities to compact into larger > SSTables sooner. The original approach was that when we have min_threshold > windows of the same size and another one of the same size appears next to > them, those windows (not including the newest addition) merge. This new > approach doesn't wait for a (min_threshold+1)th one. As soon as min_threshold > windows of one size are created, they merge at once. The only exception is > the "incoming window", which stays outside of merging with other windows > until it is no longer the incoming window. > This does mean that occasionally more than min_threshold SSTables, not all of > similar size get compacted, intentionally. For example, let's say > min_threshold is 4, then if we have 3 windows size 16, 3 windows size 4 and > just get a 4th size 1 window that isn't the incoming window, we immediately > merge all of those into a size 64 window. Typically we expect one SSTable to > be in each window with a file size corresponding to the window size in some > unit of measure. So we merge roughly 10 SSTables in that scenario. > These bigger compactions happen rarely, about as often as a similar thing > happens in STCS (on occasion the number of SSTables gets very small). This > tweak to DTCS is meant to mimic that behavior in STCS. It has been observed > that DTCS typically has 50% to 100% more SSTables than STCS, so this is a way > to counter that. -- This message was sent by Atlassian JIRA (v6.3.4#6332)