[ https://issues.apache.org/jira/browse/CASSANDRA-9830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15064573#comment-15064573 ]

Paulo Motta commented on CASSANDRA-9830:
----------------------------------------

Rebased and submitted cstar_perf tests comparing LCS with default settings against LCS with the {{skip_top_level_bloom_filter}} option enabled, on [ssd|http://cstar.datastax.com/tests/id/35023b30-a5bc-11e5-aa2d-0256e416528f] and [hdd|http://cstar.datastax.com/tests/id/3d14a8ac-a5bd-11e5-aa2d-0256e416528f]. Will report back when results are ready, and may experiment with other parameters (sstable size, bloom filter fp chance, etc.) after the initial results.

> Option to disable bloom filter in highest level of LCS sstables
> ---------------------------------------------------------------
>
>                 Key: CASSANDRA-9830
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-9830
>             Project: Cassandra
>          Issue Type: New Feature
>          Components: Compaction
>            Reporter: Jonathan Ellis
>            Assignee: Paulo Motta
>            Priority: Minor
>              Labels: performance
>             Fix For: 3.2
>
>
> We expect about 90% of data to be in the highest level of LCS in a fully
> populated series. (See also CASSANDRA-9829.)
> Thus if the user is primarily asking for data (partitions) that has actually
> been inserted, the bloom filter on the highest level only helps reject
> sstables about 10% of the time.
> We should add an option that suppresses bloom filter creation on top-level
> sstables. This will dramatically reduce memory usage for LCS and may even
> improve performance as we no longer check a low-value filter.
> (This is also an idea from RocksDB.)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
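
As a rough check of the "about 90%" figure in the quoted issue description, the following sketch computes the share of data held by the highest level when every level is full. It assumes each level holds 10 times as much data as the one below it and ignores L0; the function name {{top_level_fraction}} and the {{fanout}} parameter are illustrative and do not come from the Cassandra code base.

```python
def top_level_fraction(levels: int, fanout: int = 10) -> float:
    """Share of all data that sits in the highest level.

    Assumes levels 1..levels are completely full and that level i
    holds fanout**i units of data (L0 is ignored for simplicity).
    """
    sizes = [fanout ** i for i in range(1, levels + 1)]
    return sizes[-1] / sum(sizes)


if __name__ == "__main__":
    # Print the top-level share for a few tree depths.
    for n in (2, 3, 4, 5):
        print(f"{n} levels: {top_level_fraction(n):.4f}")
```

Under these assumptions the share stays just above 0.90 for any depth of two or more levels, which matches the estimate that a top-level bloom filter can only reject roughly 10% of lookups for keys that exist.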