Considerations for larger bloom filters
---------------------------------------

                 Key: CASSANDRA-1555
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-1555
             Project: Cassandra
          Issue Type: Improvement
          Components: Core
            Reporter: Stu Hood
             Fix For: 0.8


To (optimally) support SSTables larger than 143 million keys, we need to 
support bloom filters larger than 2 GB, which java.util.BitSet can't handle 
directly.

A few options:
* Switch to a BitSet class which supports 2^63 bits (Lucene's OpenBitSet)
* Partition the java.util.BitSet behind our current BloomFilter
** Straightforward bit partitioning: bit N is in bitset N // 2^31
** Separate equally sized complete bloom filters for member ranges, which can 
be used independently or OR'd together under memory pressure.

All of these options require new approaches to serialization.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to