Nate McCall created CASSANDRA-5970:
--------------------------------------

             Summary: FilteredRangeSlice command for regex searches against 
column names on known sets of keys
                 Key: CASSANDRA-5970
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5970
             Project: Cassandra
          Issue Type: New Feature
          Components: Core
            Reporter: Nate McCall


This is the ability to apply a regex against columns when the set of keys is 
known. In filtering the keys, we would like to allow for the following clauses: 
E, GTE, LTE, NE, inclusive list, exclusive list.

The end goal is to provide for efficient searching in the case where you have 
some knowledge of the keys. A specific use case would be, say, searching user 
agent strings in the given set of date buckets in the classic time-series web 
log use case. This is a "sweet spot" for Cassandra and providing a more direct 
method of access for such will help a lot of users.

Additionally, this will provide some level of feature parity with RDBMS crowd 
who've had this feature for some time.

Internally, this will include the introduction of a new Verb, SSTableScanner 
extension and an ExtendedFilter implementation which applies the regex as well 
as a new method on StorageProxy.

This issue does not cover exposing this new query method to thrift and CQL, but 
obviously that will be required for this to be of any practical use. Those 
should be covered by separate issues.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to