Mike Adamson created CASSANDRA-18167:
----------------------------------------

             Summary: Bypass row-awareness for small partitions
                 Key: CASSANDRA-18167
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-18167
             Project: Cassandra
          Issue Type: Improvement
          Components: Feature/SAI
            Reporter: Mike Adamson


SAI supports row-awareness in that it indexes both the partition key and the 
clustering key of a row. This improves query performance significantly for wide 
partitions with many rows but it can impact performance for small partitions 
where it could make sense to bypass row-awareness post-filter the results (read 
the whole partition) or batch rows for a single partition.

However this is achieved it would be necessary for the index to have an idea of 
the size of the partition being read and to be aware of whether reading the 
whole partition is likely to improve read performance.  

SAI is aware of partition sizes during indexing so one option would be feed 
these sizes into a histogram in the index metadata and apply a set of rules to 
this metadata to decide whether we should attempt any optimisation.





--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

Reply via email to