Yifan Cai created CASSANDRA-19325:
-------------------------------------

             Summary: [Analytics] Fix range splitting that can produce 
overlapping ranges
                 Key: CASSANDRA-19325
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-19325
             Project: Cassandra
          Issue Type: Bug
            Reporter: Yifan Cai
            Assignee: Yifan Cai


The range splitting implementation can produce the following false results. 

- Given a tiny range, it can produce duplicated ranges, leading to spark 
executors working on the same data set.
- The produced ranges are closed on both ends, making the same tokens being 
shared by 2 ranges, leading to data duplication. 

Besides the splitting error, during write process, the split sub-ranges are 
unordered. It does not seem to affect correctness, but can be confusing.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

Reply via email to