[ https://issues.apache.org/jira/browse/CASSANDRA-19325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yifan Cai updated CASSANDRA-19325: ---------------------------------- Test and Documentation Plan: unit; ci Status: Patch Available (was: Open) > [Analytics] Fix range splitting that can produce overlapping ranges > ------------------------------------------------------------------- > > Key: CASSANDRA-19325 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19325 > Project: Cassandra > Issue Type: Bug > Components: Analytics Library > Reporter: Yifan Cai > Assignee: Yifan Cai > Priority: Normal > Time Spent: 10m > Remaining Estimate: 0h > > The range splitting implementation can produce the following false results. > - Given a tiny range, it can produce duplicated ranges, leading to spark > executors working on the same data set. > - The produced ranges are closed on both ends, making the same tokens being > shared by 2 ranges, leading to data duplication. > Besides the splitting error, during write process, the split sub-ranges are > unordered. It does not seem to affect correctness, but can be confusing. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org