[ https://issues.apache.org/jira/browse/CASSANDRA-19727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Francisco Guerrero updated CASSANDRA-19727: ------------------------------------------- Test and Documentation Plan: Added tests for RandomPartitioner. Status: Patch Available (was: In Progress) CI : https://app.circleci.com/pipelines/github/frankgh/cassandra-analytics?branch=CASSANDRA-19727 > [Analytics] Bulk writer fails validation stage when writing to a cluster > using RandomPartitioner > ------------------------------------------------------------------------------------------------ > > Key: CASSANDRA-19727 > URL: https://issues.apache.org/jira/browse/CASSANDRA-19727 > Project: Cassandra > Issue Type: Bug > Components: Analytics Library > Reporter: Francisco Guerrero > Assignee: Francisco Guerrero > Priority: Normal > Fix For: NA > > > In bulk writer after writing SSTables, written data to SSTables is read back > again to perform validations before shipping the files to the Cassandra > instances. The logic to validate the SSTables assumes > {{{}Murmur3Partitioner{}}}. This validation fails however when a bulk writer > job is running against a cluster using the {{RandomPartitioner}} with the > following stacktrace: > {code} > java.lang.RuntimeException: java.lang.IllegalStateException: Partitioner in > ValidationMetadata does not match TableMetaData: > org.apache.cassandra.dht.RandomPartitioner vs. > org.apache.cassandra.dht.Murmur3Partitioner > at > org.apache.cassandra.spark.utils.Throwing.lambda$function$2(Throwing.java:84) > at > java.base/java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:197) > at > java.base/java.util.HashMap$KeySpliterator.forEachRemaining(HashMap.java:1707) > at > java.base/java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:509) > at > java.base/java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:499) > at > java.base/java.util.stream.ReduceOps$ReduceOp.evaluateSequential(ReduceOps.java:921) > at > java.base/java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234) > at > java.base/java.util.stream.ReferencePipeline.collect(ReferencePipeline.java:682) > at > org.apache.cassandra.spark.data.BasicSupplier.openAll(BasicSupplier.java:44) > at > org.apache.cassandra.bridge.CassandraBridgeImplementation.getCompactionScanner(CassandraBridgeImplementation.java:248) > at > org.apache.cassandra.spark.data.DataLayer.openCompactionScanner(DataLayer.java:262) > at > org.apache.cassandra.spark.bulkwriter.SSTableWriter.validateSSTables(SSTableWriter.java:129) > at > org.apache.cassandra.spark.bulkwriter.SSTableWriter.close(SSTableWriter.java:113) > at > org.apache.cassandra.spark.bulkwriter.RecordWriter.finalizeSSTable(RecordWriter.java:224) > at > org.apache.cassandra.spark.bulkwriter.RecordWriter.write(RecordWriter.java:122) > ... 15 more > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org For additional commands, e-mail: commits-h...@cassandra.apache.org