[ https://issues.apache.org/jira/browse/CASSANDRA-9302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15031474#comment-15031474 ]
Stefania commented on CASSANDRA-9302: ------------------------------------- I've implemented three more things: * tests with failure injection, in a fashion similar to CASSANDRA-9304 - pull request is [here|https://github.com/riptano/cassandra-dtest/pull/689]. * manual replica selection rather than relying on the driver TAR - we get better performance because we can batch by replica, not just by primary key. * ingest_rate parameter to control how many rows per second we feed to the worker processes - this is required by CASSANDRA-9303 but it made sense to change it here. > Optimize cqlsh COPY FROM, part 3 > -------------------------------- > > Key: CASSANDRA-9302 > URL: https://issues.apache.org/jira/browse/CASSANDRA-9302 > Project: Cassandra > Issue Type: Improvement > Components: Tools > Reporter: Jonathan Ellis > Assignee: Stefania > Priority: Critical > Fix For: 2.1.x > > > We've had some discussion moving to Spark CSV import for bulk load in 3.x, > but people need a good bulk load tool now. One option is to add a separate > Java bulk load tool (CASSANDRA-9048), but if we can match that performance > from cqlsh I would prefer to leave COPY FROM as the preferred option to which > we point people, rather than adding more tools that need to be supported > indefinitely. > Previous work on COPY FROM optimization was done in CASSANDRA-7405 and > CASSANDRA-8225. -- This message was sent by Atlassian JIRA (v6.3.4#6332)