[ https://issues.apache.org/jira/browse/CASSANDRA-4047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15201523#comment-15201523 ]
Aleksey Yeschenko commented on CASSANDRA-4047:
----------------------------------------------

[~pkolaczk] Please reopen if anything has changed since Dec 2014 and this is now relevant for Spark.

> Bulk hinting
> ------------
>
>                 Key: CASSANDRA-4047
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4047
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Brandon Williams
>              Labels: hints
>         Attachments: 4047-2.0-wip.txt, 4047-wip.txt
>
>
> With the introduction of the BulkOutputFormat, there may be cases where
> someone would like to tolerate node failures and have the job complete, but
> afterwards since we streamed they have to repair or rely on read repair. We
> don't currently have any way of hinting streams, but a node could take a
> snapshot before acknowledging the stream session, then remember to send the
> files in the snapshot to the unavailable nodes when they come back up. This
> isn't quite ideal since of course the node may have compacted these files,
> however it's much simpler than any sort of key tracking at this scale.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
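
For readers skimming the ticket, the snapshot-then-replay idea in the description can be sketched roughly as below. This is only an illustration under stated assumptions, not Cassandra code and not what the attached WIP patches do: `BulkHintStore`, `StreamHint`, `on_stream_received`, `on_node_up`, and the `send_file` callback are all hypothetical names, and plain file copies stand in for a real snapshot and for streaming.

```python
import os
import shutil
from dataclasses import dataclass, field


@dataclass
class StreamHint:
    """Snapshot of one stream session plus the replicas that still need it."""
    snapshot_dir: str
    pending: set = field(default_factory=set)


class BulkHintStore:
    """Hypothetical helper illustrating the proposal; not a Cassandra API."""

    def __init__(self, root, send_file):
        self.root = root            # directory holding per-session snapshots
        self.send_file = send_file  # callable(node, path); stand-in for streaming
        self.hints = {}             # session id -> StreamHint

    def on_stream_received(self, session_id, files, down_nodes):
        """Snapshot the streamed files first, then return True to acknowledge."""
        if down_nodes:
            snap = os.path.join(self.root, session_id)
            os.makedirs(snap, exist_ok=True)
            for path in files:
                # A plain copy stands in for a real snapshot here.
                shutil.copy2(path, os.path.join(snap, os.path.basename(path)))
            self.hints[session_id] = StreamHint(snap, set(down_nodes))
        return True

    def on_node_up(self, node):
        """Replay every snapshot that still lists this node as a pending target."""
        for session_id in list(self.hints):
            hint = self.hints[session_id]
            if node not in hint.pending:
                continue
            for name in sorted(os.listdir(hint.snapshot_dir)):
                self.send_file(node, os.path.join(hint.snapshot_dir, name))
            hint.pending.discard(node)
            if not hint.pending:
                # Every target has caught up, so the snapshot can be dropped.
                shutil.rmtree(hint.snapshot_dir)
                del self.hints[session_id]
```

Because the replay reads from the snapshot rather than the live data directory, it still works after the original files have been removed, at the cost of resending data that may already have been compacted, which is the trade-off the description mentions.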