[ 
https://issues.apache.org/jira/browse/CASSANDRA-8193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14212539#comment-14212539
 ] 

Jimmy Mårdell commented on CASSANDRA-8193:
------------------------------------------

New patched added. I've added an enum for specifying the degree of parallelism. 
This cascaded up in the code path a bit. Backward compatibility should be 
maintained, at the expense of adding a few more forceRepair methods in 
StorageService.

As a side note, can't we remove many of forceRepair methods in 
StorageServiceMBean in 2.1? It's getting quite ugly. nodetool only uses two of 
them (one with range and one without range).


> Multi-DC parallel snapshot repair
> ---------------------------------
>
>                 Key: CASSANDRA-8193
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-8193
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Jimmy Mårdell
>            Assignee: Jimmy Mårdell
>            Priority: Minor
>             Fix For: 2.0.12
>
>         Attachments: cassandra-2.0-8193-1.txt, cassandra-2.0-8193-2.txt
>
>
> The current behaviour of snapshot repair is to let one node at a time 
> calculate a merkle tree. This is to ensure only one node at a time is doing 
> the expensive calculation. The drawback is that it takes even longer time to 
> do the merkle tree calculation.
> In a multi-DC setup, I think it would make more sense to have one node in 
> each DC calculate the merkle tree at the same time. This would yield a 
> significant improvement when you have many data centers.
> I'm not sure how relevant this is in 2.1, but I don't see us upgrading to 2.1 
> any time soon. Unless there is an obvious drawback that I'm missing, I'd like 
> to implement this in the 2.0 branch.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to