[ 
https://issues.apache.org/jira/browse/CASSANDRA-5419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-5419:
--------------------------------------

    Comment: was deleted

(was: Detecting diffs smaller than a row is actually the hard part.  (See 
link.))
    
> Employ column differencing (as done for read repairs) during node repairs 
> --------------------------------------------------------------------------
>
>                 Key: CASSANDRA-5419
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-5419
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Core
>    Affects Versions: 1.1.10
>         Environment: Production
>            Reporter: Ahmed Bashir
>              Labels: compaction, repair
>
> In particular for wide rows, the headroom required for node repairs can be 
> substantial given that entire rows are streamed for any and all row hash 
> discrepancies.
> This headroom must be sustained until compaction slowly compacts these newly 
> streamed SSTables and reduces the overall load on each instance.
> The overall footprint of node repairs would be greatly reduced if we employed 
> differencing at the column level and sent over row mutations, similar to what 
> is done during read repair.  This is a great alternative for deployments 
> wherein sending over entire rows rather than the deltas is not an option.  
> Since node repairs can now specify start and end tokens (i.e. subrange 
> repairs), the additional computation can be broken down easily, and it's a 
> welcome trade-off for significantly less streaming, compaction, and temporary 
> headroom requirements.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to