[jira] [Updated] (CASSANDRA-2419) Risk of counter over-count when recovering commit log

Sylvain Lebresne (JIRA) Fri, 29 Apr 2011 06:58:44 -0700

     [ 
https://issues.apache.org/jira/browse/CASSANDRA-2419?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Sylvain Lebresne updated CASSANDRA-2419:
----------------------------------------

    Attachment: 0001-Record-CL-replay-infos-alongside-sstables-v2.patch

v2 removes the commit log header completely in favor of sstable metadata 
recording where to replay from (patch against 0.8).

This differs from v1 in that, instead of keeping every (segment, 
replay_position) pair, we keep, for a given sstable, only the position for the 
most recent segment (that is, we leverage the fact that we use increasing 
timestamps for commit logs).

The reason for this is twofold:
  # this is more compact (and simpler)
  # if we remove the commit log header, we need to be able to say whether a 
given segment is dirty or not for a given column family. That is, we don't 
want to know whether some replay position ever existed on this segment, but 
whether a relevant one still exists. So for a given column family we really 
only care about the newest (segment, replay_position) pair.
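
To make the two points above concrete, here is a minimal sketch in Python (it 
is not the patch's Java code). The names ReplayPosition, newest_position and 
replay_start are hypothetical, and the model assumes that segment ids are 
creation timestamps, so that ordering pairs by (segment, position) matches 
commit log order. It also conservatively treats the segment holding the 
newest position as dirty.

```python
from typing import Iterable, NamedTuple, Optional


class ReplayPosition(NamedTuple):
    segment: int   # segment id; assumed to be its creation timestamp
    position: int  # offset within that segment


def newest_position(positions: Iterable[ReplayPosition]) -> Optional[ReplayPosition]:
    """Collapse all recorded pairs into the single newest one (the v2 idea)."""
    # Tuples compare field by field, so the latest segment wins, then the
    # highest offset within it.
    return max(positions, default=None)


def replay_start(segment: int, newest: Optional[ReplayPosition]) -> Optional[int]:
    """Offset to replay `segment` from, or None if the segment is clean."""
    if newest is None or segment > newest.segment:
        return 0                    # nothing flushed past this segment yet
    if segment == newest.segment:
        return newest.position      # replay only what follows the flush
    return None                     # older segment: already covered by sstables
```

With recorded pairs (100, 40), (100, 90) and (200, 15), only (200, 15) is 
kept; under these assumptions segment 100 is clean, segment 200 is replayed 
from offset 15, and a later segment 300 is replayed from the start.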

Now there is the question of the upgrade path. With this patch, the (existing) 
commit log headers will be ignored. This means that, ideally, people would run 
drain before upgrading to a version that includes this patch. If they do not, 
the commit logs will be fully replayed. Pre-0.8, that's not a big deal. With 
counters, it could mean over-counts (that's exactly what this ticket is 
about). So I would be in favor of putting this in 0.8.0, since it is a bug fix 
and it will avoid the problem of upgrading from a version that already has 
counters. But I admit this is not a trivial patch, so ...
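
As a rough illustration of why a full replay matters for counters, here is a 
toy model in Python (not Cassandra code). The replay function and the log 
layout are hypothetical: each entry is (position, key, delta), and the 
flushed state stands in for what is already in sstables. Counter increments 
are not idempotent, so re-applying already flushed entries inflates the 
total.

```python
def replay(log, flushed, start):
    """Apply every log entry at or after `start` on top of the flushed state."""
    counters = dict(flushed)  # copy so the flushed state is left untouched
    for position, key, delta in log:
        if position >= start:
            counters[key] = counters.get(key, 0) + delta
    return counters


log = [(0, "hits", 1), (1, "hits", 1), (2, "hits", 1)]
flushed = {"hits": 2}   # entries 0 and 1 already made it into an sstable
replay_position = 2     # recorded alongside the sstable at flush time

full = replay(log, flushed, 0)                   # header ignored, no drain
partial = replay(log, flushed, replay_position)  # replay from recorded position
# full == {"hits": 5} (over-count), partial == {"hits": 3} (correct)
```

Running drain before the upgrade corresponds to starting with an empty log, 
in which case both replays return the flushed state unchanged.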


> Risk of counter over-count when recovering commit log
> -----------------------------------------------------
>
>                 Key: CASSANDRA-2419
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2419
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Core
>    Affects Versions: 0.8 beta 1
>            Reporter: Sylvain Lebresne
>            Assignee: Sylvain Lebresne
>              Labels: counters
>             Fix For: 0.8.0
>
>         Attachments: 0001-Record-CL-replay-infos-alongside-sstables-v2.patch, 
> 0001-Record-and-use-sstable-replay-position.patch
>
>   Original Estimate: 8h
>  Remaining Estimate: 8h
>
> When a memtable is flushed, there is a small delay before the commit log 
> replay position gets updated. If the node fails during this delay, all the 
> updates of this memtable will be replayed during commit log recovery and 
> will end up as over-counts.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
