There are a lot of people on 0.7 for whom CL is working as advertised.
 Not saying it's impossible that there's a bug, but the odds are
against it.

Is it possible for instance that sometimes your cron job takes longer
than five minutes?

On Thu, Aug 18, 2011 at 9:49 AM, Kyle Gibson
<kyle.gib...@frozenonline.com> wrote:
> I am running cassandra 0.7.8. pycassa 1.1.0
>
> Nodes=7, RF=3
>
> This problem started a few months ago and only occurs sporadically.
>
> I receive notifications from paypal's IPN. The IPN data is saved into
> a column family. I add another column for "processed" which is set to
> 0.
>
> Every 5 minutes, a cron script runs and pulls down IPN events that
> haven't been processed. It does some work, and then writes back
> processed to 1.
>
> Usually this worked, but then (when exactly I don't recall),
> occasionally, it started having a problem. Processed wasn't been set
> to 1. So, IPN events would be processed twice.
>
> I tried a few things to fix this. Repair, compact, restarting the
> cluster, upgrading. I even did a complete rebuild of the cluster,
> wiping the data directory and starting fresh on 0.7.8.
>
> I then ditched the "processed" column and decide to use two column
> families. IPNs are put into column family A, and after being
> processed, it is inserted into column family B and deleted from A.
>
> The problem still persisted. At this point I was using CL.QUORUM. So,
> I started using CL.ALL.
>
> And the problem still persists. Having IPN events processed twice
> causes a fair bit of problems, so this is something I really need to
> get resolved.
>
> Thanks,
>
> Kyle
>



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com

Reply via email to