Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-12 Thread DuyHai Doan
First idea to eliminate any issue with regard to stale data: issue the same count query with RF=QUORUM and check whether there are still inconsistencies. On Tue, Mar 10, 2015 at 9:13 AM, Rumph, Frens Jan wrote: > Hi Jens, Mikhail, Daemeon, > > Thanks for your replies. Sorry for my reply being l

Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-12 Thread Rumph, Frens Jan
Hi Jens, Mikhail, Daemeon, Thanks for your replies. Sorry for my reply being late ... mails from the user-list were moved to the wrong inbox on my side. I'm in a development environment and thus using replication factor = 1 and consistency = ONE with three nodes. So the 'results from different no

Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-10 Thread Rumph, Frens Jan
Thanks for the suggestion DuyHai. I assume you mean CL=QUORUM (as in consistency level, not replication factor). As expected, setting the consistency level to quorum or all yields equally inconsistent results for the select count and select distinct queries. Which is good in a way, because if RF=1

Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-04 Thread DuyHai Doan
"Is it to be expected that select count(*) from ... and select distinct partition-key-columns from ... to yield inconsistent results between executions even though the table at hand isn't written to?" Actually, depending on the definition of your primary key, select count(*) and select distinct pa

Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-04 Thread daemeon reiydelle
What is the replication? Could you be serving stale data from a node that was not properly replicated (hints timeout exceeded by a node being down?) On Wed, Mar 4, 2015 at 11:03 AM, Jens Rantil wrote: > Frens, > > What consistency are you querying with? Could be you are simply receiving > resu

Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-04 Thread Jens Rantil
Frens, What consistency are you querying with? Could be you are simply receiving results from different nodes each time. Jens – Sent from Mailbox On Wed, Mar 4, 2015 at 7:08 PM, Mikhail Strebkov wrote: > We have observed the same issue in our production Cassandra cluster (5 nodes > in o

Re: Inconsistent count(*) and distinct results from Cassandra

2015-03-04 Thread Mikhail Strebkov
We have observed the same issue in our production Cassandra cluster (5 nodes in one DC). We use Cassandra 2.1.3 (I joined the list too late to realize we shouldn’t use 2.1.x yet) on Amazon machines (created from community AMI). In addition to count variations of 5 to 10% we observe variati

Inconsistent count(*) and distinct results from Cassandra

2015-03-04 Thread Rumph, Frens Jan
Hi, Is it to be expected that select count(*) from ... and select distinct partition-key-columns from ... yield inconsistent results between executions even though the table at hand isn't written to? I have a table in a keyspace with replication_factor = 1 which is something like: CREATE TABL