Re: Cassandra nodes loaded unequally

2012-10-18 Thread Ben Kaehne
After some time. I believe this is correct. The load seems to be
correlated to compactions/number of files for keyspace/IO etc.

Thanks all!

Regards,

On Thu, Oct 18, 2012 at 9:35 PM, aaron morton aa...@thelastpickle.comwrote:

 At times of high load check the CPU % for the java service running C* to
 confirm C* is the source of load.

 If the load is generated from C* check the logs (or use OpsCentre / other
 monitoring) to see if it correlated to compaction, or Garbage Collection or
 repair or high throughput.

 Cheers

 -
 Aaron Morton
 Freelance Developer
 @aaronmorton
 http://www.thelastpickle.com

 On 17/10/2012, at 12:22 PM, Ben Kaehne ben.kae...@sirca.org.au wrote:

 Nothing unusual.

 All servers are exactly the same. Nothing unusual in the log files. Is
 there any level of logging that I should be turning on?

 Regards,

 On Wed, Oct 17, 2012 at 9:51 AM, Andrey Ilinykh ailin...@gmail.comwrote:

 With your environment (3 nodes, RF=3) it is very difficult to get
 uneven load. Each node receives the same number of read/write
 requests. Probably something is wrong on low level, OS or VM. Do you
 see anything unusual in log files?

 Andrey

 On Tue, Oct 16, 2012 at 3:40 PM, Ben Kaehne ben.kae...@sirca.org.au
 wrote:
  Not connecting to the same node every time. Using Hector to ensure an
 even
  distribution of connections accross the cluster.
 
  Regards,
 
  On Sat, Oct 13, 2012 at 4:15 AM, B. Todd Burruss bto...@gmail.com
 wrote:
 
  are you connecting to the same node every time?  if so, spread out
  your connections across the ring
 
  On Fri, Oct 12, 2012 at 1:22 AM, Alexey Zotov azo...@griddynamics.com
 
  wrote:
   Hi Ben,
  
   I suggest you to compare amount of queries for each node. May be the
   problem
   is on the client side.
   Yoy can do that using JMX:
   org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
   KEYSPACE,columnfamily=YOUR CF,ReadCount
   org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
   KEYSPACE,columnfamily=YOUR CF,WriteCount
  
   Also I suggest to check output of nodetool compactionstats.
  
   --
   Alexey
  
  
 
 
 
 
  --
  -Ben




 --
 -Ben





-- 
-Ben


Re: Cassandra nodes loaded unequally

2012-10-16 Thread Ben Kaehne
I checked this and all the numbers seemed to be about the same. Although
the files would compact from time to time. There was nothing to suggest why
1 node, ongoingly had less load then the others.

Regards,

On Fri, Oct 12, 2012 at 7:22 PM, Alexey Zotov azo...@griddynamics.comwrote:

 Hi Ben,

 I suggest you to compare amount of queries for each node. May be the
 problem is on the client side.
 Yoy can do that using JMX:
 org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
 KEYSPACE,columnfamily=YOUR CF,ReadCount
 org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
 KEYSPACE,columnfamily=YOUR CF,WriteCount

 Also I suggest to check output of nodetool compactionstats.

 --
 Alexey





-- 
-Ben


Re: Cassandra nodes loaded unequally

2012-10-16 Thread Ben Kaehne
Not connecting to the same node every time. Using Hector to ensure an even
distribution of connections accross the cluster.

Regards,

On Sat, Oct 13, 2012 at 4:15 AM, B. Todd Burruss bto...@gmail.com wrote:

 are you connecting to the same node every time?  if so, spread out
 your connections across the ring

 On Fri, Oct 12, 2012 at 1:22 AM, Alexey Zotov azo...@griddynamics.com
 wrote:
  Hi Ben,
 
  I suggest you to compare amount of queries for each node. May be the
 problem
  is on the client side.
  Yoy can do that using JMX:
  org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
  KEYSPACE,columnfamily=YOUR CF,ReadCount
  org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
  KEYSPACE,columnfamily=YOUR CF,WriteCount
 
  Also I suggest to check output of nodetool compactionstats.
 
  --
  Alexey
 
 




-- 
-Ben


Re: Cassandra nodes loaded unequally

2012-10-16 Thread Ben Kaehne
Nothing unusual.

All servers are exactly the same. Nothing unusual in the log files. Is
there any level of logging that I should be turning on?

Regards,

On Wed, Oct 17, 2012 at 9:51 AM, Andrey Ilinykh ailin...@gmail.com wrote:

 With your environment (3 nodes, RF=3) it is very difficult to get
 uneven load. Each node receives the same number of read/write
 requests. Probably something is wrong on low level, OS or VM. Do you
 see anything unusual in log files?

 Andrey

 On Tue, Oct 16, 2012 at 3:40 PM, Ben Kaehne ben.kae...@sirca.org.au
 wrote:
  Not connecting to the same node every time. Using Hector to ensure an
 even
  distribution of connections accross the cluster.
 
  Regards,
 
  On Sat, Oct 13, 2012 at 4:15 AM, B. Todd Burruss bto...@gmail.com
 wrote:
 
  are you connecting to the same node every time?  if so, spread out
  your connections across the ring
 
  On Fri, Oct 12, 2012 at 1:22 AM, Alexey Zotov azo...@griddynamics.com
  wrote:
   Hi Ben,
  
   I suggest you to compare amount of queries for each node. May be the
   problem
   is on the client side.
   Yoy can do that using JMX:
   org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
   KEYSPACE,columnfamily=YOUR CF,ReadCount
   org.apache.cassandra.db:type=ColumnFamilies,keyspace=YOUR
   KEYSPACE,columnfamily=YOUR CF,WriteCount
  
   Also I suggest to check output of nodetool compactionstats.
  
   --
   Alexey
  
  
 
 
 
 
  --
  -Ben




-- 
-Ben


Syncing nodes + Cassandra Data Availability

2012-08-08 Thread Ben Kaehne
Good morning,

Our application runs on a 3 node cassandra cluster with RF of 3.

We use quorum operations against this cluster in hopes of garunteeing
consistency.

One scenario in which an issue can occur here is:
Out of our 3 nodes, only 2 are up.
We perform a write to say, a new key.
The down node is started again, at the same time, a different node is
brought offline.
At this point. The data we have written above is on one node, but not the
other online node. Meaning quorum reads will fail.

Surely other people have encountered such issue before.

We disabled hinted handoffs originally as to not have to worry about race
conditions of disk space on servers filling up due to piling up handoffs.
Although perhaps this may somewhat aid the situation (although from what I
read, it does not completely remedy the circumstance).

If so, how are you dealing with it?
From what I understand a read repair (in which we have set to 1.0) will
only be performed on a successful read occurs, in which will not happen
here.

nodetool repair seems rather slow, is manual and does not suit our
situation where data has to be available apon demand.

Regards,

-- 
-Ben


Cassandra startup times

2012-07-18 Thread Ben Kaehne
Good evening,

I am interested in improving the startup time of our cassandra cluster.

We have a 3 node cluster (replication factor of 3) in which our application
requires quorum reads and writes to function.

Each machine is well specced with 24gig of ram, 10 cores, jna enabled etc.

On each server our keyspace files are so far around 90 Gb (stored on NFS
although I am not seeing signs that we have much network io). This size
will grow in future.

Our startup time for 1 server at the moment is greater then half an hour
(45 minutes to 50 minutes even) which is putting a risk factor on the
resiliance of our service. I have tried version 1.09 to latest 1.12.

I do not see too much system utilization while starting either.

I gazed apon an article suggesting increased speed in 1.2 although when I
set it up, it did not seem to be any faster at all (if not slower).

I was observing what was happening during startup and I noticed (via
strace), cassandra was doing lots of 8 byte reads from:

 
/var/lib/cassandra/data/XX/YY/XXX-YYY-hc-1871-CompressionInfo.db
 
/var/lib/cassandra/data/XX/YY/XXX-YYY-hc-1874-CompressionInfo.db

Also... Is there someone I can change the 8 byte reads to something
greater? 8 byte reads across NFS is terribly inefficient (and I am guessing
the cause of our terribly slow startup times).

Regards,

-- 
-Ben


Multiple keyspace question

2012-07-05 Thread Ben Kaehne
Good evening,

I have read multiple keyspaces are bad before in a few discussions, but to
what extent?

We have some reasonably powerful machines and looking to host
an additional (currently we have 1) 2 keyspaces within our cassandra
cluster (of 3 nodes, using RF3).

At what point does adding extra keyspaces start becoming an issue? Is there
anything special we should be considering or watching out for as we
implement this?

I could not imagine that all cassandra users out there are running one
massive keyspace, and at the same time can not imaging that all cassandra
users have multiple clusters just to host different keyspaces.

Regards.

-- 
-Ben


Re: Maximum load per node

2012-06-07 Thread Ben Kaehne
Does this max load have correlation to replication factor?

IE a 3 node cluster with rf of 3. Should i be worried at {max load} X 3 or
what people generally mention the max load is?

On Thu, Jun 7, 2012 at 10:55 PM, Filippo Diotalevi fili...@ntoklo.comwrote:

  Hi,
 one of latest Aaron's observation about the max load per Cassandra node
 caught my attention

 At ~840GB I'm probably running close
 to the max load I should have on a node,

 [AM] roughly 300GB to 400GB is the max load
 Since we currently have a Cassandra node with roughly 330GB of data, it
 looks like that's a good time for us to really understand what's that limit
 in our case. Also, a (maybe old) Stackoverflow question at
 http://stackoverflow.com/questions/4775388/how-much-data-per-node-in-cassandra-cluster
  ,
 seems to suggest a higher limit per node.

 Just considering the compaction issues, what are the factors we need to
 account to determine the max load?

 * disk space
 Datastax cassandra docs state (pg 97) that a major compaction temporarily
 doubles disk space usage. Is it a safe estimate to say that the Cassandra
 machine needs to have roughly the same amount of free disk space as the
 current load of the Cassandra node, or are there any other factor to
 consider?

 * RAM
 Is the amount of RAM in the machine (or dedicated to the Cassandra node)
 affecting in any way the speed/efficiency of the compaction process?

 * Performance degradation for overloaded nodes?
 What kind of performance degradation can we expect for a Cassandra node
 which is overloaded? (f.i. with 500GB or more of data)


 Thanks for the clarifications,
 --
 Filippo Diotalevi





-- 
-Ben