Re: Apache cassandra not joining cluster ring

2014-04-16 Thread Joyabrata Das
After explicitly enabling port 9160 (the Cassandra Thrift client port) on that particular node, it connected to the ring; however, the other nodes are working with the same configuration. OpsCenter is now installed on that particular node after explicitly enabling 61620 (OpsCenter monitoring port), 61621 (OpsCen

RE: Stalled streams during repairs

2014-04-16 Thread Andrew Cooper
First, thanks for the quick reply and jira links! It's helpful to know we are not the only ones experiencing these issues. "Are you sure you actually want/need to run repair as frequently as you currently are? Reducing the frequency won't make it work any better, but it will reduce the number o

Re: How safe is "nodetool move" in 1.2 ?

2014-04-16 Thread Colin
I have recently tested this scenario under a couple versions of Cassandra and have been able to write and read to/from the cluster while performing a move. I performed these tests utilizing an RF=2 on a three node cluster while performing quorum reads and received no errors due to unavailable re

Re: Stalled streams during repairs

2014-04-16 Thread Robert Coli
On Wed, Apr 16, 2014 at 3:17 PM, Andrew Cooper wrote: > We are running cassandra 1.2.5. I have checked through the change logs up > to 1.2.16 and do not see any indications of this being a known (and fixed) > issue. > Repair has been re-written in 2.0, because it was broken; that's why you don't

Re: How safe is "nodetool move" in 1.2 ?

2014-04-16 Thread Richard Low
On 16 April 2014 05:08, Jonathan Lacefield wrote: > Assuming you have enough nodes not undergoing "move" to meet your CL > requirements, then yes, your cluster will still accept reads and writes. > However, it's always good to test this before doing it in production to > ensure your cluster and a

Stalled streams during repairs

2014-04-16 Thread Andrew Cooper
We are running into a reproducible issue in one of our cassandra clusters. We are seeing that during an anti-entropy repair, if a particular sstable is streaming to multiple endpoints and the two streams happen to hit the same section of the sstable, it stalls all streams indefinitely on the so

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
I've shut down two of the nodes and am bootstrapping one right now. Is there any way to tell when it will finish bootstrapping? On Wed, Apr 16, 2014 at 2:56 PM, Phil Burress wrote: > Thanks! > > > On Wed, Apr 16, 2014 at 2:50 PM, Robert Coli wrote: > >> On Wed, Apr 16, 2014 at 11:16 AM, Phil Bu

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
Thanks! On Wed, Apr 16, 2014 at 2:50 PM, Robert Coli wrote: > On Wed, Apr 16, 2014 at 11:16 AM, Phil Burress > wrote: > >> Thanks very much for the response. I'm not using vnodes, does that >> matter? >> > > Not in your case. In some cases it is safe to bootstrap multiple nodes > into a cluste

Re: Bootstrap Timing

2014-04-16 Thread Robert Coli
On Wed, Apr 16, 2014 at 11:16 AM, Phil Burress wrote: > Thanks very much for the response. I'm not using vnodes, does that matter? > Not in your case. In some cases it is safe to bootstrap multiple nodes into a cluster at once AT SPECIFIC TOKENS, because there is more than one replica set to boot

Re: Embedded Cassandra Performance

2014-04-16 Thread Sávio Teles
Thanks Chris! 2014-04-16 12:53 GMT-03:00 Chris Lohfink : > There will be a small performance improvement from not having the > app->cluster latency. If run on the same system (which I wouldn’t recommend) > the latency would be pretty irrelevant anyway unless you are fighting for > sub millisecond l

Re: Bootstrap Timing

2014-04-16 Thread Ken Hancock
Seed nodes don't bootstrap. https://issues.apache.org/jira/browse/CASSANDRA-5836 On Wed, Apr 16, 2014 at 2:17 PM, Phil Burress wrote: > Also, one more quick question. For the new nodes, do I add all three > existing nodes as seeds? Or just add one? > > > On Wed, Apr 16, 2014 at 2:16 PM, Phil

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
Also, one more quick question. For the new nodes, do I add all three existing nodes as seeds? Or just add one? On Wed, Apr 16, 2014 at 2:16 PM, Phil Burress wrote: > Thanks very much for the response. I'm not using vnodes, does that matter? > > > On Wed, Apr 16, 2014 at 2:13 PM, Robert Coli wro

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
Thanks very much for the response. I'm not using vnodes, does that matter? On Wed, Apr 16, 2014 at 2:13 PM, Robert Coli wrote: > On Wed, Apr 16, 2014 at 11:10 AM, Phil Burress > wrote: > >> How long does bootstrapping typically take? I have 3 existing nodes in >> our cluster with about 40GB ea

Re: Bootstrap Timing

2014-04-16 Thread Robert Coli
On Wed, Apr 16, 2014 at 11:10 AM, Phil Burress wrote: > How long does bootstrapping typically take? I have 3 existing nodes in our > cluster with about 40GB each. I've added three new nodes to the cluster. > They have been in bootstrap mode for a little over 3 days now. Should I be > concerned? Is

Bootstrap Timing

2014-04-16 Thread Phil Burress
Greetings, How long does bootstrapping typically take? I have 3 existing nodes in our cluster with about 40GB each. I've added three new nodes to the cluster. They have been in bootstrap mode for a little over 3 days now. Should I be concerned? Is there a way to tell how long it will take to finis

Re: Embedded Cassandra Performance

2014-04-16 Thread Chris Lohfink
There will be a small performance improvement from not having the app->cluster latency. If run on the same system (which I wouldn’t recommend) the latency would be pretty irrelevant anyway unless you are fighting for sub-millisecond latency (in which case get off the JVM). You would be able to acc

Re: Problems with adding datacenter and schema version disagreement

2014-04-16 Thread Russell Hatch
I think you might be seeing the issue reported in https://issues.apache.org/jira/browse/CASSANDRA-6971 If that's the case, it looks like a fix will be in 2.0.7 Thanks, Russ On Tue, Apr 15, 2014 at 11:48 PM, Umut Kocasaraç wrote: > Hi Olek, > > Could you solve the problem. Because we are exper

Re: Embedded Cassandra Performance

2014-04-16 Thread Sávio Teles
Is it advisable to run the embedded Cassandra in production? 2014-04-16 12:08 GMT-03:00 Sávio Teles : > I'm running a cluster with Cassandra and my app embedded. > > Regarding performance, is it better to run embedded Cassandra? > > What are the implications of running an embedded Cassandra? >

Embedded Cassandra Performance

2014-04-16 Thread Sávio Teles
I'm running a cluster with Cassandra and my app embedded. Regarding performance, is it better to run embedded Cassandra? What are the implications of running an embedded Cassandra? Thanks -- Regards, Sávio S. Teles de Oliveira voice: +55 62 9136 6996 http://br.linkedin.com/in/savioteles Me

Re: GC histogram analysis

2014-04-16 Thread Chris Lohfink
You can take a heap dump and find out who has references to it. You can also find out which column family they are from. Do you have a lot of tombstones, or data that's overwritten a lot, or are you doing a ton of reads? Maybe wide rows that you're querying across, or using filtering? Reads could hav

Re: How to clear all data using CQL?

2014-04-16 Thread James Horey
If you’re running unit tests and repeatedly clearing the Cassandra keyspaces, you may want to check out Ferry (ferry.opencore.io). It lets you stand up/destroy multiple Cassandra stacks locally on your machine and is useful for the use case you described. I’m the author of Ferry, and would be gla

Re: GC histogram analysis

2014-04-16 Thread Ruchir Jha
No we don't. Sent from my iPhone > On Apr 16, 2014, at 9:21 AM, Mark Reddy wrote: > > Do you delete and/or set TTLs on your data? > > >> On Wed, Apr 16, 2014 at 2:14 PM, Ruchir Jha wrote: >> Hi, >> >> I am trying to investigate ParNew promotion failures happening routinely in >> productio

Re: GC histogram analysis

2014-04-16 Thread Mark Reddy
Do you delete and/or set TTLs on your data? On Wed, Apr 16, 2014 at 2:14 PM, Ruchir Jha wrote: > Hi, > > I am trying to investigate ParNew promotion failures happening routinely > in production. As part of this exercise, I enabled > -XX:PrintHistogramBeforeFullGC and saw the following output. A

GC histogram analysis

2014-04-16 Thread Ruchir Jha
Hi, I am trying to investigate ParNew promotion failures happening routinely in production. As part of this exercise, I enabled -XX:PrintHistogramBeforeFullGC and saw the following output. As you can see there are a ton of Columns, ExpiringColumns and DeletedColumns before GC ran and these numbers

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Thanks, I think I got the point. CAS doesn't make much sense on a single node. -Vivek On Wed, Apr 16, 2014 at 6:37 PM, Tupshin Harper wrote: > It will work for correctness, but give you a very inaccurate view of > performance. > > -Tupshin > On Apr 16, 2014 6:05 AM, "Vivek Mishra" wrote: > >> Tha

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Tupshin Harper
It will work for correctness, but give you a very inaccurate view of performance. -Tupshin On Apr 16, 2014 6:05 AM, "Vivek Mishra" wrote: > Thanks Mark and Tupshin. > > So on a single node, if I set the consistency level to SERIAL and create a > keyspace with RF=1, would that work? > > > -Vivek > > >

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Thanks Mark and Tupshin. So on a single node, if I set the consistency level to SERIAL and create a keyspace with RF=1, would that work? -Vivek On Wed, Apr 16, 2014 at 6:32 PM, Mark Reddy wrote: > The Paxos protocol used for CAS operations will always use at least a > consistency level effectively

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Mark Reddy
The Paxos protocol used for CAS operations will always use at least a consistency level effectively equivalent to QUORUM (called SERIAL) when writing, even if you explicitly specify a lower level, e.g. ANY or ONE. Setting consistency level to ALL will make the write execute on all replicas if the c

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Tupshin Harper
No, but you do need a quorum of nodes. http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_config_consistency_c.html SERIAL A write must be written conditionally to the commit log and memory table on a quorum of replica nodes. Used to achieve linearizable consistency

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Thanks Mark. Does this mean that with RF=3, all 3 nodes must be up and running for CAS updates? -Vivek On Wed, Apr 16, 2014 at 6:22 PM, Mark Reddy wrote: > create keyspace twitter with replication = {'class':'SimpleStrategy', >> 'replication_factor' : 3} > > > Your replication factor is your issue

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Tupshin Harper
That is entirely the problem. By having one node, but RF=3, you have created this exact problem. Each node can own, at most, one copy of each row of data, so the cluster is treating this (correctly) as a degraded runtime environment where any operation requiring CL > 1 will always fail. LWT requ
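The arithmetic behind this reply can be sketched in a few lines of Python. This is only an illustration, not Cassandra code: the function names quorum and can_satisfy_quorum are hypothetical, and the sketch assumes the usual quorum definition of floor(RF / 2) + 1 replicas.

```python
def quorum(replication_factor: int) -> int:
    """Number of replicas that must respond for a QUORUM-level operation."""
    if replication_factor < 1:
        raise ValueError("replication factor must be at least 1")
    return replication_factor // 2 + 1


def can_satisfy_quorum(replication_factor: int, live_replicas: int) -> bool:
    """True if enough replicas are alive to reach a quorum."""
    return live_replicas >= quorum(replication_factor)


# One node can hold at most one replica of each row, so with RF=3 a
# single-node cluster has 1 live replica but needs 2 for a quorum.
print(quorum(3))                 # 2
print(can_satisfy_quorum(3, 1))  # False
print(can_satisfy_quorum(1, 1))  # True
```

With RF=1 on the same single node, the quorum size drops to 1, which is why lowering the replication factor is enough for the lightweight transaction to go through.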

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Mark Reddy
> > create keyspace twitter with replication = {'class':'SimpleStrategy', > 'replication_factor' : 3} Your replication factor is your issue here, you have a single node and a RF=3. For a single node setup your RF should be 1. You can find more info about replication here: http://www.datastax.com/

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Hi, Mine is a simple case. Running on single node only. Keyspace is: create keyspace twitter with replication = {'class':'SimpleStrategy', 'replication_factor' : 3} -Vivek On Wed, Apr 16, 2014 at 1:27 AM, Tupshin Harper wrote: > Please provide your keyspace definition, and the output of "nod

CqlPagingInputFormat: paging through wide rows

2014-04-16 Thread Paolo Estrella
Hello, I've just upgraded to Cassandra 1.2.16. I've also started using the CqlPagingInputFormat within my map/reduce tasks. I have a question with regard to using CqlPagingInputFormat for paging through wide rows. I don't see a way to input more than one column at a time into my Mapper. I suppos

Re: How safe is "nodetool move" in 1.2 ?

2014-04-16 Thread Jonathan Lacefield
Assuming you have enough nodes not undergoing "move" to meet your CL requirements, then yes, your cluster will still accept reads and writes. However, it's always good to test this before doing it in production to ensure your cluster and app will function as designed. Jonathan Lacefield Solutions

How safe is "nodetool move" in 1.2 ?

2014-04-16 Thread Oleg Dulin
I need to rebalance my cluster. I am sure this question has been asked before -- will 1.2 continue to serve reads and writes correctly while move is in progress ? Need this for my sanity. -- Regards, Oleg Dulin http://www.olegdulin.com
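For readers planning such a rebalance without vnodes, the target tokens are usually computed by dividing the partitioner's token range evenly. The following is a minimal sketch of that calculation; the function name balanced_tokens and the partitioner labels "murmur3" and "random" are made up for illustration, and which partitioner the poster's cluster uses is not stated in the message.

```python
def balanced_tokens(node_count: int, partitioner: str = "murmur3") -> list:
    """Evenly spaced initial tokens for a single-token-per-node ring."""
    if node_count < 1:
        raise ValueError("node_count must be at least 1")
    if partitioner == "murmur3":
        # Murmur3Partitioner tokens range from -2**63 to 2**63 - 1.
        step = 2**64 // node_count
        return [-(2**63) + i * step for i in range(node_count)]
    if partitioner == "random":
        # RandomPartitioner tokens range from 0 to 2**127 - 1.
        step = 2**127 // node_count
        return [i * step for i in range(node_count)]
    raise ValueError("unknown partitioner: " + partitioner)


for token in balanced_tokens(3, "random"):
    print(token)
```

Each printed value would then be the argument to "nodetool move" on the corresponding node, one node at a time.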

Deleting column names

2014-04-16 Thread Sebastian Schmidt
Hi, I'm using a Cassandra table to store some data. I created the table like this: CREATE TABLE IF NOT EXISTS table_name (s BLOB, p BLOB, o BLOB, c BLOB, PRIMARY KEY (s, p, o, c)); I need at least the p column to be sorted, so that I can use it in a WHERE clause. So as far as I understand, th

Re: How to clear all data using CQL?

2014-04-16 Thread Sebastian Schmidt
Thank you, that worked! On 16.04.2014 at 10:46, Mark Reddy wrote: > /select keyspace_name from system.schema_keyspaces;/ >

Re: Replication Factor question

2014-04-16 Thread Markus Jais
Hi Rob, thanks. How many nodes do you have running in those 5 racks and RF 5? Only 5 nodes or more? Markus Robert Coli wrote at 20:36 on Tuesday, 15 April 2014: On Tue, Apr 15, 2014 at 6:14 AM, Ken Hancock wrote: > >Keep in mind if you lose the wrong two, you can't satisfy quorum.  In a 5-no

Re: How to clear all data using CQL?

2014-04-16 Thread Mark Reddy
To get a list of all keyspaces via CQL you can run: *describe keyspaces* or *select keyspace_name from system.schema_keyspaces;* On Wed, Apr 16, 2014 at 9:23 AM, Sebastian Schmidt wrote: > Hey > > I want to do unit tests for software that uses Cassandra as storage > backend. Using the Hector cl
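Once the keyspace names have been fetched with the select above, turning them into DROP statements for a unit-test teardown could look roughly like this. It is a sketch under assumptions: the helper drop_statements and the SYSTEM_KEYSPACES set are hypothetical, the list of system keyspaces to skip may differ by Cassandra version, and no driver call is shown because the thread does not say which client is in use.

```python
# Keyspaces that Cassandra manages itself and that a test teardown should
# leave alone (assumed list; check it against the version in use).
SYSTEM_KEYSPACES = frozenset({"system", "system_auth", "system_traces"})


def drop_statements(keyspace_names):
    """Build one DROP KEYSPACE statement per non-system keyspace."""
    statements = []
    for name in sorted(keyspace_names):
        if name in SYSTEM_KEYSPACES:
            continue
        # Double-quote the identifier so case-sensitive names survive.
        quoted = '"' + name.replace('"', '""') + '"'
        statements.append("DROP KEYSPACE " + quoted + ";")
    return statements


# Names as they might come back from system.schema_keyspaces.
for statement in drop_statements(["system", "twitter", "system_traces", "app"]):
    print(statement)
```

The resulting strings can be executed one by one through whatever CQL client the tests already use.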

How to clear all data using CQL?

2014-04-16 Thread Sebastian Schmidt
Hey I want to do unit tests for software that uses Cassandra as storage backend. Using the Hector client, I could just iterate over all keyspaces and delete them. In CQL, there is no query to list all keyspaces. How can I drop all Keyspaces from Cassandra, without knowing every single Keyspace nam