RE: Reboot, now node down 0.8rc1

2011-05-24 Thread Scott McPheeters
It was a 0.8beta2 install that last week I upgraded to rc1. I will turn the logging up to debug the next time I have issues to get more details. Thank you. Scott -Original Message- From: aaron morton [mailto:aa...@thelastpickle.com] Sent: Monday, May 23, 2011 6:42 PM To: user@cassa

Re: Reboot, now node down 0.8rc1

2011-05-24 Thread Sylvain Lebresne
Do you have a one of your super column family with a fairly short value for gc_grace_seconds ? I (strongly) suspect you're hitting https://issues.apache.org/jira//browse/CASSANDRA-2675. -- Sylvain On Tue, May 24, 2011 at 2:29 PM, Scott McPheeters wrote: > It was a 0.8beta2 install that last wee

RE: Reboot, now node down 0.8rc1

2011-05-24 Thread Scott McPheeters
I have not changed any defaults as of yet. Yes, I do have super columns, but my gc_grace_seconds is default. Scott -Original Message- From: Sylvain Lebresne [mailto:sylv...@datastax.com] Sent: Tuesday, May 24, 2011 8:53 AM To: user@cassandra.apache.org Subject: Re: Reboot, now node d

Re: repair question

2011-05-24 Thread Sylvain Lebresne
On Mon, May 23, 2011 at 9:21 PM, Peter Schuller wrote: >> I'm a bit lost: I tried a repair yesterday with only one CF and that didn't >> really work the way I expected but I thought that would be a bug which only >> affects that special case. >> >> So I tried again for all CFs. >> >> I started w

Re: repair question

2011-05-24 Thread Sylvain Lebresne
On Tue, May 24, 2011 at 12:40 AM, Daniel Doubleday wrote: > We are performing the repair on one node only. Other nodes receive reasonable > amounts of data (~500MB).  It's only the repairing node itself which > 'explodes'. That, for instance, is a bit weird. That the node on which the repair is

Re: repair question

2011-05-24 Thread Edward Capriolo
On Tue, May 24, 2011 at 9:41 AM, Sylvain Lebresne wrote: > On Tue, May 24, 2011 at 12:40 AM, Daniel Doubleday > wrote: > > We are performing the repair on one node only. Other nodes receive > reasonable amounts of data (~500MB). It's only the repairing node itself > which 'explodes'. > > That, f

EC2 node adding trouble

2011-05-24 Thread Marcus Bointon
Hi, First time here. I'm having trouble adding a third node to an existing 2-node ring (successfully upgraded from 0.72) running cassandra 0.8rc1 (successfully upgraded from 0.72) on ubuntu on EC2. Evidently the seed node is working as the second node is already talking to it, nodetool lists b

issue/minor bug with counters ?

2011-05-24 Thread Yang
if you have only counter columns in your keyspace, and do a lot of updates on a few keys, the getLiveSize() of memtable actually returns the total amount of traffic that has gone into the Memtable, not the real size, so you end up producing very small SStables, with only a few KBytes. (I have alrea

Re: issue/minor bug with counters ?

2011-05-24 Thread Jonathan Ellis
"the total amount of traffic that has gone into the Memtable" is how throughput is defined, so this is working as expected. On Tue, May 24, 2011 at 11:15 AM, Yang wrote: > if you have only counter columns in your keyspace, and do a lot of > updates on a few keys, > the getLiveSize() of memtable a

Re: issue/minor bug with counters ?

2011-05-24 Thread Yang
then I guess Memtable.getLiveSize() should not rely on the currentThroughput for size calculation, an extra var currentSize is needed during the bookkeeping of resolve() On Tue, May 24, 2011 at 9:47 AM, Jonathan Ellis wrote: > "the total amount of traffic that has gone into the Memtable" is how

Link & mirrors to download Cassandra is down...

2011-05-24 Thread Sameer Farooqui
http://cassandra.apache.org/download If you click the link on the right to download 0.7.6-2, the main link to download Cassandra (@ kahuki.com) and a bunch of the mirrors are down. Also, one of the HTTP mirrors takes me to a bare chested picture of Brad Pitt: http://www.gossipcheck.com/mirrors/ap

Re: Cassandra 0.8 questions

2011-05-24 Thread Jian Fang
Does anyone have a good suggestion on my second question? I believe that question is a pretty common one. My third question is a design question. For the same data, we can stored them into multiple column families or a single column family with multiple super columns. >From Cassandra read/write pe

Re: Link & mirrors to download Cassandra is down...

2011-05-24 Thread Jeremy Hanna
The link was fixed in cassandra.apache.org/download a couple of hours ago. For the time being it may be better to scroll down to the Backup Sites section and use one of those links. On May 24, 2011, at 12:24 PM, Sameer Farooqui wrote: > http://cassandra.apache.org/download > > If you click th

Re: EC2 node adding trouble

2011-05-24 Thread Sameer Farooqui
What region and availability zones are the different nodes in? Are you using EC2 Snitch? Did you set up the cluster using the Datastax AMI? - Sameer On Tue, May 24, 2011 at 8:35 AM, Marcus Bointon wrote: > Hi, > > First time here. I'm having trouble adding a third node to an existing > 2-node r

Re: Cassandra 0.8 questions

2011-05-24 Thread Victor Kabdebon
It's not really possible to give a general answer your second question, it depends of your implementation. Personally I do two thing : the first one is to map arrays with a key and then name of column as a key of your array and value of column as the data storage. However for some application, as I

Re: Cassandra 0.8 questions

2011-05-24 Thread Jian Fang
Thanks a lot. This is really helpful. John On Tue, May 24, 2011 at 1:34 PM, Victor Kabdebon wrote: > It's not really possible to give a general answer your second question, it > depends of your implementation. Personally I do two thing : the first one is > to map arrays with a key and then name

Re: repair question

2011-05-24 Thread Daniel Doubleday
Ok thanks for your help Sylvain - much appreciated In short: I believe that most of this is me not looking clearly yesterday. There are only one / two points that i don't get. Maybe you could help me out there. First the ~500MB thing is BS. The closer neighbors recieved around 80G and the oth

Re: EC2 node adding trouble

2011-05-24 Thread Marcus Bointon
On 24 May 2011, at 19:33, Sameer Farooqui wrote: > What region and availability zones are the different nodes in? Are you using > EC2 Snitch? Did you set up the cluster using the Datastax AMI? The two existing ones are in us-east-1c and us-east-1d, the new one is in us-east-1c, so all same regi

Re: repair question

2011-05-24 Thread Peter Schuller
> And the more spread out the out of sync is, the worse it will be. Though in > general we can expect that to not be too spread out. For the same reason than > why caches work. (Speaking generally now, not addressing the OP's issue) I'm not sure I buy that. Unless your data is such that hotness i

Re: repair question

2011-05-24 Thread Peter Schuller
> Hmmm, I'm starting to like this idea more and more the more I think of it ;) Filed: https://issues.apache.org/jira/browse/CASSANDRA-2699 -- / Peter Schuller

Re: EC2 node adding trouble

2011-05-24 Thread Sameer Farooqui
Even with AutoBootstrap it is recommended that you always specify the InitialToken on the new node because the picking of an initial token will almost certainly result in an unbalanced ring. Right now, I'm afraid that if you simply copied the YAML file from one of the two nodes to the 3rd node, th

Re: EC2 node adding trouble

2011-05-24 Thread aaron morton
Check the listen_address and rpc_address in the yaml file for each node. I think they are normally set to the private and public respectively. This may make your live easier http://www.datastax.com/dev/blog/setting-up-a-cassandra-cluster-with-the-datastax-ami Cheers - Aaron Mor

Measure Latency

2011-05-24 Thread Stephan Pfammatter
What’s the recommended way of measuring latency between nodes in a cluster? By that I’m not referring to read/write latency for a given KS/CF. Basically I want to inject a row in a node A and want to see how long it takes to get to node B (assuming proper RF is set). I have already some network

How to make use of Cassandra raw row keys?

2011-05-24 Thread Suan Aik Yeo
We're using Cassandra to store our sessions, all in a single column family "Sessions" with the format: Sessions['session_key'] = {'val': } (session_key is a randomly generated hash) The "raw" keys I'm talking about are for example the 'key' value as seen from Cassandra DEBUG output: insert writing

Re: Measure Latency

2011-05-24 Thread Aaron Morton
Once the cluster has returned to the client you know the write has been committed to Consistency Level  number of nodes. i.e. If you send an insert using QUORUM consistency to a cluster with Replication Factor 3, and you get a non error response you know the write has occurred on at least 2 nodes (

Re: How to make use of Cassandra raw row keys?

2011-05-24 Thread Aaron Morton
The key printed in the DEBUG message is the byte array the server was given as the key converted to hex. Your client API may have converted the string to ascii bytes before sending to the server.e.g. here is me writing a 'foo' key to the server DEBUG 15:52:15,818 insert writing local RowMutation(ke

monitoring cassandra with JMX

2011-05-24 Thread vineet daniel
Hi I have just written a little note on how to monitor cassandra... http://vineetdaniel.me/2011/03/26/monitoring-cassandra-with-jmx/ I hope it helps the community. Regards Vineet Daniel Cell : +918106217121 Websites : Blog | Linkedin

Re: EC2 node adding trouble

2011-05-24 Thread Marcus Bointon
On 25 May 2011, at 02:10, aaron morton wrote: > Check the listen_address and rpc_address in the yaml file for each node. I > think they are normally set to the private and public respectively. On all modes, listen_address and rpc_address are blank and 0.0.0.0 respectively, the seed node addres

Re: EC2 node adding trouble

2011-05-24 Thread Marcus Bointon
On 24 May 2011, at 23:58, Sameer Farooqui wrote: > Even with AutoBootstrap it is recommended that you always specify the > InitialToken on the new node because the picking of an initial token will > almost certainly result in an unbalanced ring. > > Right now, I'm afraid that if you simply copi