Re: Problem on node join the ring

2013-01-28 Thread aaron morton
> there is no streaming anymore Nodes only bootstrap once, when they are first started. > I have turned on the debug, this is what it is doing now(cpu is pretty much > idle), no any error message. Looks like it is receiving writes and reads, looks like it's part of the ring. Is this ring out

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread aaron morton
> So If I write to CF Users with rowkey="dean" > and to CF Schedules with rowkey="dean", it is actually one row? In my mental model that's correct. A RowMutation is a row key and a collection of (internal) ColumnFamilies which contain the columns to write for a single CF. This is the thing th

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread aaron morton
> * Why nodetool repair increases the data size that much? It's not likely > that much data needs to be repaired. Will that happen for all the subsequent > repair? Repair only detects differences in entire rows. If you have very wide rows then small differences in rows can result in a large

RE: data not shown up after some time

2013-01-28 Thread Matthias Zeilinger
How can I check for this secondary index read fails? Is it in the system.log or over the nodetool? Br, Matthias Zeilinger Production Operation - Shared Services P: +43 (0) 50 858-31185 M: +43 (0) 664 85-34459 E: matthias.zeilin...@bwinparty.com bwin.party services (Austria) GmbH Marxergasse 1B A

Re: JDBC, Select * Cql2 vs Cql3 problem ?

2013-01-28 Thread aaron morton
What is your table spec ? Do you have the full stack trace from the exception ? Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 29/01/2013, at 8:15 AM, Andy Cobley wrote: > I have the following code in my app using

Re: cluster issues

2013-01-28 Thread aaron morton
> We can always be proactive in keeping the time sync. But, Is there any way to > recover from a time drift (in a reactive manner)? Since it was a lab > environment, I dropped the KS (deleted data directory) There is a way to remove future dated columns, but it not for the faint hearted. Basic

Re: Cassandra timeout whereas it is not much busy

2013-01-28 Thread aaron morton
> From what I could read there seems to be a contention issue around the > flushing (the "switchlock" ?). Cassandra would then be slow, but not using > the entire cpu. I would be in the strange situation I was where I reported my > issue in this thread. > Does my theory makes sense ? If you are

getting error for decimal type data

2013-01-28 Thread Kuldeep Mishra
while I an trying to list column family data using cassandra-cli then I am getting following problem for decimal type data, any suggestion will be appreciated. Exception in thread "main" java.lang.OutOfMemoryError: Java heap space at java.lang.AbstractStringBuilder.(AbstractStringBuilder.java:

Re: data not shown up after some time

2013-01-28 Thread aaron morton
If you are seeing failed secondary index reads you may be seeing this https://issues.apache.org/jira/browse/CASSANDRA-5079 Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 29/01/2013, at 3:31 AM, Matthias Zeilinger

RE: Accessing Metadata of Column Familes

2013-01-28 Thread Rishabh Agrawal
Any points on the same. - Rishabh - Reply message - From: "Rishabh Agrawal" To: "user@cassandra.apache.org" Subject: Accessing Metadata of Column Familes Date: Mon, Jan 28, 2013 5:56 pm I found following issues while working on Cassandra version 1.2, CQL 3 and Thrift protocol 19.35.0.

Re: JNA not found.

2013-01-28 Thread Tim Dunphy
I went to github to try to download jna again. I downloaded version 3.5.1 [root@cassandra-node01 cassandrahome]# ls -l lib/jna-3.5.1.jar -rw-r--r-- 1 root root 692603 Jan 28 21:57 lib/jna-3.5.1.jar I noticed in the datastax docs that java 7 was not recommended so I downgraded to java 6 [root@cas

Re: Node selection when both partition key and secondary index field constrained?

2013-01-28 Thread Mike Sample
Thanks Aaron. So basically it's merging the results 2 separate queries: Indexed scan (token-range) intersect foo.flag_index=true where the latter query hits the entire cluster as per the secondary index FAQ entry. Thus the overall query would fail if LOCAL_QUORUM was requested, RF=3 and 2 n

1.2 Authentication

2013-01-28 Thread Daning Wang
We were using SimpleAuthenticator on 1.1.x, it worked fine. While testing 1.2, I have put classes under example/simple_authentication in a jar and copy to lib directory, the class is loaded. however, when I try to connect with correct user/password, it gives me error ./cqlsh s2.dsat103-e1a -u xxx

Cass returns Incorrect column data on writes during flushing

2013-01-28 Thread Elden Bishop
I'm trying to track down some really worrying behavior. It appears that writing multiple columns while a table flush is occurring can result in Cassandra recording its data in a way that makes columns visible only to some queries but not others. Ie. Query for a single column works but the colum

Re: What is the default 'key_validation_class' on secondary INDEX(es)

2013-01-28 Thread Alan Ristić
2013/1/28 Sylvain Lebresne > If you are asking for the key_validation_class of the Index CF, then it's > the column type that defines it Sylvain, that was the one I meant, great. Tnx for explanation. *Alan Ristić* *m*: 040 423 688

JNA not found.

2013-01-28 Thread Tim Dunphy
Hey List, I just downloaded 1.21 and have set it up across my cluster. When I noticed the following notice: INFO 18:14:53,828 JNA not found. Native methods will be disabled. So I downloaded jna.jar from git hub and moved it to the cassandra /lib directory. I changed mod to 755 as per the datas

Understanding Virtual Nodes on Cassandra 1.2

2013-01-28 Thread Zhong Li
Hi All, Virtual Nodes is great feature. After I searched some document on Datastax website and some old ticket, seems that it works for random partitioner only, and leaves order preserved partitioner out of the luck. I may misunderstand, please correct me. if it doesn't love order preserved par

Problem on node join the ring

2013-01-28 Thread Daning Wang
I add a new node to ring(version 1.1.6), after more than 30 hours, it is still in the 'Joining' state Address DC RackStatus State Load Effective-Ownership Token 141784319550391026443072753096570088105 10.28.78.123datacenter1 rack1 Up Normal

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread Hiller, Dean
"If you write to 4 CF's with the same row key that is considered one mutation" Hm, I never considered this, never knew either.(very un-intuitive from a user perspective IMHO). So If I write to CF Users with rowkey="dean" and to CF Schedules with rowkey="dean", it is actually one row? (it's s

Re: cql: show tables in a keystone

2013-01-28 Thread Theo Hultberg
the DESCRIBE family of commands in cqlsh are wrappers around queries to the system keyspace, so if you want to inspect what keyspaces and tables exist from your application you can do something like: SELECT columnfamily_name, comment FROM system.schema_columnfamilies WHERE keyspace_name = 'test';

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread Derek Williams
I could be wrong about this, but when repair is run, it isn't just values that are streamed between nodes, it's entire sstables. This causes a lot of duplicate data to be written which was already correct on the node, which needs to be compacted away. As for speeding it up, no idea. On Mon, Jan

Re: Issues with CQLSH in Cassandra 1.2

2013-01-28 Thread aaron morton
I was able to replicate it… $ bin/nodetool -h 127.0.0.1 -p 7100 describering foo Schema Version:253da4a3-e277-35b5-8d04-dbeeb3c9508e TokenRange: TokenRange(start_token:3074457345618258602, end_token:-9223372036854775808, endpoints:[], rpc_endpoints:[], endpoint_details:[]) Toke

Re: Node selection when both partition key and secondary index field constrained?

2013-01-28 Thread aaron morton
It uses the index... cqlsh:dev> tracing on; Now tracing requests. cqlsh:dev> cqlsh:dev> cqlsh:dev> SELECT id, flag from foo WHERE TOKEN(id) > '-9939393' AND TOKEN(id) <= '0' AND flag=true; Tracing session: 128cab90-6982-11e2-8cd1-51eaa232562e activity

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread aaron morton
> > Another thing that's been confusing me is that when we talk about the data > model should the row key be inside or outside a column family? My mental model is: cluster == database keyspace == table row == a row in a table CF == a family of columns in one row (I think that's different to oth

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread Wei Zhu
Two fundamental questions: * Why did nodetool repairs bring so much data. A lot of SSTables are created, disk space almost doubled. * Why does level compactions run so slow? We turned off throtting completely and don't see much utilization of the SSD and CPU. One example, 0.7MB/s on

Re: cql: show tables in a keystone

2013-01-28 Thread Brian O'Neill
cqlsh> use keyspace; cqlsh:cirrus> describe tables; For more info: cqlsh> help describe -brian --- Brian O'Neill Lead Architect, Software Development Health Market Science The Science of Better Results 2700 Horizon Drive € King of Prussia, PA € 19406 M: 215.588.6024 € @boneill42

cql: show tables in a keystone

2013-01-28 Thread Paul van Hoven
Is there some way in cql to get a list of all tables or column families that belong to a keystore like "show tables" in sql?

Re: Cassandra pending compaction tasks keeps increasing

2013-01-28 Thread Wei Zhu
Any thoughts? Thanks. -Wei - Original Message - From: "Wei Zhu" To: user@cassandra.apache.org Sent: Friday, January 25, 2013 10:09:37 PM Subject: Re: Cassandra pending compaction tasks keeps increasing To recap the problem, 1.1.6 on SSD, 5 nodes, RF = 3, one CF only. After data load,

JDBC, Select * Cql2 vs Cql3 problem ?

2013-01-28 Thread Andy Cobley
I have the following code in my app using the JDBC (cassandra-jdbc-1.1.2.jar) drivers to CQL: try { rs= stmt.executeQuery("SELECT * FROM users"); }catch(Exception et){ System.out.println("Can not execute statement "+et); } When connecting to a CQL2 server (cassandra 1.1.5) the co

cluster issues

2013-01-28 Thread S C
One of our node in a 3 node cluster drifted by ~ 20-25 seconds. While I figured this pretty quickly, I had few questions that am looking for some answers. We can always be proactive in keeping the time sync. But, Is there any way to recover from a time drift (in a reactive manner)? Since it wa

Re: Unavaliable Exception

2013-01-28 Thread Everton Lima
Thanks for replies. 2013/1/25 Michael Kjellman > More nodes! > > On Jan 25, 2013, at 7:21 AM, "Alain RODRIGUEZ" wrote: > > fwiw, I have a mixed ubuntu 11.10 / 12.04 6 node cluster (AWS m1.xlarge). > > The load average is always between 0 and 5 for 11.10 nodes while 12.04 > nodes shows all the t

Re: unsubscribe

2013-01-28 Thread Alain RODRIGUEZ
You can try it a third time or you can rather try writing to user-unsubscr...@cassandra.apache.org Alain 2013/1/28 Olivier Devos > unsubscribe > >

Re: unsubscribe

2013-01-28 Thread Eric Evans
http://i.imgur.com/2ch9L.gif On Mon, Jan 28, 2013 at 8:36 AM, Olivier Devos wrote: > unsubscribe > -- Eric Evans Acunu | http://www.acunu.com | @acunu

unsubscribe

2013-01-28 Thread Olivier Devos
unsubscribe

Re: Cassandra timeout whereas it is not much busy

2013-01-28 Thread Nicolas Lalevée
I did some testing, I have a theory. First, we have it seems "a lot" of CF. And two are particularly every hungry in RAM, consuming a quite big amount of RAM for the bloom filters. Cassandra do not force the flush of the memtables if it has more than 6G of Xmx (luckily for us, this is the maxim

unsubscribe

2013-01-28 Thread Olivier Devos
unsubscribe From: Matthias Zeilinger [mailto:matthias.zeilin...@bwinparty.com] Sent: lundi 28 janvier 2013 15:32 To: user@cassandra.apache.org Subject: RE: data not shown up after some time Hi, No I have checked the TTL: 7776000 Very interesting is, if I do a simple “list ;” the da

RE: data not shown up after some time

2013-01-28 Thread Matthias Zeilinger
Hi, No I have checked the TTL: 7776000 Very interesting is, if I do a simple "list ;" the data is shown, but it I do a "get where ='';" it returns "0 Row Returned". How can that be? Br, Matthias Zeilinger Production Operation - Shared Services P: +43 (0) 50 858-31185 M: +43 (0) 664 85-34459

RE: data not shown up after some time

2013-01-28 Thread Viktor Jevdokimov
Are you sure your app is setting TTL correctly? TTL is in seconds. For 90 days it have to be 90*24*60*60=7776000. What If you set by accident 777600 (10 times less) - that will be 9 days, almost what you see. Best regards / Pagarbiai Viktor Jevdokimov Senior Developer Email: viktor.jevdoki...@ad

[RELEASE] Apache Cassandra 1.2.1 released

2013-01-28 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of Apache Cassandra version 1.2.1. Cassandra is a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model. You can read more here: http://cassand

data not shown up after some time

2013-01-28 Thread Matthias Zeilinger
Hi, I´m a simple operations guy and new to Cassandra. I have the problem that one of our application is writing data into Cassandra (but not deleting them, because we should have a 90 days TTL). The application operates in 1 KS with 5 CF. my current setup: 3 node cluster and KS has a RF of 3 (I

Re: Accessing Metadata of Column Familes

2013-01-28 Thread Brian O'Neill
Through CQL, you see the logical schema. Through CLI, you see the physical schema. This may help: http://www.datastax.com/dev/blog/cql3-for-cassandra-experts -brian On Mon, Jan 28, 2013 at 7:26 AM, Rishabh Agrawal wrote: > I found following issues while working on Cassandra version 1.2, CQL 3 a

RE: Accessing Metadata of Column Familes

2013-01-28 Thread Rishabh Agrawal
I found following issues while working on Cassandra version 1.2, CQL 3 and Thrift protocol 19.35.0. Case 1: Using CQL I created a table t1 with columns col1 and col2 with col1 being my primary key. When I access same data using CLI, I see col1 gets adopted as rowkey and col2 being another colu

Re: Denormalization

2013-01-28 Thread chandra Varahala
My experience we can design main column families and lookup column families. Main column family have all denormalized data,lookup column families have rowkey of denormalized column families's column. In users column family all user's denormalized data and lookup column family name like userB

Re: CQL3 jdbc and Tomcat resource

2013-01-28 Thread Andy Cobley
Apologies, I was missing a few cassandra jar libs in the tomcat library. Andy On 28 Jan 2013, at 11:31, Andy Cobley wrote: > I tried to add a CQL3 jdbc resource to tomcat 7 in a context.xml file (in a > Eclipse project) as follows: > > name="jdbc/CF1" > factory="o

Re: What is the default 'key_validation_class' on secondary INDEX(es)

2013-01-28 Thread Sylvain Lebresne
Your question is missing a "what". What do you want to know the default of? If you are asking for the key_validation_class of the Index CF, then it's the column type that defines it. If you're asking about the index CF comparator, then in that example it would use a comparator that sorts like your

CQL3 jdbc and Tomcat resource

2013-01-28 Thread Andy Cobley
I tried to add a CQL3 jdbc resource to tomcat 7 in a context.xml file (in a Eclipse project) as follows: JDBC driver is cassandra-jdbc-1.1.2. When Tomcat (7.035) restarts it throws a series of errors. Is this known, or expected ? Removing the resource from contact.xml allows the server to

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread Manu Zhang
On Mon 28 Jan 2013 04:42:49 PM CST, aaron morton wrote: The row is the unit of replication, all values with the same storage engine row key in a KS are on the same nodes. if they were per CF this would not hold. Not that it would be the end of the world, but that is the first thing that comes

Re: Does setstreamthroughput also throttle the network traffic caused by nodetool repair?

2013-01-28 Thread aaron morton
> Will that throttle the network traffic caused by nodetool repair? yes. > Should I call it to all the nodes on the cluster? Or set it in the yaml file. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 25/01/2013, a

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread aaron morton
The row is the unit of replication, all values with the same storage engine row key in a KS are on the same nodes. if they were per CF this would not hold. Not that it would be the end of the world, but that is the first thing that comes to mind. Cheers - Aaron Morton Freelanc