Re: Error while startup - latest trunk build

2011-04-13 Thread Shariq
Hi again, Small update on the issue, *KeyspaceDefinition keyspaceDefinition = HFactory.createKeyspaceDefinition(KEYSPACE_NAME, org.apache.cassandra.locator.SimpleStrategy, 1, Arrays.asList(cfDef));* works fine with cassandra-0.7.4/thrift-19.4.0. But when I am running the same

Issue with deleting in CLI on 0.7.4

2011-04-13 Thread Wenjun Che
I am having an issue with del CF[key] not deleting data in CLI on 0.7.4. Here is how I can reproduce: 1. create a CF with UUID as row key and run 'assume CF keys as lexicaluuid' 2. insert a row, with 62397363-7d4f-4b12-a1eb-f0cd29dca735 as key (just an example). 3. 'get

Re: json2sstable

2011-04-13 Thread Steven Teo
managed to resolve the earlier error but is getting this exception now, any way to increase the heap space, or is a non-related issue? Exception in thread main java.lang.OutOfMemoryError: Java heap space at

Re: json2sstable

2011-04-13 Thread aaron morton
see conf/cassandra-env.sh to increase the JVM Heap Background reading http://wiki.apache.org/cassandra/MemtableThresholds There is a fix in 0.7.5 that would help you right now :) If you cannot increase the JVM Heap reduce the in_memory_compaction_limit in conf/cassandra/yaml Aaron On 13 Apr

Re: cassandra 0.6.3 error Connection refused to host: 127.0.0.1;

2011-04-13 Thread Ali Ahsan
I am not running any firewall this physical machine not EC2,I can telnet to port 8080 telnet 127.0.0.1 8080 Trying 127.0.0.1... Connected to localhost.localdomain (127.0.0.1). Escape character is '^]'. On 04/13/2011 06:59 AM, aaron morton wrote: Can you connect from the local machine

Re: Cassandra node's replication factor two with random partition non Bootstrap node problem

2011-04-13 Thread Ali Ahsan
This solved we are doing some thing wrong in our application thanks for your help. On 04/13/2011 01:43 AM, Peter Schuller wrote: I have two cassandra node's,If Boot strapped nodes goes down my service remains alive,But if my non Bootstrap (master) node goes down my live site goes down as

choose which Hector's serializer for Cassandra performance?

2011-04-13 Thread 박용욱
I have questions below. 1. In cassandra server, row key, column name and column value are saved in byte[], aren't they? 2. If I call hector's mutator.addInsertion(rowKey, cfName, HColumnString, Integer), does the transformation from String/Integer to byte[] occur at client? or server? 3. If 2

RE: Cassandra 2 DC deployment

2011-04-13 Thread Nair, Rajesh
Peter all great questions. Let me try to answer them. You are right about the automatic fallback to ONE. Its quite possible, if 2 nodes die for some reason I will have the same problem. So probably the right thing to do would be to read/write at ONE only when we lose a DC by changing some

database design

2011-04-13 Thread Jean-Yves LEBLEU
Hi all, Just some thoughts and question I have about cassandra data modeling. If I understand well, cassandra is better on writing than on reading. So you have to think about your queries to design cassandra schema. We are doing incremental design, and already have our system in production and

Link to Hudson on the download page is broken

2011-04-13 Thread Jeremiah Jordan
The apache Hudson server address needs to be updated on the download page, it is now: https://builds.apache.org The link to the latest builds from the download page: http://cassandra.apache.org/download/ Needs to be updated from:

Re: database design

2011-04-13 Thread Edward Capriolo
On Wed, Apr 13, 2011 at 10:39 AM, Jean-Yves LEBLEU jleb...@gmail.com wrote: Hi all, Just some thoughts and question I have about cassandra data modeling. If I understand well, cassandra is better on writing than on reading. So you have to think about your queries to design cassandra schema.

Re: value of hinted handoff column not really empty...?

2011-04-13 Thread Jonathan Ellis
I'd be more inclined to suspect a bug in sstable2json. On Wed, Apr 13, 2011 at 8:59 AM, Terje Marthinussen tmarthinus...@gmail.com wrote: Hi, we do see occasional row corruptions now and then and especially in hinted handoffs. This may be related to fairly long rows (millions of columns) I

Re: database design

2011-04-13 Thread Victor Kabdebon
Dear Jean-Yves, You can have a different approach of the problem. You need on one side a relational database (MySQL, PostGreSQL) or SolR (as an very efficient index) and on the other side Cassandra. The relational database or SolR must contain the minimum amount of information possible : a date

Re: flush_largest_memtables_at messages in 7.4

2011-04-13 Thread mcasandra
Peter Schuller wrote: Saturated. But read latency is still something like 30ms which I would think would be much higher if it's saturated. -- View this message in context:

Indexes on heterogeneous rows

2011-04-13 Thread David Boxenhorn
Is it possible in 0.7.x to have indexes on heterogeneous rows, which have different sets of columns? For example, let's say you have three types of objects (1, 2, 3) which each had three members. If your rows had the following pattern type=1 a=? b=? c=? type=2 d=? e=? f=? type=3 g=? h=? i=?

Re: flush_largest_memtables_at messages in 7.4

2011-04-13 Thread mcasandra
One correction qu size in iostat ranges between 6-120. But still this doesn't explain why read latency is low in cfstats. -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/flush-largest-memtables-at-messages-in-7-4-tp6266221p6269875.html Sent from

CL.ONE reads and SimpleSnitch unnecessary timeouts

2011-04-13 Thread Erik Onnen
Sorry for the complex setup, took a while to identify the behavior and I'm still not sure I'm reading the code correctly. Scenario: Six node ring w/ SimpleSnitch and RF3. For the sake of discussion assume the token space looks like: node-0 1-10 node-1 11-20 node-2 21-30 node-3 31-40 node-4

unsubscribe

2011-04-13 Thread Prasanna Jayapalan
*Prasanna Jayapalan Senior Solution Consultant Evident Software **O:(973)-622-5656 X 253 *

Re: CL.ONE reads and SimpleSnitch unnecessary timeouts

2011-04-13 Thread Jonathan Ellis
First, our contract with the client says we'll give you the answer or a timeout after rpc_timeout. Once we start trying to cheat on that the client has no guarantee anymore when it should expect a response by. So that feels iffy to me. Second, retrying to a different node isn't expected to give

Re: flush_largest_memtables_at messages in 7.4

2011-04-13 Thread Peter Schuller
But read latency is still something like 30ms which I would think would be much higher if it's saturated. No. You're using stress, so you have some total cap on concurrency. Given a fixed concurrency, you'll saturate at some particular average latency which is mostly a function of the backlog

Re: unsubscribe

2011-04-13 Thread Peter Schuller
http://wiki.apache.org/cassandra/FAQ#unsubscribe -- / Peter Schuller

Re: CQL, 0.8, and need for language drivers

2011-04-13 Thread Ryan King
On Tue, Apr 12, 2011 at 7:16 PM, Jeremy Hanna jeremy.hanna1...@gmail.com wrote: As some may have heard, CQL is going to be in 0.8.  It's a level of abstraction that will hopefully make the lives of client developers substantially easier.  The ideal is to make it so client devs only need to do

Re: CL.ONE reads and SimpleSnitch unnecessary timeouts

2011-04-13 Thread Erik Onnen
So we're not currently using a dynamic snitch, only the SimpleSnitch is at play (lots of history as to why, I won't go into it). If this would solve our problems I'm fine changing it. Understood re: client contract. I guess in this case my issue is that the server we're connected to never tries

Re: CL.ONE reads and SimpleSnitch unnecessary timeouts

2011-04-13 Thread Jonathan Ellis
Yes, we've had dynamic snitch on by default in all the 0.7 releases so it's pretty well tested by this point. On Wed, Apr 13, 2011 at 1:17 PM, Erik Onnen eon...@gmail.com wrote: So we're not currently using a dynamic snitch, only the SimpleSnitch is at play (lots of history as to why, I won't

Re: flush_largest_memtables_at messages in 7.4

2011-04-13 Thread mcasandra
I still don't understand. You would expect read latency to increase drastically when it's fully saturated and lot of READ drop messages also, correct? I don't see that in cfstats or system.log which I don't really understand why. -- View this message in context:

Re: choose which Hector's serializer for Cassandra performance?

2011-04-13 Thread Aaron Morton
1. Yes 2. Client 3. NA 4. Not sure how Hector handles it, try https://github.com/zznate/cassandra-tutorial or https://github.com/zznate/hector-examples Aaron On 14/04/2011, at 1:28 AM, 박용욱 colin.wook.p...@gmail.com wrote: I have questions below. 1. In cassandra server, row key, column

Re: Ec2Snitch + NetworkTopologyStrategy if only in one region?

2011-04-13 Thread William Oberman
One last coda, for other noobs to cassandra like me. If you use NetworkTopologyStrategy with replication_factor 1, make sure you have EC2 instance in multiple availability zones. I was doing baby steps, and tried doing a cluster in one AZ (before spreading to multiple AZs) and was getting the

Re: flush_largest_memtables_at messages in 7.4

2011-04-13 Thread Peter Schuller
I still don't understand. You would expect read latency to increase drastically when it's fully saturated and lot of READ drop messages also, correct? I don't see that in cfstats or system.log which I don't really understand why. No. With a fixed concurrency there is only so many outstanding

Possible design flaw in Cassandra By Example blog

2011-04-13 Thread Drew Kutcharian
Hi Everyone, I was going thru Cassandra By Example Blog http://www.rackspace.com/cloud/blog/2010/05/12/cassandra-by-example/ and I had a question about the user sign up section: username = 'jericevans' password = '**' useruuid = str(uuid()) columns = {'id': useruuid, 'username':

Re: Pyramid Organization of Data

2011-04-13 Thread Patrick Julien
We have been successful in implementing, at scale, the comments you posted here. I'm wondering what we can do about deleting data however. The way I see it, we have considerably more storage capacity in NY, but not in the other sites. Using this technique here, it occurs to me that we would

Re: flush_largest_memtables_at messages in 7.4

2011-04-13 Thread mcasandra
Actually when I run 2 stress clients in parallel I see Read Latency stay the same. I wonder if cassandra is reporting accurate nos. I understand your analogy but for some reason I don't see that happening with the results I am seeing with multiple stress clients running. So I am just confused

Re: Possible design flaw in Cassandra By Example blog

2011-04-13 Thread Eric Evans
On Wed, 2011-04-13 at 15:07 -0700, Drew Kutcharian wrote: username = 'jericevans' password = '**' useruuid = str(uuid()) columns = {'id': useruuid, 'username': username, 'password': password} USER.insert(useruuid, columns) USERNAME.insert(username, {'id': useruuid}) How can I

Re: Possible design flaw in Cassandra By Example blog

2011-04-13 Thread Drew Kutcharian
Thanks for your response. In general, seems like you always need some kind of external coordination if you are doing inverted indexes. How do others tackle this issue? Now would using secondary indexes be a good idea in this case considering cardinality of the keys will be pretty high?

Re: choose which Hector's serializer for Cassandra performance?

2011-04-13 Thread 박용욱
Thanks very much Aaron! 4. Not sure how Hector handles it, try https://github.com/zznate/cassandra-tutorial or https://github.com/zznate/hector-examples https://github.com/zznate/hector-examplesLet me ask question 4 again. :) 1. RandomPartitioner and OrderPreservingPartitioner response same

raid 0 and ssd

2011-04-13 Thread Anurag Gujral
Hi All, We are using three ssd disks with cassandra 0.7.3 , should we set them as raid0 .What are the advantages and disadvantages of doing this. Please advise. Thanks Anurag

Re: raid 0 and ssd

2011-04-13 Thread Drew Kutcharian
RAID 0 is the fastest, but you'll lose the whole array if you lose a drive. One thing to keep in mind is that SSDs get slower as they get filled up and closer to their capacity due to garbage collection. If you want more info on how SSDs perform in general, Percona guys have done extensive