RE: Iterate all keys - doing it as the faq fails for me :(

2010-07-12 Thread Per Olesen
Anyone? - Hi, I was reading http://wiki.apache.org/cassandra/FAQ#iter_world and decided to implement the get_range_slices method for listing all keys of a CF. Only thing is, it doesn't work that well for me :-) I do as it says (I think), and take KeyRanges of size N and use the key

Re: advice, is cassandra suitable for a multi-tanency vBulletin type application?

2010-07-12 Thread Sandeep Kalidindi at PaGaLGuY.com
we were one of the vbulletin customers and our forums has been facing some bad scaling issues. we coded our forum software to work with cassandra. we are still testing for bugs and might go live in couple of weeks. You can ask any specific questions about vbulletin and cassandra and i will answer

Re: Is anyone using version 0.7 schema update API

2010-07-12 Thread GH
Hi, My problem is that I cannot locate Java equivalents to the api calls you present in the ruby files you have presented. They are not visible in the java client packages I have (My code is not that old of trunk). I located the code below from some of the unit test code files This code will

Re: Iterate all keys - doing it as the faq fails for me :(

2010-07-12 Thread Jonathan Ellis
This is a bug. Can you submit a ticket with test data to reproduce? On Fri, Jul 9, 2010 at 6:40 AM, Per Olesen p...@trifork.com wrote: Hi, I was reading http://wiki.apache.org/cassandra/FAQ#iter_world and decided to implement the get_range_slices method for listing all keys of a CF. Only

SV: Iterate all keys - doing it as the faq fails for me :(

2010-07-12 Thread Per Olesen
This is a bug. Can you submit a ticket with test data to reproduce? Uuuh, maybe...:) Right now it is happening on some life user data, that I am not sure I can ship. Haven't tried if I can reproduce locally. One question: We are running 0.6.2. Could this be fixed in 0.6.3? Not that big a

Re: advice, is cassandra suitable for a multi-tanency vBulletin type application?

2010-07-12 Thread S Ahmed
Very interesting! What kind of integration do you have between vB and Cassandra? its not a port then? On Mon, Jul 12, 2010 at 3:34 AM, Sandeep Kalidindi at PaGaLGuY.com sandeep.kalidi...@pagalguy.com wrote: we were one of the vbulletin customers and our forums has been facing some bad

Re: High CPU usage on all nodes without any read or write

2010-07-12 Thread Olivier Rosello
But in Cassandra output log : r...@cassandra-2:~#  tail -f /var/log/cassandra/output.log  INFO 15:32:05,390 GC for ConcurrentMarkSweep: 1359 ms, 4295787600 reclaimed leaving 1684169392 used; max is 6563430400  INFO 15:32:09,875 GC for ConcurrentMarkSweep: 1363 ms, 4296991416 reclaimed

Re: TechCrunch article on Twitter and Cassandra

2010-07-12 Thread Eric Evans
On Sun, 2010-07-11 at 01:06 +0530, Sumit Datta wrote: What I do not see are details as to why Cassandra is not being used to store tweets. Or the details of the implementation that does have Cassandra. I wouldn't let that stop you. You should consider doing what so many others are: treat all

RE: server needs thrift to run also?

2010-07-12 Thread Stu Hood
The Thrift server is embedded in Cassandra, and starts by default. Look for references to Thrift on: http://wiki.apache.org/cassandra/GettingStarted Thanks, Stu -Original Message- From: S Ahmed sahmed1...@gmail.com Sent: Monday, July 12, 2010 3:43pm To: user@cassandra.apache.org

Re: server needs thrift to run also?

2010-07-12 Thread S Ahmed
confused, why does the installation guide say to build and make it then? http://github.com/ericflo/twissandra http://github.com/ericflo/twissandratwissandar is for 0.6.1 is that why? i.e. it was embedded in a later version? On Mon, Jul 12, 2010 at 4:46 PM, Stu Hood stu.h...@rackspace.com wrote:

Re: server needs thrift to run also?

2010-07-12 Thread Stu Hood
You'll need Thrift installed to generate the _client_ code: the server code is embedded within Cassandra. -Original Message- From: S Ahmed sahmed1...@gmail.com Sent: Monday, July 12, 2010 3:49pm To: user@cassandra.apache.org Subject: Re: server needs thrift to run also? confused, why

Re: server needs thrift to run also?

2010-07-12 Thread Michael Pearson
Twissandra is packaged with pycassa + correct generated thrift transports under /deps already, so really just need the thrift binary to build from a cassandra.thrift API newer than what's currently supported by the bundled pycassa. -michael On Mon, Jul 12, 2010 at 1:55 PM, Stu Hood

Re: server needs thrift to run also?

2010-07-12 Thread S Ahmed
Ok I guess I have to read up on exactly what is going on here. I figured I could download twissandra, fire up cassandra and run the app! I thought all you needed was the python driver which comes with twissandra. Let me read more about Thrift and generating client code etc. thanks! On Mon,

Re: advice, is cassandra suitable for a multi-tanency vBulletin type application?

2010-07-12 Thread S Ahmed
What sort of traffic levels made you port the application to Cassandra? Very interested in seeing this go live. What sort of server setup are you looking at using? On Mon, Jul 12, 2010 at 4:39 PM, Sandeep Kalidindi at PaGaLGuY.com sandeep.kalidi...@pagalguy.com wrote: No we re-coded from

Re: advice, is cassandra suitable for a multi-tanency vBulletin type application?

2010-07-12 Thread Sandeep Kalidindi at PaGaLGuY.com
well we were going down constantly with VB running on 3-4 dedicated servers due to huge traffic(couple of tens of millions of page views). We are also planning on some new major features, hence the shift to cassandra with future in mind. Well roughly the architecture is like this(in order of how

Re: Question regarding consistency and deletion

2010-07-12 Thread Aaron Morton
The Tomstones are removed after GCGraceSeconds (in the storage-config.xml), at the next Major Compaction http://wiki.apache.org/cassandra/MemtableSSTable?highlight=%28tombstones%29Take a look at http://wiki.apache.org/cassandra/DistributedDeletes and Handling Failure on

Re: Question about CL.ZERO

2010-07-12 Thread Aaron Morton
My understanding is that the coordinator will acknowledge the writes faster then they can actually be written. Eventually it will run out of buffer space. see http://wiki.apache.org/cassandra/FAQ#slows_down_after_lotso_insertsUsing CL.ONE makes it harder for the clients to flood the cluster with

Re: Iterate all keys - doing it as the faq fails for me :(

2010-07-12 Thread Jonathan Ellis
I can't picture how you could be reading data that sorts *before* the start key with a range slice. So, probably not fixed in 0.6.3. On Mon, Jul 12, 2010 at 7:56 AM, Per Olesen p...@trifork.com wrote: This is a bug.  Can you submit a ticket with test data to reproduce? Uuuh, maybe...:)

Re: Question about CL.ZERO

2010-07-12 Thread Benjamin Black
CL.ONE represents the fastest you can sustain. CL.ZERO represents writing to memory on the coordinator, regardless of what the nodes can sustain for durable writes. That is a bad situation, regardless of your durability goals. So, there is no good reason. What you are describing is a

Re: server needs thrift to run also?

2010-07-12 Thread Benjamin Black
You were just told it is packaged with what it needs. The API is not changed from 0.6.1 to 0.6.3. Why do you think you need to generate client code? On Mon, Jul 12, 2010 at 2:16 PM, S Ahmed sahmed1...@gmail.com wrote: Ok I guess I have to read up on exactly what is going on here. I figured I

Re: server needs thrift to run also?

2010-07-12 Thread Miguel Verde
I'll take a guess. S Ahmed, the Thrift compiler takes a .thrift file and can generate client and server code for it in your language of choice. This code depends on the Thrift runtime library in that language. For instance, the Thrift Java runtime library is bundled with Cassandra as a jar.

Re: advice, is cassandra suitable for a multi-tanency vBulletin type application?

2010-07-12 Thread Paul Prescod
Why Cassandra *and* Redis? What do you perceive as the strengths or weaknesses of the two? On Mon, Jul 12, 2010 at 2:40 PM, Sandeep Kalidindi at PaGaLGuY.com sandeep.kalidi...@pagalguy.com wrote: well we were going down constantly with VB running on 3-4 dedicated servers due to huge

RE: Question about CL.ZERO

2010-07-12 Thread Todd Burruss
the goal i am reaching for with ZERO is to return control to the user ASAP, with super fast response times. the load isn't high at all, but persisting does take time even under light load. we are not actually using ZERO at the moment but were considering it for fire and forget type of events.

GCGraceSeconds per ColumnFamily/Keyspace

2010-07-12 Thread Todd Burruss
I have two CFs in my keyspace. one i care about allowing a good amount of time for tombstones to propagate (GCGraceSeconds large) ... but the other i couldn't care and in fact i want them gone ASAP so i don't iterate over them. has any thought been given to making this setting per Keyspace or

Re: GCGraceSeconds per ColumnFamily/Keyspace

2010-07-12 Thread Jonathan Ellis
GCGS per CF sounds totally reasonable to me. On Mon, Jul 12, 2010 at 6:33 PM, Todd Burruss bburr...@real.com wrote: I have two CFs in my keyspace.  one i care about allowing a good amount of time for tombstones to propagate (GCGraceSeconds large) ... but the other i couldn't care and in fact

Re: TechCrunch article on Twitter and Cassandra

2010-07-12 Thread Jonathan Ellis
On Sat, Jul 10, 2010 at 2:22 PM, Colin Clark co...@cloudeventprocessing.com wrote: Although I'm a fan of Cassandra, there's no way I'd use it today for my tier 1 deployments, because I don't have the resources of Facebook, and even though Cassandra is open source, that doesn't mean I can fix it

Re: server needs thrift to run also?

2010-07-12 Thread Eric Evans
On Mon, 2010-07-12 at 17:16 -0400, S Ahmed wrote: Ok I guess I have to read up on exactly what is going on here. I figured I could download twissandra, fire up cassandra and run the app! Pretty much, but you do need to install the Thrift python module (which the README does say). Try:

Re: GCGraceSeconds per ColumnFamily/Keyspace

2010-07-12 Thread Todd Burruss
Is it possible to get this feature in 0.7? -Original Message- From: Jonathan Ellis [jbel...@gmail.com] Received: 7/12/10 5:06 PM To: user@cassandra.apache.org [u...@cassandra.apache.org] Subject: Re: GCGraceSeconds per ColumnFamily/Keyspace GCGS per CF sounds totally reasonable to me.

Re: concurrent reads

2010-07-12 Thread Jonathan Ellis
if you're not sure where your bottleneck is, you aren't hitting it hard enough :) On Mon, Jul 12, 2010 at 9:00 PM, Lee Parker l...@socialagency.com wrote: Has anyone experimented with different settings for concurrent reads?  I have set our servers to 4 ( 2 per processor core ).  I have noticed

Re: server needs thrift to run also?

2010-07-12 Thread Jonathan Ellis
would it be hard to make easy_install pycassa install thrift automagically? On Mon, Jul 12, 2010 at 10:36 PM, Eric Evans eev...@rackspace.com wrote: On Mon, 2010-07-12 at 17:16 -0400, S Ahmed wrote: Ok I guess I have to read up on exactly what is going on here. I figured I could download

Re: GCGraceSeconds per ColumnFamily/Keyspace

2010-07-12 Thread Jonathan Ellis
Probably. Can you open a ticket? On Mon, Jul 12, 2010 at 10:41 PM, Todd Burruss bburr...@real.com wrote: Is it possible to get this feature in 0.7? -Original Message- From: Jonathan Ellis [jbel...@gmail.com] Received: 7/12/10 5:06 PM To: user@cassandra.apache.org

Re: server needs thrift to run also?

2010-07-12 Thread Eric Evans
On Mon, 2010-07-12 at 23:13 -0500, Jonathan Ellis wrote: would it be hard to make easy_install pycassa install thrift automagically? I think it would do that already, assuming that pycassa itself was installable from the cheeseshop (not sure why it isn't). Twissandra is actually using an