RE: Question about seeds in tow node cluster.

2011-02-14 Thread nicolas lattuada

Hi 

I have a two nodes cluster running and have both of them in the seeds list.

regards

> Date: Sun, 13 Feb 2011 16:04:41 +0800
> Subject: Question about seeds in tow node cluster.
> From: guxiaobo1...@gmail.com
> To: user@cassandra.apache.org
> 
> Hi,
> If the cluster only have tow nodes, should they both in the seeds list?
> 
> Regards,
> 
> Xiaobo Gu
  

The size of the data, I must be doing smth wrong....

2011-01-05 Thread nicolas lattuada

Hi 

i have some data size issues:

i am storing super columns with the following content:

{a=>1, b=>2, c=>3...n=>14}

i am storing it 300 000 times and i have a data size on the disk about 283Mo

And in other side i have a mysql table which stores a bunch of data the schema 
follows:
6 varchars +100
5 ints +6

I put about 1 300 000 records on it and end up with 150Mo of data and 57Mo of 
index.

Then i think i am certainly doing something wrong...

The other thing is when i run flush and then compact the size of my data 
increases, then i imagine something is copied up on compaction
So is there a way to remove the unused data? (cleanup doesn t seem to do the 
job).

Any help to reduce the size of the data would be greatly apreciated!
Greetings

  

somebody interested in hacking some very simple php client

2011-01-04 Thread nicolas lattuada

Yesterday i made it real quick, maybe it can help someone.

Here it is:

http://pastebin.com/bAyWMfXD

Hope it helps.

Nicolas
  

How to setup a big shared files system

2010-12-27 Thread nicolas lattuada

Hi

I am working for an afiliate marketing program, we are logging a lot of data, 
and thinking about using Cassandra.

But i wonder which is the way to setup properly a cassandra cluster.

For example let s say i would like to store about 100 000 Giga bytes of data, 
and will need to access them quite fast sometimes.


Of course the data cannot stand on a single server, so what would you recomend 
for these needs?

Is it possible to have multiple cassandras instances accessing the same index 
over a shared file cluster, or is it better to do partitionning?

Is  hardware for such a setup expensive?

thank you.