Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
On Mon, Jul 26, 2010 at 8:30 PM, Jonathan Ellis wrote: > MDP is backing up because RRS is full at 4096. This means you're not > able to process reads as quickly as the requests are coming in. Make > whatever is doing those reads be less aggressive. > > So, for cassandra to function correctly I

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
@Garo htop. On Mon, Jul 26, 2010 at 11:28 PM, Juho Mäkinen wrote: > Off topic, but what was this tool which prints per cpu utilization? > > - Garo > > On Mon, Jul 26, 2010 at 10:22 PM, Dathan Pattishall > wrote: > > But the 16 cores are hardly utilized. Which indicates to me there is some > > b

singular or plural column family names

2010-07-26 Thread uncle mantis
I know this is an age old question. Kinda like the chicken and the egg. I know that everyone's solution is different but I wanted to get an open opinion. Do you all use singular or plural column family names in your keyspaces? I have been using plural for years and I have worked at jobs that use

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Juho Mäkinen
Off topic, but what was this tool which prints per cpu utilization? - Garo On Mon, Jul 26, 2010 at 10:22 PM, Dathan Pattishall wrote: > But the 16 cores are hardly utilized. Which indicates to me there is some > bad thread thrashing, but why? > >   1  [| 

Pig Integration

2010-07-26 Thread Mubarak Seyed
Does anyone try to get data from Super column family using grunt shell? The example pig script explains how to analyze data for standard column family. -- Thanks, Mubarak Seyed.

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Jonathan Ellis
MDP is backing up because RRS is full at 4096. This means you're not able to process reads as quickly as the requests are coming in. Make whatever is doing those reads be less aggressive. As to why the reads are slow in the first place, usually this means you are disk i/o bound. Posting your cf

Re: Design questions/Schema help

2010-07-26 Thread Mark
On 7/26/10 7:06 PM, Dave Viner wrote: AFAIK, atomic increments are not available. There recently has been quite a bit of discussion about them. So, you might search the archives. Dave Viner On Mon, Jul 26, 2010 at 7:02 PM, Mark > wrote: On 7/26/10 6:0

Re: Key Caching

2010-07-26 Thread Dathan Pattishall
@Todd, I noticed some new ops in your cassandra.in.sh. Is there any documentation on what these ops are, and what they do? For instance AggressiveOpts, etc. On Mon, Jul 26, 2010 at 4:33 PM, B. Todd Burruss wrote: > i run cassandra with a 30gb heap on machines with 48gb total with good > resul

Re: Design questions/Schema help

2010-07-26 Thread Dave Viner
AFAIK, atomic increments are not available. There recently has been quite a bit of discussion about them. So, you might search the archives. Dave Viner On Mon, Jul 26, 2010 at 7:02 PM, Mark wrote: > On 7/26/10 6:06 PM, Dave Viner wrote: > > I'd love to hear other's opinions here... but here

Re: Design questions/Schema help

2010-07-26 Thread Mark
On 7/26/10 6:06 PM, Dave Viner wrote: I'd love to hear other's opinions here... but here are my 2 cents. With Cassandra, you need to think of the queries - which you've pretty much done. For the most popular queries, you could do something like: And then access it as: key-space.QueriesCount

Re: Cassandra Horizontal Scalability

2010-07-26 Thread Benjamin Black
Are you persisting connections or opening a new one for each operation? Do you understand what deletes actually do internally? On Mon, Jul 26, 2010 at 12:45 PM, SSam wrote: > Thanks for reply > > I am doing very simple test: > I am stressing my webapplication with grinder  testing tool. > Number

Re: Design questions/Schema help

2010-07-26 Thread Dave Viner
I'd love to hear other's opinions here... but here are my 2 cents. With Cassandra, you need to think of the queries - which you've pretty much done. For the most popular queries, you could do something like: And then access it as: key-space.QueriesCounted['query-foo-bar'] = $count;

Re: Cassandra Graphical Modeling

2010-07-26 Thread Jonathan Shook
As long as you only want to edit YEd files and print them, it's great. Anything else to do with it is proprietary and expensive (for me, at least). On Mon, Jul 26, 2010 at 7:12 PM, Ashwin Jayaprakash wrote: > > YEd ( http://www.yworks.com/en/products_yed_about.html > http://www.yworks.com/en/prod

Re: Cassandra Graphical Modeling

2010-07-26 Thread Ashwin Jayaprakash
YEd ( http://www.yworks.com/en/products_yed_about.html ) is a pretty good tool. No setup required, free, very versatile and good for drawing graphs quickly.

Design questions/Schema help

2010-07-26 Thread Mark
We are thinking about using Cassandra to store our search logs. Can someone point me in the right direction/lend some guidance on design? I am new to Cassandra and I am having trouble wrapping my head around some of these new concepts. My brain keeps wanting to go back to a RDBMS design. We wi

Re: Key Caching

2010-07-26 Thread B. Todd Burruss
i run cassandra with a 30gb heap on machines with 48gb total with good results. i don't use more just because i want to leave some for the OS to cache disk pages, etc. i did have the problem a couple of times with GC doing a full stop on the JVM because it couldn't keep up. my understanding of t

Re: Looking for recommendation on DELL server types/spec for running Cassandra nodes

2010-07-26 Thread Jonathan Ellis
Not vendor-specific, but http://wiki.apache.org/cassandra/CassandraHardware may help. On Thu, Jul 22, 2010 at 4:53 AM, Simon Reavely wrote: > Hi, > I'm looking for a recommendation on DELL server types/spec for running > Cassandra nodes > What model DELL servers are folks using with success? > Wh

Re: Correct steps how to extend cluster size and RF

2010-07-26 Thread Jonathan Ellis
On Wed, Jul 21, 2010 at 1:30 PM, Juho Mäkinen wrote: > I'm just about to extend my current two node production cluster into > five node cluster and I'd like to be sure that my plan is correct. > > Currently cluster has two nodes with RF=2. The target is to add four > nodes, increase RF to 3 and dr

Re: UnavailableException on QUORUM write

2010-07-26 Thread Jonathan Ellis
Can you turn on debug logging and try this patch? On Wed, Jul 21, 2010 at 1:23 AM, Per Olesen wrote: >>> And when one of my non-seed nodes in my 3 node cluster is down, I do NOT >>> get the exception. >>> Anyway, guess I need to try and reproduce it in small scale. >> >>Does it return w/ UE imme

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
Okay so why would the pending be so large? I looked at iostat and the diskload is not bad at all. The service times /etc are all good. If it's not disk based I/O then is it thread thrashing? If so which thread pool should I monitor more closely. MESSAGE-DESERIALIZER-POOL:1 which would indicate t

Re: Failing to create a 2 Node cluster on a Windows machine

2010-07-26 Thread Alaa Zubaidi
Thanks.. Kochheiser,Todd W - TOK-DITT-1 wrote: If you are using XP/SP2, I think MS broke support for using anything other than 127.0.0.1 for loopback. Here is the KB article: http://support.microsoft.com/kb/884020 I’ve only tested Cassandra clusters using multiple Windows machines. The Cas

Re: Failing to create a 2 Node cluster on a Windows machine

2010-07-26 Thread Alaa Zubaidi
Thanks for the help, I tried to create an alias for 127.0.0.2 but it did not work, so I created aliases for the DHCP IP and I tried it, the error disappeared, but I am not sure that it works the way I want it, I am checking to make sure that I have 2 nodes running not only one.. Thanks Jona

Re: Help! Cassandra Data Loader threads are getting stuck

2010-07-26 Thread Malcolm Smith
Also make sure you have consistency level set to at least ONE Sent from my iPhone On Jul 26, 2010, at 5:31 PM, Aaron Morton wrote: > Try running it without threading to see if it's a cassandra problem or an > issue with your threading. > > Perhaps split the file and run many single threaded

Re: Cassandra behaviour

2010-07-26 Thread tsuraan
> It's reading through keys in the index and adding offset information > about roughly every 128th entry in RAM, in order to speed up reads. > Performing a binary search in an sstable from scratch would be > expensive. Because of the high cost of disk seeks, most storage > systems use btrees with a

Re: Help! Cassandra Data Loader threads are getting stuck

2010-07-26 Thread Aaron Morton
Try running it without threading to see if it's a cassandra problem or an issue with your threading. Perhaps split the file and run many single threaded processes to load the data. Aaron On 27 Jul, 2010, at 07:14 AM, Rana Aich wrote: Hi All, I have to load huge quantity of data into Cassandra (~10Bill

Re: Can we filter a key or a column name using regular expression?

2010-07-26 Thread Aaron Morton
Say you have the columns "foo", "foo.bar", "foo.baz", "monkeys" and you want to read all the columns that start with 'foo.' You could set the start column for the SliceRange to "foo." and the end column to "" and make repeated get_slice calls until you see a column that does not start with "foo." Or yo
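
A minimal sketch of the repeated-slice approach described above, in plain Python. It does not call the Thrift API: `get_slice` here is a hypothetical stand-in that returns up to `count` names from a sorted list, starting at `start` (inclusive), and `columns_with_prefix` and `page_size` are illustrative names too. The point is only the paging logic: start at the prefix, keep fetching pages, and stop at the first column that no longer starts with the prefix.

```python
from bisect import bisect_left

def get_slice(columns, start, count):
    """Hypothetical stand-in for a slice call: up to `count` sorted names >= start."""
    i = bisect_left(columns, start)
    return columns[i:i + count]

def columns_with_prefix(columns, prefix, page_size=2):
    """Page through sorted column names and collect those starting with `prefix`."""
    if page_size < 2:
        raise ValueError("page_size must be at least 2 to make progress")
    found = []
    start = prefix
    while True:
        page = get_slice(columns, start, page_size)
        # The start bound is inclusive, so later pages repeat the last column seen.
        if found and page and page[0] == found[-1]:
            page = page[1:]
        if not page:
            return found
        for name in page:
            if not name.startswith(prefix):
                return found  # walked past the prefix range
            found.append(name)
        start = found[-1]

columns = sorted(["foo", "foo.bar", "foo.baz", "monkeys"])
matches = columns_with_prefix(columns, "foo.")
```

With the four example columns, `matches` holds "foo.bar" and "foo.baz"; "foo" sorts before the prefix and "monkeys" ends the scan.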

Re: Key Caching

2010-07-26 Thread Peter Schuller
> If the cache is stored in the heap, how big can the heap be made > realistically on a 24gb ram machine? I am a java newbie but I have read > concerns with going over 8gb for the heap as the GC can be too painful/take > too long. I already have seen timeout issues (node is dead errors) under > loa

Re: Key Caching

2010-07-26 Thread Wayne
If the cache is stored in the heap, how big can the heap be made realistically on a 24gb ram machine? I am a java newbie but I have read concerns with going over 8gb for the heap as the GC can be too painful/take too long. I already have seen timeout issues (node is dead errors) under load during G

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Peter Schuller
> How does one set/get the read-concurrency? Also which read-concurrency, for > which pool? According to the threads tab in jconsole read-concurrency could > be among different pools. There are row-read-stage threads, lots of write > threads, row mutation stage etc. Short version: Ignore my concurre

Re: Cassandra Graphical Modeling

2010-07-26 Thread Chaker Nakhli
I'm glad you appreciated the article Michael. Thank you for the PP hint. -- Chaker uncle mantis wrote: >Thank you sir! Fantastic article BTW! > >It would be nice to have something draw the models like this automatically >based on a schema. > >I am currently using PowerPoint's Table hierarchy an

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
How does one set/get the read-concurrency? Also which read-concurrency, for which pool? According to the threads tab in jconsole read-concurrency could be among different pools. There are row-read-stage threads, lots of write threads, row mutation stage etc. On Mon, Jul 26, 2010 at 12:56 PM, P

Re: Cassandra Graphical Modeling

2010-07-26 Thread uncle mantis
Johnathan. I will give it a shot. At first glance it really did not look like much at all so I passed it up. I will give it a try now. Thanks! I love the Cassandra community. I hang out mostly in IRC and this is my first real post on the mailing list and the response time is surprisingly rapid! Th

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Peter Schuller
> MESSAGE-DESERIALIZER-POOL 1    682008    1818682 On the (unverified by me) assumption that disk reads end up in this stage as Anthony indicates, and looking at the above line and the iostat output, makes me want to suggest that maybe the read concurrency is set to 1 and you need to in

Re: Cassandra basics

2010-07-26 Thread sonia gehlot
Thanks themanatuf, This is an awesome link... they really explained the data model very nicely, especially for folks from an RDBMS background. Regards, Sonia On Sat, Jul 24, 2010 at 4:17 AM, Uncle Mantis wrote: > I want to point out my up hill battle tips. > > Make some friends in the Freenode #Cassandra

Re: Cassandra Graphical Modeling

2010-07-26 Thread Jonathan Shook
I'm not trying to push Inkscape, but this may be helpful. I use it and graphviz, depending on the circumstances. If you create a basic template in Inkscape, you can use it to compose structural views of things by rendering the XML/SVG from a model. It's not a feature, per se, but just a way to tak

Re: Key Caching

2010-07-26 Thread Peter Schuller
(the following are best-effort answers resulting from taking the opportunity to look at the source; i may be missing something) > I am trying to use the key cache, and I had some basic questions as to how > it works. There seem to be settings based on % and total count, but not on > total cache si

Re: Cassandra Horizontal Scalability

2010-07-26 Thread SSam
Thanks for reply I am doing very simple test: I am stressing my webapplication with grinder testing tool. Number of testing clients is 250, each client just creates/deletes a User Object( max 500 bytes data). So data is purely WRITE ( I hope Cassandra is Write Optimized) . I am running -0.6

Re: Cassandra Graphical Modeling

2010-07-26 Thread uncle mantis
Thank you sir! Fantastic article BTW! It would be nice to have something draw the models like this automatically based on a schema. I am currently using PowerPoint's Table hierarchy and using tabs for each display row and just a line feed for a new column. It works pretty good! Again, thank you

Re: what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Anthony Molinaro
It's usually I/O which causes backup in MESSAGE-DESERIALIZER-POOL. You should check iostat and see what it looks like. It may be that you need more nodes in order to deal with the read/write rate. You can also use JMX to get latency values on reads and writes and see if the backup has a corresp

Re: Cassandra Horizontal Scalability

2010-07-26 Thread Bill de hÓra
Initially, try running stress.py against your servers. That will help eliminate pelops as the issue. Also, some more detail on your setup wouldn't hurt, 1300 anything a second is low. Is the TPS reads or writes, or a mix? What's the column family structure? Is the data large (eg are you writ

what causes MESSAGE-DESERIALIZER-POOL to spike

2010-07-26 Thread Dathan Pattishall
I have 4 nodes on enterprise type hardware (Lots of Ram 12GB, 16 i7 cores, RAID Disks). ~# /opt/cassandra/bin/nodetool --host=localhost --port=8181 tpstats Pool Name Active Pending Completed STREAM-STAGE 0 0 0 RESPONSE-STAGE

Re: Cassandra Graphical Modeling

2010-07-26 Thread Chaker Nakhli
Hi all, The example given by uncle Mantis was made using Adobe Illustrator. Cheers, --Chaker uncle mantis wrote: >I am giving PowerPoint's SmartArt's Table Hierarchy a try. Not too bad but I >would like to get something more along the lines of this example >http://www.javageneration.com/?p=70

Help! Cassandra Data Loader threads are getting stuck

2010-07-26 Thread Rana Aich
Hi All, I have to load huge quantity of data into Cassandra (~10Billion rows). I'm trying to load the Data from files using multithreading. The idea is each thread will read the TAB delimited file and process chunk of records. For example Thread1 reads line 1-1000 lines Thread 2 reads line 1001
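
A hedged sketch of the chunked, multithreaded load described above, using only the Python standard library. `insert_row` is a hypothetical callback standing in for whatever client call writes one row; the chunk size of 1000 follows the example in the message. Collecting `result()` from every future re-raises worker exceptions, which helps tell a stuck thread from a failed one. Note that this version submits all chunks up front, so a real loader for billions of rows would need to bound the number of pending chunks.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice

def chunks(lines, size):
    """Yield successive lists of at most `size` items from any iterable."""
    it = iter(lines)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch

def load_chunk(batch, insert_row):
    """Split each TAB-delimited line and hand the fields to `insert_row`."""
    for line in batch:
        insert_row(line.rstrip("\n").split("\t"))
    return len(batch)

def load(lines, insert_row, chunk_size=1000, workers=4):
    """Process chunks on a thread pool and return the number of rows handled."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [pool.submit(load_chunk, batch, insert_row)
                   for batch in chunks(lines, chunk_size)]
        # result() re-raises any exception from the worker thread.
        return sum(f.result() for f in futures)

# Usage with an in-memory list in place of a real client (assumption).
rows = []
sample_lines = ["%d\tvalue%d\n" % (i, i) for i in range(2500)]
loaded = load(sample_lines, rows.append)
```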

Re: Cassandra behaviour

2010-07-26 Thread Peter Schuller
> be the most portable thing to do.  I had been thinking that the bloom > filters were created on startup, but further reading of the docs > indicates that they are in the SSTable Index.  What is cassandra > doing, then, when it's printing out that it's sampling indices while > it starts? It's rea
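
To illustrate the index sampling mentioned above, here is a small sketch under stated assumptions: the index is modelled as a sorted Python list of keys, and every 128th entry is kept in memory with its position. This is not Cassandra's code; `build_sample` and `lookup` are illustrative names. A lookup binary-searches the in-memory sample, then scans at most one interval of the full index.

```python
from bisect import bisect_right

INTERVAL = 128  # sampling interval taken from the message above

def build_sample(index_keys, interval=INTERVAL):
    """Keep (key, position) for every `interval`-th entry of the sorted index."""
    return [(index_keys[i], i) for i in range(0, len(index_keys), interval)]

def lookup(index_keys, sample, key, interval=INTERVAL):
    """Return the position of `key` in the index, or None if it is absent."""
    sampled_keys = [k for k, _ in sample]
    slot = bisect_right(sampled_keys, key) - 1  # last sampled key <= key
    if slot < 0:
        return None  # key sorts before the first index entry
    start = sample[slot][1]
    # Scan at most one interval; on disk this would be one seek plus a short read.
    for pos in range(start, min(start + interval, len(index_keys))):
        if index_keys[pos] == key:
            return pos
    return None

keys = ["key%06d" % i for i in range(1000)]
sample = build_sample(keys)
position = lookup(keys, sample, "key000500")
```

The memory cost is one sampled entry per 128 keys, which is why the sample still grows with the number of keys, only more slowly than the index itself.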

Re: Cassandra Graphical Modeling

2010-07-26 Thread uncle mantis
I am giving PowerPoint's SmartArt's Table Hierarchy a try. Not too bad but I would like to get something more along the lines of this example http://www.javageneration.com/?p=70 Regards, Michael On Mon, Jul 26, 2010 at 1:24 PM, Jesse McConnell wrote: > +1 for graphviz (if you like text files)

Re: Failing to create a 2 Node cluster on a Windows machine

2010-07-26 Thread Per Olesen
On Jul 26, 2010, at 8:39 PM, Peter Schuller wrote: >> Completely off topic from the list, but Jonathan do you (or others) by any >> means know how to create an alias for 127.0.0.2 on a mac? Cause I used >> 127.0.0.x on linux without a problem, but on my Mac, it seems to need some >> config to

Re: Failing to create a 2 Node cluster on a Windows machine

2010-07-26 Thread Peter Schuller
> Completely off topic from the list, but Jonathan do you (or others) by any > means know how to create an alias for 127.0.0.2 on a mac? Cause I used > 127.0.0.x on linux without a problem, but on my Mac, it seems to need some > config to be able to bind to 127.0.0.2 etc. sudo ifconfig lo0 add

Re: Cassandra behaviour

2010-07-26 Thread tsuraan
> My guess: > Your test is beating up your system. The system may need more memory > or disk throughput or CPU in order to keep up with that particular > test. Yeah, I am testing on a pretty wimpy machine; I just wanted to get some practice getting cassandra up and running, and I ran into this pro

Re: Cassandra behaviour

2010-07-26 Thread tsuraan
> Bloom filters are indeed linear in size with respect to the number of > items (assuming a constant target false positive rate). While I have > not looked at how Cassandra calculates the bloom filter sizes, I feel > pretty confident in saying that it won't dynamically replace bloom > filters with
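
The linear-size claim above can be checked with the standard textbook sizing formula for a Bloom filter, m = -n ln(p) / (ln 2)^2 bits for n items at false positive rate p. This is the generic formula, not a description of how Cassandra sizes its filters (the message itself says that was not checked); `bloom_bits` is an illustrative name.

```python
import math

def bloom_bits(n_items, false_positive_rate):
    """Textbook optimal Bloom filter size in bits for a target false positive rate."""
    return math.ceil(-n_items * math.log(false_positive_rate) / (math.log(2) ** 2))

# Bits per item depend only on the target rate, so total size is linear in n.
bits_per_item = bloom_bits(10**6, 0.01) / 10**6
one_billion_gib = bloom_bits(10**9, 0.01) / 8 / 2**30
```

At a 1% target rate this works out to roughly 9.6 bits per item, so a billion keys would need on the order of a gigabyte for the filter alone under this formula.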

Re: Failing to create a 2 Node cluster on a Windows machine

2010-07-26 Thread Per Olesen
On Jul 26, 2010, at 3:25 AM, Jonathan Ellis wrote: > I know on a mac you need to explicitly create an alias for 127.0.0.2 > before it can be used. Maybe something similar applies to Windows. Completely off topic from the list, but Jonathan do you (or others) by any means know how to create an

Re: Cassandra Graphical Modeling

2010-07-26 Thread Jesse McConnell
+1 for graphviz (if you like text files) jesse -- jesse mcconnell jesse.mcconn...@gmail.com On Mon, Jul 26, 2010 at 13:10, Jonathan Shook wrote: > +1 for Inkscape/SVG > > On Mon, Jul 26, 2010 at 1:07 PM, uncle mantis wrote: >> What do you all use for this? I am currently using MySQL Workbenc

Re: Cassandra Graphical Modeling

2010-07-26 Thread Jonathan Shook
+1 for Inkscape/SVG On Mon, Jul 26, 2010 at 1:07 PM, uncle mantis wrote: > What do you all use for this? I am currently using MySQL Workbench for my > SQL projects. > > PowerPoint? Visio? Gimp? Pencil and Paper? > > Thanks for the help! > > Regards, > > Michael >

Cassandra Graphical Modeling

2010-07-26 Thread uncle mantis
What do you all use for this? I am currently using MySQL Workbench for my SQL projects. PowerPoint? Visio? Gimp? Pencil and Paper? Thanks for the help! Regards, Michael

Re: SV: How to stop cassandra server, installed from debian/ubuntupackage

2010-07-26 Thread Dave Viner
Yes... if you're using debian cassandra you can do: /etc/init.d/cassandra stop On Mon, Jul 26, 2010 at 8:05 AM, Lee Parker wrote: > Which debian/ubuntu packages are you using? I am using the ones that are > maintained by Eric Evans and the init.d script stops the server correctly. > > Lee Par

Re: Cassandra Horizontal Scalability

2010-07-26 Thread SSam
Yes, I am sending requests to newly added node also. Actually I am testing my web application with Cassandra, 1300 is application TPS with Cassandra. So far we are using MYSQL, planning to move use Cassandra, but I am getting better TPS with MYSQL. ___

Visual Data Store Design Tool?

2010-07-26 Thread uncle mantis
Hey all. I am migrating from SQL to Cassandra. I use NaviCat and MySQL Workbench to design, develop and program databases for SQL. I am looking around trying to find the equivalent to MySQL Workbench but for Cassandra. I have not had much luck. Here is the link for MySQL Workbench. http://wb.mys

Re: Cassandra crashes after reboot

2010-07-26 Thread Peter Schuller
> I'm sorry for the lack of information > I'm using 0.6.3. > The move was moving the data dir and the commitlog dir > But i now removed them and let the system bootstrap from the ring. > i know i'm lacking in information here.. but i thought i needed to be > mentioned overhere this could happen. D

Re: Cassandra behaviour

2010-07-26 Thread Peter Schuller
> to play with.  Can anybody give me advice on how to make cassandra > keep running under a high insert load? I forgot to mention that if your insertion speed is simply legitimately faster than compaction, but you have left-over idle CPU on the system, then currently as far as I know you're out of

Re: Cassandra behaviour

2010-07-26 Thread Peter Schuller
[ 1 billion inserts, failed after 120m with out-of-mem ] > - is Cassandra's RAM use proportional to the number of values that > it's storing?  I know that it uses bloom filters for preventing > lookups of non-existent keys, but since bloom filters are designed to > give an accuracy/space tradeoff,

Re: Cassandra Horizontal Scalability

2010-07-26 Thread Benjamin Black
Are you sending all requests to a single server? Performance increases when you add nodes if you send traffic to all of them at once. Adding new nodes does not cause the old nodes to speed up. Also, 1300 reqs/sec is extremely slow, so something else is likely wrong, as well. On Mon, Jul 26, 20
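
A rough sketch of what "send traffic to all of them" can look like on the client side: rotate through the node list so each node receives an equal share of requests. The addresses below are placeholders, and `spread_requests` is an illustrative helper that only counts assignments; a real client library may already do this for you.

```python
from collections import Counter
from itertools import cycle

# Hypothetical node addresses; substitute the real hosts in the ring.
NODES = ["10.0.0.1:9160", "10.0.0.2:9160"]

def spread_requests(nodes, n_requests):
    """Assign each request to the next node in turn and count the result."""
    picker = cycle(nodes)
    return Counter(next(picker) for _ in range(n_requests))

counts = spread_requests(NODES, 1000)
```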

Key Caching

2010-07-26 Thread Wayne
I am trying to use the key cache, and I had some basic questions as to how it works. There seem to be settings based on % and total count, but not on total cache size like most caching mechanisms. Is the cache stored in the jvm heap or somewhere else? We have 24gb RAM nodes with a 8gb jvm heap size

Re: Cassandra behaviour

2010-07-26 Thread Jonathan Shook
My guess: Your test is beating up your system. The system may need more memory or disk throughput or CPU in order to keep up with that particular test. Check some of the posts on the list with "deferred processing" in the body to see why. Also, can you post the error log? On Mon, Jul 26, 2010 at

Re: Cassandra to store 1 billion small 64KB Blobs

2010-07-26 Thread Jonathan Shook
Some possibilities open up when using OPP, especially with aggregate keys. This is more of an option when RF==cluster size, but not necessarily a good reason to make RF=cluster size if you haven't already. For example, ':' and ';' make good boundary markers in aggregate keys, since they are alread

Re: Cassandra Horizontal Scalability

2010-07-26 Thread SSam
I have fully separated gigabit private subnet. I am using pelops cassandra java client with 150 client pool. From: Paul Prescod To: user@cassandra.apache.org Sent: Mon, July 26, 2010 12:21:06 PM Subject: Re: Cassandra Horizontal Scalability There are a l

Cassandra behaviour

2010-07-26 Thread tsuraan
I have a system where we're currently using Postgres for all our data storage needs, but on a large table the index checks for primary keys are really slowing us down on insert. Cassandra sounds like a good alternative (not saying postgres and cassandra are equivalent; just that I think they are b

Re: Cassandra Horizontal Scalability

2010-07-26 Thread Paul Prescod
There are a lot of variables that go into a proper benchmark. The bottleneck could be in many different places. How many client threads are you using? What kind of network? On Mon, Jul 26, 2010 at 8:29 AM, SSam wrote: > > From Cassandra Website: > >- *Elastic* > >Read and write throughp

Re: SV: How to stop cassandra server, installed from debian/ubuntupackage

2010-07-26 Thread Jonathan Shook
If only one instance of Cassandra is running on each node, then use something like pkill -f 'java.*cassandra' If more than one (not recommended for various reasons), then you should modify the scripts to put a unique token in the process name. Something like -Dprocname=... will work. Then you can

Re: Multiget capabilities

2010-07-26 Thread Jonathan Shook
CordiS, The general approach for this kind of change is to implement it yourself and submit a patch. In such a case, you may still have to be thoughtful and patient in order to get everyone on board. I wish you luck. On Mon, Jul 26, 2010 at 6:51 AM, CordiS wrote: > Thank you for nothing. > > 201

Cassandra Horizontal Scalability

2010-07-26 Thread SSam
From Cassandra Website: * Elastic Read and write throughput both increase linearly as new machines are added, with no downtime or interruption to applications. I am testing TPS with a Cassandra cluster. Initially I tested with a one-node cluster, got 1300 TPS, added another

RE: CRUD test

2010-07-26 Thread Peter Minearo
One last question for me: If I wanted to find all the "Keys" within a SuperColumn Family would I do something like: client.get_slice(KEYSPACE, null, new ColumnParent("Super2"), slicePredicate, ConsistencyLevel.ALL); --OR -- client.get_slice(KEYSPACE, "", new ColumnParent("Super2"), slicePre

Re: Cassandra to store 1 billion small 64KB Blobs

2010-07-26 Thread Michael Widmann
Okay. That really tied a knot in my brain; it twists a little bit now. I have to draw that on the whiteboard to understand it better... but I've seen some very interesting cornerstones in your answer for our project. Really, thanks a lot. mike 2010/7/26 aaron morton > I see, got carried away

Re: SV: How to stop cassandra server, installed from debian/ubuntupackage

2010-07-26 Thread Lee Parker
Which debian/ubuntu packages are you using? I am using the ones that are maintained by Eric Evans and the init.d script stops the server correctly. Lee Parker On Mon, Jul 26, 2010 at 9:22 AM, wrote: > This is how I have been doing it: > pkill cassandra > > then I do a netstat -anp | grep 8080 >

RE: Failing to create a 2 Node cluster on a Windows machine

2010-07-26 Thread Kochheiser,Todd W - TOK-DITT-1
If you are using XP/SP2, I think MS broke support for using anything other than 127.0.0.1 for loopback. Here is the KB article: http://support.microsoft.com/kb/884020 I've only tested Cassandra clusters using multiple Windows machines. The Cassandra documentation on setting up a cluster impli

Re: SV: How to stop cassandra server, installed from debian/ubuntupackage

2010-07-26 Thread michelan
This is how I have been doing it: pkill cassandra then I do a netstat -anp | grep 8080 I look for the java service ID running and then kill that java ID, e.g. kill --Original Message-- From: Thorvaldsson Justus To: 'user@cassandra.apache.org' ReplyTo: user@cassandra.apache.org Subject: S

SV: How to stop cassandra server, installed from debian/ubuntu package

2010-07-26 Thread Thorvaldsson Justus
I use standard close, CTRL C, I don't run it as a daemon. Dunno, but I think it works fine =) -Original message- From: o...@notrly.com [mailto:o...@notrly.com] Sent: 26 July 2010 15:52 To: user@cassandra.apache.org Subject: How to stop cassandra server, installed from debian/ubuntu

Re: Can we filter a key or a column name using regular expression?

2010-07-26 Thread Miriam Allalouf
> You can query for things that start with a sub string but specifying a start > value and an empty end value or an end value that is the start value > concatenated with the max ascii character. Then just make multiple calls, > say getting 1000 cols/rows at a time. What do you mean by concatenate
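
One reading of the "start value concatenated with the max ascii character" suggestion, sketched on a sorted Python list rather than a live cluster. The assumptions are that names are plain ASCII and compared character by character, and that "\x7f" (the highest 7-bit ASCII code) serves as the maximum character; `prefix_bounds` and `slice_range` are illustrative names.

```python
from bisect import bisect_left, bisect_right

def prefix_bounds(prefix, max_char="\x7f"):
    """Return (start, end) bounds that bracket every ASCII name beginning with prefix."""
    return prefix, prefix + max_char

def slice_range(sorted_names, start, end):
    """Return the names n with start <= n <= end from a sorted list."""
    lo = bisect_left(sorted_names, start)
    hi = bisect_right(sorted_names, end)
    return sorted_names[lo:hi]

names = sorted(["foo", "foo.bar", "foo.baz", "monkeys"])
start, end = prefix_bounds("foo.")
in_range = slice_range(names, start, end)
```

Here `in_range` contains "foo.bar" and "foo.baz". The bound is only exact when no name continues with a character above the chosen maximum, so non-ASCII names would need a different end value.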

How to stop cassandra server, installed from debian/ubuntu package

2010-07-26 Thread orly
Hi, this might be a dumb question, but I was wondering how do i stop the cassandra server.. I installed it using the debian package, so i start cassandra by running /etc/init.d/cassandra. I looked at the script and tried /etc/init.d/cassandra stop, but it looks like it just tries to start cassandra

Re: Cassandra to store 1 billion small 64KB Blobs

2010-07-26 Thread aaron morton
I see, got carried away thinking about it so here are some thoughts Your access patterns will determine the best storage design, so it's probably not the best solution. I would welcome thoughts from others. => Standard CF: Chunks * key is chunk hash * col named 'data' col value is chunk d

Re: Multiget capabilities

2010-07-26 Thread CordiS
Thank you for nothing. 2010/7/26 aaron morton > There is no way to request data from more than one ColumnFamily. The > general approach is to de-normalise the data so all the information you need > for a query can be returned from a single Column Family. > > I think this applies to both your que

Re: Multiget capabilities

2010-07-26 Thread aaron morton
There is no way to request data from more than one ColumnFamily. The general approach is to de-normalise the data so all the information you need for a query can be returned from a single Column Family. I think this applies to both your questions. Aaron On 26 Jul 2010, at 22:51, CordiS wrot

Cassandra Munin plugin problem

2010-07-26 Thread osishkin osishkin
Hi, I'm trying to use Munin to monitor cassandra. I've seen other people using munin here ,so I hope someone ran into this problem. The default plugins are working, so this is definitely a problem with the cassandra plugin. I keep getting errors such as : Exception in thread "main" java.lang.NoCl

Multiget capabilities

2010-07-26 Thread CordiS
Hello, I am interested in two features that I have not been able to find in the API docs or mailing lists. First of all, is there any way to omit the CF name in ColumnPath or ColumnParent (or, better, enumerate the CFs to be retrieved)? It would be commonly used to fetch all the data of a complex object iden

Re: about cassandra compression

2010-07-26 Thread john xie
thanks 2010/7/26 Ran Tavory > cassandra doesn't compress before storing, no. > It may be beneficial to compress, depending on the size of your data, > network latency, disk size and data compressibility... You'll need to test. > I sometimes compress, depending on data size but it's done in the c

Re: about cassandra compression

2010-07-26 Thread Ran Tavory
cassandra doesn't compress before storing, no. It may be beneficial to compress, depending on the size of your data, network latency, disk size and data compressibility... You'll need to test. I sometimes compress, depending on data size, but it's done in the client. On Mon, Jul 26, 2010 at 1:31 PM

about cassandra compression

2010-07-26 Thread john xie
Does Cassandra compress data before storing it? When I store data, is compression beneficial for reducing the storage space?