Re: is it my cassandra cluster ok?

2010-08-25 Thread Benjamin Black
No, it means manually assign tokens to evenly distribute ring range to the existing nodes. On Wed, Aug 25, 2010 at 7:29 PM, john xie wrote: > load balancing? is it means add more nodes? > > > 2010/8/26 Ryan King >> >> Looks like you need to do some load balancing. >> >> -ryan >> >> On Wed, Aug

Re: too many open files 0.7.0 beta1

2010-08-25 Thread Aaron Morton
That looks like it. I've pushed the limits up to 65k and turned down the testing for now. Otherwise machines were dropping like flies. Thanks. AaronOn 26 Aug, 2010,at 04:16 PM, Dan Washusen wrote:Maybe you're seeing this: https://issues.apache.org/jira/browse/CASSANDRA-1416On Thu, Aug 26, 2010 at

Ordered Partitioner load balance problem

2010-08-25 Thread Edward Evans
I am currently using Cassandra 0.6.2 on four virtual nodes in two different data centers (A, B). My initial testing used the Random Partitioner and everything behaved as expected. I moved to the Ordered Partitioner using SHA256 hashes as the keys and subsequently these are the tokens (If the storie

unsubsribe

2010-08-25 Thread Ming Fang

Re: too many open files 0.7.0 beta1

2010-08-25 Thread Dan Washusen
Maybe you're seeing this: https://issues.apache.org/jira/browse/CASSANDRA-1416 On Thu, Aug 26, 2010 at 2:05 PM, Aaron Morton wrote: > Under 0.7.0 beta1 am seeing cassandra run out of files handles... > > Caused by: java.io.FileNotFoundException: /local1/junkbox/cassandra/data/ > junkbox.wetafx.co

too many open files 0.7.0 beta1

2010-08-25 Thread Aaron Morton
Under 0.7.0 beta1 am seeing cassandra run out of files handles...Caused by: java.io.FileNotFoundException: /local1/junkbox/cassandra/data/junkbox.wetafx.co.nz/ObjectIndex-e-31-Index.db (Too many open files)        at java.ioRandomAccessFile.open(Native Method)        at java.io.RandomAccessFile.(Ra

Re: 0.7.0.bet1 errors during start up

2010-08-25 Thread Jonathan Ellis
Yes, please open a ticket for the assertion error. (Once JIRA is back up...) On Wed, Aug 25, 2010 at 10:46 PM, Aaron Morton wrote: > yes, starting the others made things a bit happier. > Any thoughts on the assertion error that cause the startup to fail ? I've > seen it a couple of times. > Seem

Re: 0.7.0.bet1 errors during start up

2010-08-25 Thread Aaron Morton
yes, starting the others made things a bit happier. Any thoughts on the assertion error that cause the startup to fail ? I've seen it a couple of times. Seems to be from this line in CommitLogHeader.java157:             assert clHeader.cfDirtiedAt.size() <= clHeader.cfCount;ThanksAaronOn 26 Aug, 20

Re: 0.7.0.bet1 errors during start up

2010-08-25 Thread Jonathan Ellis
the one node you restarted thinks it's the only node in the cluster. starting the others will fix that. On Wed, Aug 25, 2010 at 10:10 PM, Aaron Morton wrote: > 0.7.0-bet1 4 node clustered, i'd managed to get it into some sort of awful > state (i think by accidently creating to many clients, it wa

0.7.0.bet1 errors during start up

2010-08-25 Thread Aaron Morton
0.7.0-bet1 4 node clustered, i'd managed to get it into some sort of awful state (i think by accidently creating to many clients, it was also complaining about running out of file handles). Anyway I killed it all and restarted just one node, thought I would let it settle down then start the others.

Get an error with "Uncaught exception in thread Thread[HINTED-HANDOFF-POOL:1,5,main]", please help me to solve it, thanks~

2010-08-25 Thread Huang Stanley
Hi all: I use Cassandra 0.7 beta, and I create two Cassandra nodes in a cluster. When second node try to join the cluster, I got the error message as below. Can someone advise me how to solve this problem. Thanks for your kindly assistance. regards, Stanley Huang ==

Re: is it my cassandra cluster ok?

2010-08-25 Thread john xie
load balancing? is it means add more nodes? 2010/8/26 Ryan King > Looks like you need to do some load balancing. > > -ryan > > On Wed, Aug 25, 2010 at 12:33 AM, john xie wrote: > > /opt/apache-cassandra-0.6.4/bin/nodetool --host 192.168.123.100 ring > > Address Status Load

Re: Follow-up post on cassandra configuration with some experiments on GC tuning

2010-08-25 Thread Edward Capriolo
On Tue, Aug 24, 2010 at 11:29 AM, Mikio Braun wrote: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA1 > > Dear all, > > thanks again for all the comments I got on my last post. I've played a > bit with different GC settings and got my Cassandra instance to run > very nicely with 8GB of heap. > >

Re: SEO friendly pagination

2010-08-25 Thread Ryan King
On Wed, Aug 25, 2010 at 11:20 AM, Petr Odut wrote: > Hi, > I've read about pagination in cassandra. My current implementation is > get_range_slices with startKey = lastKey + 1, but I need to get the > specified page directly. Is it any chance to do this? > > If you look at twitter, it has direct p

Re: get_slice slow

2010-08-25 Thread B. Todd Burruss
i thought about doing that be is obviously a bit more complicated. thx for confirming the problem. On 08/25/2010 09:58 AM, Jonathan Ellis wrote: in many cases, especially "give me the first column", slicing is faster -- lots of tombstones around is one case where it might not be. if you can

SEO friendly pagination

2010-08-25 Thread Petr Odut
Hi, I've read about pagination in cassandra. My current implementation is get_range_slices with startKey = lastKey + 1, but I need to get the specified page directly. Is it any chance to do this? If you look at twitter, it has direct pagination too: http://twitter.com/PetrOdut?page=1 http://twitte

Re: Get Range Slices is broken on 0.6.4 ??

2010-08-25 Thread Jeremy Hanna
if you look at the email that thread that I linked, there is a link to the artifacts under consideration in the vote. However if you got your 0.6 branch code from http://svn.apache.org/repos/asf/cassandra/branches/cassandra-0.6/ and you still see the error, then there's probably something else

Re: is it my cassandra cluster ok?

2010-08-25 Thread Ryan King
Looks like you need to do some load balancing. -ryan On Wed, Aug 25, 2010 at 12:33 AM, john xie wrote: > /opt/apache-cassandra-0.6.4/bin/nodetool --host 192.168.123.100 ring > Address       Status     Load          Range >      Ring > > 162027259805094200094770502377853667196 > 192.168.123.101Up

Feature Request: multiget_slice(s)

2010-08-25 Thread Jeremy Davis
Currently multiget_slice allows you to specify multiple Keys but only one slice. In my specific scenario it becomes difficult/impossible to iterate across the data set unless I can also specify the slice per key. This is because if one of the Key's doesn't have the same amount of data, then the con

Re: Get Range Slices is broken on 0.6.4 ??

2010-08-25 Thread Moleza Moleza
Thanks JH, Where is the code for 0.6.5? Is it accessible publicly? In subversion, is it on /cassandra-0.6 branch? (cassandra/branches/cassandra-0.6) Or, is there another place where you guys keep the 0.6.5 codebase (in which case I will have to wait for it). I tested with the 0.6-branch but the bug

Re: get_slice slow

2010-08-25 Thread Jonathan Ellis
in many cases, especially "give me the first column", slicing is faster -- lots of tombstones around is one case where it might not be. if you can reduce the tombstone volume, say by switching to a new row every 5 minutes, that would help a lot. On Wed, Aug 25, 2010 at 11:43 AM, B. Todd Burruss

Re: Cassandra and Lucene

2010-08-25 Thread Benjamin Black
Please put your storage-conf.xml and cassandra.in.sh files on pastie/dpaste/gist and send the link. (moving it back to the user list again) On Sun, Jul 25, 2010 at 11:51 PM, Michelan Arendse wrote: > I have 2 seeds in my cluster, with a replication of 2. I am using cassandra > 0.6.2. > > It keep

Re: get_slice slow

2010-08-25 Thread B. Todd Burruss
i did check sstables, and there are only three. i haven't done any major compacts. do u think it is taking so long because it must sift thru the deleted columns before compaction? so accessing a column by name instead of slice predicate is faster? On 08/24/2010 11:23 PM, Benjamin Black wro

Re: Get Range Slices is broken on 0.6.4 ??

2010-08-25 Thread Jeremy Hanna
0.6.5's release is being voted on now - http://www.mail-archive.com/d...@cassandra.apache.org/msg00788.html - so if all goes well, it will be out in a couple of days. On Aug 25, 2010, at 9:04 AM, Moleza Moleza wrote: > That was the Fastest Response ever (about 10 seconds). > When is 0.6.5 being

Re: Does the scan speed with CL.ALL is faster than CL.QUORUM and CL.ONE?

2010-08-25 Thread Benjamin Black
Did you run the tests in this order without changing anything but CL? You may be seeing the effects of OS page caching. Run then in the reverse order and see if the difference persists. On Tue, Aug 24, 2010 at 11:52 PM, ring_ayumi_king wrote: > Hi all, > > I ran my benchmark(OPP via get_range_sl

Re: Get Range Slices is broken on 0.6.4 ??

2010-08-25 Thread Moleza Moleza
That was the Fastest Response ever (about 10 seconds). When is 0.6.5 being released? Any ETA? Where is 0.6.5? isn't it in the subversion /cassandra-0.6 branch? I did checkout that code yesterday and built it successfully. Still it did not work, we had to revert back to 0.6.3. Those bugs you mention

Job opening cassandra Barcelona, Spain

2010-08-25 Thread Dimitry Lvovsky
Hi All, Please forgive the job offer spam. We're looking to add a developer with experience using Cassandra, to join our team in Barcelona. An ideal candidate will have a strong CS background (academic or otherwise) with high level Java skills and experience programing in Scala. Knowing y

Re: Get Range Slices is broken on 0.6.4 ??

2010-08-25 Thread Jonathan Ellis
yes. known bug, fixed in 0.6.5 (CASSANDRA-1145, CASSANDRA-1042) On Wed, Aug 25, 2010 at 8:53 AM, Moleza Moleza wrote: > HI, > We just recently tried to use 0.6.4 in our production environment and > had some serious problem. > The getRangeSlices functionality is broken. > We have a cluster of 5 m

Get Range Slices is broken on 0.6.4 ??

2010-08-25 Thread Moleza Moleza
HI, We just recently tried to use 0.6.4 in our production environment and had some serious problem. The getRangeSlices functionality is broken. We have a cluster of 5 machines. We use getRangeSlices to iterate over all of the keys in a cf (2062 keys total). We are using OrderPreservingPartitioner.

Re: java.util.NoSuchElementException

2010-08-25 Thread Jonathan Ellis
yes, you should create a ticket On Wed, Aug 25, 2010 at 2:52 AM, Aaron Morton wrote: > I'm running the v0.7-beta1 in a 4 nodes cluster and just doing some simple > testing. One of the nodes had been down (machine off, unclean shutdown) for > an hour or so not sure how many writes were going on, w

Re: get_slice slow

2010-08-25 Thread Jonathan Ellis
Cassandra doesn't deserialize entire rows to read a single column. On Tue, Aug 24, 2010 at 11:25 PM, Artie Copeland wrote: > Have you tried using a super column, it seems that having a row with over > 100K columns and growing would be alot for cassandra to deserialize?  what > is iostat and jmete

Re: Job Opportunity in Europe (Nosql, hadoop, crawling)

2010-08-25 Thread Thibaut Britz
Hello Raddi, If you are interested, please send us your resume to the email address mentioned at the blog post. If we can provide you the environment to work on this only depends only on your qualifications and skills. Thanks, Thibaut On Fri, Aug 20, 2010 at 4:55 AM, sharanabasava raddi wrote:

java.util.NoSuchElementException

2010-08-25 Thread Aaron Morton
I'm running the v0.7-beta1 in a 4 nodes cluster and just doing some simple testing. One of the nodes had been down (machine off, unclean shutdown) for an hour or so not sure how many writes were going on, when I bought it back up this message appears in the other 3 nodes...INFO [GOSSIP_STAGE:1] 201

is it my cassandra cluster ok?

2010-08-25 Thread john xie
/opt/apache-cassandra-0.6.4/bin/nodetool --host 192.168.123.100 ring Address Status Load Range Ring 162027259805094200094770502377853667196 192.168.123.101Up 183.43 GB 26404162423947656621914545677405489813 |<--| 192.168.123.5 Up 196.18 GB 97646479029625