Re: UnavailableException with 1 node down and RF=2?

2010-06-30 Thread Jonathan Ellis
Presumably the failure detector generated a false positive for a
second node temporarily

On Wed, Jun 30, 2010 at 10:55 PM, James Golick  wrote:
> Oops. I meant to say that I'm reading with CL.ONE.
>
> J.
>
> Sent from my iPhone.
>
> On 2010-07-01, at 1:39 AM, Benjamin Black  wrote:
>
>> .QUORUM or .ALL (they are the same with RF=2).
>>
>> On Wed, Jun 30, 2010 at 10:22 PM, James Golick  wrote:
>>> 4 nodes, RF=2, 1 node down.
>>> How can I get an UnavailableException in that scenario?
>>> - J.
>



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com


Re: UnavailableException with 1 node down and RF=2?

2010-06-30 Thread James Golick
Oops. I meant to say that I'm reading with CL.ONE. 

J.

Sent from my iPhone.

On 2010-07-01, at 1:39 AM, Benjamin Black  wrote:

> .QUORUM or .ALL (they are the same with RF=2).
> 
> On Wed, Jun 30, 2010 at 10:22 PM, James Golick  wrote:
>> 4 nodes, RF=2, 1 node down.
>> How can I get an UnavailableException in that scenario?
>> - J.


Re: UnavailableException with 1 node down and RF=2?

2010-06-30 Thread Benjamin Black
.QUORUM or .ALL (they are the same with RF=2).

On Wed, Jun 30, 2010 at 10:22 PM, James Golick  wrote:
> 4 nodes, RF=2, 1 node down.
> How can I get an UnavailableException in that scenario?
> - J.


UnavailableException with 1 node down and RF=2?

2010-06-30 Thread James Golick
4 nodes, RF=2, 1 node down.

How can I get an UnavailableException in that scenario?

- J.


Re: RE: live nodes list in ring

2010-06-30 Thread 王一锋

problem solved with a full cluster stop and then start~

Thanks Stephen.
But still wondering the reason


2010-07-01 







发件人: Stephen Hamer 
发送时间: 2010-07-01  08:39:13 
收件人: user@cassandra.apache.org 
抄送: 'huzhonghua'; 'GongJianTao(宫建涛)' 
主题: RE: live nodes list in ring 
 
When this happens to me I have to do a full cluster restart. Even doing a 
rolling restart across the cluster doesn’t seem to fix them, all of the nodes 
need to be stopped at the same time. After bringing everything back up the ring 
is correct.
 
Does anyone know how a cluster gets into this state?
 
Stephen
 
From: aaron morton [mailto:aa...@thelastpickle.com] 
Sent: Wednesday, June 30, 2010 1:42 PM
To: user@cassandra.apache.org
Cc: 'huzhonghua'; 'GongJianTao(宫建涛)'
Subject: Re: live nodes list in ring
 
At start up do you see log lines like this 
 
Gossiper.java (line 576) Node /192.168.34.30 is now part of the cluster
 
Are all the nodes listed? 
 
aaron
On 30 Jun 2010, at 22:50, 王一锋 wrote:



Hi,
 
In a cassandra cluster, when issueing ring command on every nodes, some can 
show all nodes in the cluster but some can only show some other nodes.
All nodes share the same seed list.
And even some of the nodes in the seed list have this problem.
Restarting the problematic nodes won't solve it.
Try closing firewalls with following commands
 
service iptables stop
 
Still won't work.
 
Anyone got a clue?
 
Thanks very much.
 
Yifeng
 


Re: Re: live nodes list in ring

2010-06-30 Thread 王一锋
To me, this happens all in a sudden. No nodes removed nor new nodes joined

2010-07-01 







发件人: Benjamin Black 
发送时间: 2010-07-01  09:55:59 
收件人: user 
抄送: 
主题: Re: live nodes list in ring 
 
Does this happen after you have changed the ring topology, especially
adding nodes?
2010/6/30 Stephen Hamer :
> When this happens to me I have to do a full cluster restart. Even doing a
> rolling restart across the cluster doesn't seem to fix them, all of the
> nodes need to be stopped at the same time. After bringing everything back up
> the ring is correct.
>
>
>
> Does anyone know how a cluster gets into this state?
>
>
>
> Stephen
>
>
>
> From: aaron morton [mailto:aa...@thelastpickle.com]
> Sent: Wednesday, June 30, 2010 1:42 PM
> To: user@cassandra.apache.org
> Cc: 'huzhonghua'; 'GongJianTao(宫建涛)'
> Subject: Re: live nodes list in ring
>
>
>
> At start up do you see log lines like this
>
>
>
> Gossiper.java (line 576) Node /192.168.34.30 is now part of the cluster
>
>
>
> Are all the nodes listed?
>
>
>
> aaron
>
> On 30 Jun 2010, at 22:50, 王一锋 wrote:
>
> Hi,
>
>
>
> In a cassandra cluster, when issueing ring command on every nodes, some can
> show all nodes in the cluster but some can only show some other nodes.
>
> All nodes share the same seed list.
>
> And even some of the nodes in the seed list have this problem.
>
> Restarting the problematic nodes won't solve it.
>
> Try closing firewalls with following commands
>
>
>
> service iptables stop
>
>
>
> Still won't work.
>
>
>
> Anyone got a clue?
>
>
>
> Thanks very much.
>
>
>
> Yifeng
>
>


Re: Re: live nodes list in ring

2010-06-30 Thread 王一锋
Yes, but not all nodes.
Just the part that the restarting node can show on its ring.

2010-07-01 







发件人: aaron morton 
发送时间: 2010-07-01  04:42:56 
收件人: user 
抄送: 'huzhonghua'; 'GongJianTao(宫建涛 )' 
主题: Re: live nodes list in ring 
 
At start up do you see log lines like this 


Gossiper.java (line 576) Node /192.168.34.30 is now part of the cluster


Are all the nodes listed? 


aaron
On 30 Jun 2010, at 22:50, 王一锋 wrote:


Hi,

In a cassandra cluster, when issueing ring command on every nodes, some can 
show all nodes in the cluster but some can only show some other nodes.
All nodes share the same seed list.
And even some of the nodes in the seed list have this problem.
Restarting the problematic nodes won't solve it.
Try closing firewalls with following commands

service iptables stop

Still won't work.

Anyone got a clue?

Thanks very much.

Yifeng


Re: Implementing Counter on Cassandra

2010-06-30 Thread Benjamin Black
ZK is way overkill for counters.  memcache and redis are much better at the job.

On Tue, Jun 29, 2010 at 12:32 PM, Jonathan Shook  wrote:
> Until then, a pragmatic solution, however undesirable, would be to
> only have a single logical thread/task/actor that is allowed to
> read,modify,update. If this doesn't work for your application, then a
> (distributed) lock manager may be used until such time that you can
> take it out. Some are using ZooKeeper for this.
>
>
> On Tue, Jun 29, 2010 at 11:45 AM, Ryan King  wrote:
>> On Tue, Jun 29, 2010 at 9:42 AM, Utku Can Topçu  wrote:
>>> Hey Guys,
>>>
>>> Currently in a project I'm involved in, I need to have some columns holding
>>> incremented data.
>>> The easy approach for implementing a counter with increments is right now as
>>> I figured out is "read -> increment -> insert" however this approach is not
>>> an atomic operation and can easily be corrupted in time.
>>>
>>> Do you have any best practices in implementing an atomic counter on
>>> Cassandra?
>>
>> https://issues.apache.org/jira/browse/CASSANDRA-1072
>>
>


Re: live nodes list in ring

2010-06-30 Thread Benjamin Black
Does this happen after you have changed the ring topology, especially
adding nodes?

2010/6/30 Stephen Hamer :
> When this happens to me I have to do a full cluster restart. Even doing a
> rolling restart across the cluster doesn't seem to fix them, all of the
> nodes need to be stopped at the same time. After bringing everything back up
> the ring is correct.
>
>
>
> Does anyone know how a cluster gets into this state?
>
>
>
> Stephen
>
>
>
> From: aaron morton [mailto:aa...@thelastpickle.com]
> Sent: Wednesday, June 30, 2010 1:42 PM
> To: user@cassandra.apache.org
> Cc: 'huzhonghua'; 'GongJianTao(宫建涛)'
> Subject: Re: live nodes list in ring
>
>
>
> At start up do you see log lines like this
>
>
>
> Gossiper.java (line 576) Node /192.168.34.30 is now part of the cluster
>
>
>
> Are all the nodes listed?
>
>
>
> aaron
>
> On 30 Jun 2010, at 22:50, 王一锋 wrote:
>
> Hi,
>
>
>
> In a cassandra cluster, when issueing ring command on every nodes, some can
> show all nodes in the cluster but some can only show some other nodes.
>
> All nodes share the same seed list.
>
> And even some of the nodes in the seed list have this problem.
>
> Restarting the problematic nodes won't solve it.
>
> Try closing firewalls with following commands
>
>
>
> service iptables stop
>
>
>
> Still won't work.
>
>
>
> Anyone got a clue?
>
>
>
> Thanks very much.
>
>
>
> Yifeng
>
>


RE: live nodes list in ring

2010-06-30 Thread Stephen Hamer
When this happens to me I have to do a full cluster restart. Even doing a 
rolling restart across the cluster doesn’t seem to fix them, all of the nodes 
need to be stopped at the same time. After bringing everything back up the ring 
is correct.

Does anyone know how a cluster gets into this state?

Stephen

From: aaron morton [mailto:aa...@thelastpickle.com]
Sent: Wednesday, June 30, 2010 1:42 PM
To: user@cassandra.apache.org
Cc: 'huzhonghua'; 'GongJianTao(宫建涛)'
Subject: Re: live nodes list in ring

At start up do you see log lines like this

Gossiper.java (line 576) Node /192.168.34.30 is now part of the cluster

Are all the nodes listed?

aaron
On 30 Jun 2010, at 22:50, 王一锋 wrote:


Hi,

In a cassandra cluster, when issueing ring command on every nodes, some can 
show all nodes in the cluster but some can only show some other nodes.
All nodes share the same seed list.
And even some of the nodes in the seed list have this problem.
Restarting the problematic nodes won't solve it.
Try closing firewalls with following commands

service iptables stop

Still won't work.

Anyone got a clue?

Thanks very much.

Yifeng



Re: Cluster-wide pause

2010-06-30 Thread Jonathan Ellis
use a larger heap or throttle your writes

On Wed, Jun 30, 2010 at 12:12 PM, Steve Lihn  wrote:
> Jon,
> I am experimenting writing 8 million rows into Cassandra and also
> experienced some random timeouts, even with 10-second timeout parameter.
> How do I avoid such timeout at all cost? (I.e. At this time, my priority is
> to finish the end-to-end test. Don't want the program to fail at all.)
> This is a one-node server with ms1G and mx2GB. The rest in cassandra.in.sh
> are default I think.
>
> Steve
>
> On Wed, Jun 30, 2010 at 1:58 AM, Jonathan Ellis  wrote:
>>
>> You could be seeing GC pauses. Did you increase the heap size you gave
>> Cassandra, when you increased your VM size?
>>
>> On Tue, Jun 29, 2010 at 11:57 AM, Oren Benjamin 
>> wrote:
>> > Hi all - first timer here.
>> >
>> > I'm experimenting with Cassandra on Rackspace Cloud.  Started with 4GB
>> > nodes and saw read latency spikes while streaming was taking place, so I
>> > increased to 8GB to see if limited memory was the issue.  Now I'm seeing
>> > very strange behavior during any period that writes are taking place.  The
>> > entire (6 node) cluster seems to pause for periods of as much as 5-8 sec.
>> >  By that I mean all the stats (cpu, disk, network IO monitored via dstat)
>> > drop to zero or near zero on all nodes simultaneously.  Does anyone have
>> > experience with Cassandra on Rackspace or any idea what's going on here?
>> >
>> > The pauses are short enough that it's difficult to introspect the
>> > application and determine what it's doing during the pause, but long enough
>> > to cause unacceptable latency for any service built on top of it.
>> >
>> > Any ideas or debugging methods would be greatly appreciated,
>> >
>> >  -- Oren
>>
>>
>>
>> --
>> Jonathan Ellis
>> Project Chair, Apache Cassandra
>> co-founder of Riptano, the source for professional Cassandra support
>> http://riptano.com
>
>



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of Riptano, the source for professional Cassandra support
http://riptano.com


Cassandra Pig Integration

2010-06-30 Thread Mubarak Seyed
I am trying to query data using Pig 0.7.0, when i run
contrib/pig/bin/pig_cassandra -x local filename.pig, i am getting an error

ERROR 2117: Unexpected error when launching map reduce job.

org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to
open iterator for alias topnames
at org.apache.pig.PigServer.openIterator(PigServer.java:521)
at
org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:544)
at
org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:241)
at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:162)
at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:138)
at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:89)
at org.apache.pig.Main.main(Main.java:391)
Caused by: org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1002:
Unable to store alias topnames
at org.apache.pig.PigServer.store(PigServer.java:577)
at org.apache.pig.PigServer.openIterator(PigServer.java:504)
... 6 more
Caused by: org.apache.pig.backend.executionengine.ExecException: ERROR 2117:
Unexpected error when launching map reduce job.
at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:209)
at
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.execute(HExecutionEngine.java:308)
at
org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:835)
at org.apache.pig.PigServer.store(PigServer.java:569)
... 7 more
Caused by: java.lang.RuntimeException: Could not resolve error that occured
when launching map reduce job: java.lang.ExceptionInInitializerError
at
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher$JobControlThreadExceptionHandler.uncaughtException(MapReduceLauncher.java:510)
at java.lang.Thread.dispatchUncaughtException(Thread.java:1855)


My pig file would look like

rows = LOAD 'cassandra://Keyspace1/Standard1' USING CassandraStorage();
cols = FOREACH rows GENERATE flatten($1);
colnames = FOREACH cols GENERATE $0;
namegroups = GROUP colnames BY $0;
namecounts = FOREACH namegroups GENERATE COUNT($1), group;
orderednames = ORDER namecounts BY $0;
topnames = LIMIT orderednames 50;
dump topnames;

Does anyone how to fix this issue?

Thanks,
Mubarak Seyed.


Re: live nodes list in ring

2010-06-30 Thread aaron morton
At start up do you see log lines like this 

Gossiper.java (line 576) Node /192.168.34.30 is now part of the cluster

Are all the nodes listed? 

aaron
On 30 Jun 2010, at 22:50, 王一锋 wrote:

> Hi,
>  
> In a cassandra cluster, when issueing ring command on every nodes, some can 
> show all nodes in the cluster but some can only show some other nodes.
> All nodes share the same seed list.
> And even some of the nodes in the seed list have this problem.
> Restarting the problematic nodes won't solve it.
> Try closing firewalls with following commands
>  
> service iptables stop
>  
> Still won't work.
>  
> Anyone got a clue?
>  
> Thanks very much.
>  
> Yifeng



Re: simple authentication in cascal

2010-06-30 Thread Chris Shorrock
Steve,

Looking at the Cassandra docs it appears as though this shouldn't be
too hard to implement, as it should simply involve adding a login
method to the cascal session template. I'll make a ticket in github to
track this , I hope that I have a little more time in the next few
days to investigate further. Cheers.

Chris

On Wednesday, June 30, 2010, Steve Lihn  wrote:
> Chris,
> What I have in mind is to use SimpleAuthenticator in conjuction with the 
> access.properties and passwd.properties.
> This use case is actually somewhat personal. I have servers in the AWS cloud, 
> during development and testing, I may connect from home or wireless points on 
> the road.
> I don't want to be bothered with opening up the firewall based on the IP I 
> have at that moment. So I am thinking to open up 9160 and use authentication 
> to block unauthorized writes, which may corrupt the database. But I couldn't 
> find any info on how to use SimpleAuthenticator in the scala client.
>
> Steve
>
> On Sat, Jun 26, 2010 at 3:07 PM, Chris Shorrock  wrote:
>
> Hey Steve,
> What type of authentication are you looking to do? Cassandra itself doesn't, 
> as far as I know, support any sort of user setup or authentication so it's on 
> the onus of the application developer to lock things down appropriately.
>
>
> Chris
>
> On Fri, Jun 25, 2010 at 5:42 PM, Steve Lihn  wrote:
>
> Searching how to use simple authentication in cascal. Couldn't find anything 
> in API docs. Does anybody know how to do it?
>
> Thanks,
> Steve
>
>
>
>


Re: Compaction-pool pause -- was: Cluster-wide pause

2010-06-30 Thread Steve Lihn
Jon,
Looks like this compaction process took 360 seconds. What is it and how do I
avoid it?

INFO [COMPACTION-POOL:1] 2010-06-30 14:56:08,667 CompactionManager.java
(line 246) Compacting
[org.apache.cassandra.io.SSTableReader(path='/mnt/itops/cdata/TFO/CurrentHolding-129-Data.db'),org.apache.cassandra.io.SSTableReader(path='/mnt/itops/cdata/TFO/CurrentHolding-150-Data.db'),org.apache.cassandra.io.SSTableReader(path='/mnt/itops/cdata/TFO/CurrentHolding-172-Data.db'),org.apache.cassandra.io.SSTableReader(path='/mnt/itops/cdata/TFO/CurrentHolding-194-Data.db')]
 INFO [COMPACTION-POOL:1] 2010-06-30 15:02:09,537 CompactionManager.java
(line 326) Compacted to /mnt/itops/cdata/TFO/CurrentHolding-197-Data.db.
1880884459/1880884459 bytes for 6306383 keys.  Time: 360870ms.

On Wed, Jun 30, 2010 at 3:12 PM, Steve Lihn  wrote:

> Jon,
> I am experimenting writing 8 million rows into Cassandra and also
> experienced some random timeouts, even with 10-second timeout parameter.
> How do I avoid such timeout at all cost? (I.e. At this time, my priority is
> to finish the end-to-end test. Don't want the program to fail at all.)
> This is a one-node server with ms1G and mx2GB. The rest in cassandra.in.share 
> default I think.
>
> Steve
>
>
>


Re: Cluster-wide pause

2010-06-30 Thread Steve Lihn
Jon,
I am experimenting writing 8 million rows into Cassandra and also
experienced some random timeouts, even with 10-second timeout parameter.
How do I avoid such timeout at all cost? (I.e. At this time, my priority is
to finish the end-to-end test. Don't want the program to fail at all.)
This is a one-node server with ms1G and mx2GB. The rest in
cassandra.in.share default I think.

Steve

On Wed, Jun 30, 2010 at 1:58 AM, Jonathan Ellis  wrote:

> You could be seeing GC pauses. Did you increase the heap size you gave
> Cassandra, when you increased your VM size?
>
> On Tue, Jun 29, 2010 at 11:57 AM, Oren Benjamin 
> wrote:
> > Hi all - first timer here.
> >
> > I'm experimenting with Cassandra on Rackspace Cloud.  Started with 4GB
> nodes and saw read latency spikes while streaming was taking place, so I
> increased to 8GB to see if limited memory was the issue.  Now I'm seeing
> very strange behavior during any period that writes are taking place.  The
> entire (6 node) cluster seems to pause for periods of as much as 5-8 sec.
>  By that I mean all the stats (cpu, disk, network IO monitored via dstat)
> drop to zero or near zero on all nodes simultaneously.  Does anyone have
> experience with Cassandra on Rackspace or any idea what's going on here?
> >
> > The pauses are short enough that it's difficult to introspect the
> application and determine what it's doing during the pause, but long enough
> to cause unacceptable latency for any service built on top of it.
> >
> > Any ideas or debugging methods would be greatly appreciated,
> >
> >  -- Oren
>
>
>
> --
> Jonathan Ellis
> Project Chair, Apache Cassandra
> co-founder of Riptano, the source for professional Cassandra support
> http://riptano.com
>


Re: simple authentication in cascal

2010-06-30 Thread Steve Lihn
Chris,
What I have in mind is to use SimpleAuthenticator in conjuction with the
access.properties and passwd.properties.
This use case is actually somewhat personal. I have servers in the AWS
cloud, during development and testing, I may connect from home or wireless
points on the road.
I don't want to be bothered with opening up the firewall based on the IP I
have at that moment. So I am thinking to open up 9160 and use authentication
to block unauthorized writes, which may corrupt the database. But I couldn't
find any info on how to use SimpleAuthenticator in the scala client.

Steve

On Sat, Jun 26, 2010 at 3:07 PM, Chris Shorrock wrote:

> Hey Steve,
>
> What type of authentication are you looking to do? Cassandra itself
> doesn't, as far as I know, support any sort of user setup or authentication
> so it's on the onus of the application developer to lock things down
> appropriately.
>
> Chris
>
>
> On Fri, Jun 25, 2010 at 5:42 PM, Steve Lihn  wrote:
>
>> Searching how to use simple authentication in cascal. Couldn't find
>> anything in API docs. Does anybody know how to do it?
>>
>> Thanks,
>> Steve
>>
>
>


live nodes list in ring

2010-06-30 Thread 王一锋
Hi, 

In a cassandra cluster, when issueing ring command on every nodes, some can 
show all nodes in the cluster but some can only show some other nodes.
All nodes share the same seed list.
And even some of the nodes in the seed list have this problem.
Restarting the problematic nodes won't solve it.
Try closing firewalls with following commands 

service iptables stop 

Still won't work.

Anyone got a clue?

Thanks very much.

Yifeng